The only changes I made to my indexes when migrating from ES 2.4.4 to ES 5.3.0 are the following mapping upgrades (the rest of the fields are compatible with 5.x):
{ x: string, index: not_analyzed } => { x: keyword }
{ x: string, term_vector: yes } => { x: text, term_vector: yes }
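To make the two rewrites concrete, here is a rough Python sketch of the same conversion. The helper name `upgrade_field` is hypothetical, and it only covers the two cases listed above (a `not_analyzed` string becomes `keyword`, any other string becomes `text`); other 2.x settings such as `index: no` are not handled.

```python
def upgrade_field(field):
    """Rewrite one 2.x field mapping (a dict) into its 5.x form.

    Assumption: only the two cases from the post are handled.
    """
    if field.get("type") != "string":
        # Non-string fields are already compatible with 5.x; copy as-is.
        return dict(field)

    # Carry over every setting except the ones the new type replaces.
    upgraded = {k: v for k, v in field.items() if k not in ("type", "index")}

    if field.get("index") == "not_analyzed":
        upgraded["type"] = "keyword"  # exact-value field
    else:
        upgraded["type"] = "text"     # analyzed full-text field
    return upgraded


print(upgrade_field({"type": "string", "index": "not_analyzed"}))
print(upgrade_field({"type": "string", "term_vector": "yes"}))
```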
After reindexing from scratch, I got the following stats:
dataset1 = 560,392 docs
ES 2.4.4 index size = 99.7M
ES 5.3.0 index size = 104M
dataset2 = 2,583,604 docs
ES 2.4.4 index size = 623M
ES 5.3.0 index size = 662M
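For scale, the relative growth can be worked out with a few lines of Python. This is only a sketch: it assumes "M" means megabytes in both reports, and `growth_percent` is a name made up for illustration.

```python
def growth_percent(old_mb, new_mb):
    """Relative size increase from old to new, in percent."""
    return (new_mb - old_mb) / old_mb * 100.0


# Sizes as reported above: (ES 2.4.4, ES 5.3.0), assumed to be in MB.
sizes = {
    "dataset1": (99.7, 104.0),
    "dataset2": (623.0, 662.0),
}

for name, (old, new) in sizes.items():
    print(f"{name}: {growth_percent(old, new):.1f}% larger on 5.3.0")
# dataset1: 4.3% larger on 5.3.0
# dataset2: 6.3% larger on 5.3.0
```

So the 5.3.0 indexes are roughly 4 to 6 percent larger for these two datasets.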
Is it a general rule that a 5.x index is larger than a 2.x index for the same docs?
Maybe it matters: 2.4.4 was installed from the official deb repository, while 5.3.0 runs from the official Docker image (I mention this in case their default configuration differs).