Sorting and querying semantic version strings

Jeff_Potts · November 13, 2014, 3:52pm

I have a property called "version" that will have semantic version strings
like "1.1.0", "10.0.1", and "2.0.0". There will always be three integers
separated by periods and no other characters. I need to be able to sort my
docs by version and do boolean comparisons (greater than, less than) between
versions. For example, using the values above I need the ascending order to
be "1.1.0", "2.0.0", "10.0.1".
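
To make the requirement concrete, here is a small Python sketch (not part of the gist; the name version_key is just for illustration) showing how a plain string sort differs from the numeric order I'm after:

```python
def version_key(version):
    """Turn "10.0.1" into (10, 0, 1) so comparisons are numeric, not textual."""
    major, minor, patch = version.split(".")
    return (int(major), int(minor), int(patch))

versions = ["1.1.0", "10.0.1", "2.0.0"]

# Plain string comparison puts "10.0.1" before "2.0.0".
lexicographic = sorted(versions)

# Comparing the integer parts gives the order I actually want.
numeric = sorted(versions, key=version_key)

print(lexicographic)
print(numeric)
```

The second list is the ordering the index needs to reproduce.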

My approach so far, documented in this gist
https://gist.github.com/jpotts/375e3c8894f93648995d, is to use a Groovy
script to normalize the string at index time using a transform. The script
pads each element in the string with spaces to a length of eight and stores
the result in a field (version.sortable). So "1.1.0" would become
"       1       1       0" while "10.0.1" would become
"      10       0       1".
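
For anyone who doesn't want to open the gist, the idea can be sketched roughly like this. This is a Python stand-in, not the actual Groovy transform, and it assumes the padded parts are left-padded and concatenated with no separator; the function name sortable is made up for the example.

```python
def sortable(version, width=8):
    """Left-pad each part with spaces to a fixed width and join the parts.

    Assumption: parts are right-aligned and joined directly, so equal-width
    columns line up and string order matches numeric order.
    """
    return "".join(part.rjust(width) for part in version.split("."))

versions = ["1.1.0", "10.0.1", "2.0.0"]

# Sorting on the padded form gives the numeric order without parsing integers.
ordered = sorted(versions, key=sortable)

print(repr(sortable("1.1.0")))
print(ordered)
```

This works because a space sorts before any digit, so a shorter number always compares lower than a longer one in the same column. It only holds while every part stays within eight digits.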

As I put in the gist, this works, but I suspect there is a better way to
handle this. One drawback of my current approach is that queries have to
use the normalized term. I'd much rather use the natural format and have
the normalization be invisible to the front-end.
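
One stopgap would be to hide the normalization in a thin client-side helper, so callers pass natural versions and only the helper knows about the padded form. A rough Python sketch follows; the helper names are hypothetical, the padding mirrors my description above, and it assumes version.sortable is indexed as a single unanalyzed term so a range query compares whole strings.

```python
def sortable(version, width=8):
    # Same padding idea as the index-time transform: right-align each part.
    return "".join(part.rjust(width) for part in version.split("."))

def version_range_query(gte=None, lt=None, field="version.sortable"):
    """Build a range query body from natural version strings."""
    bounds = {}
    if gte is not None:
        bounds["gte"] = sortable(gte)
    if lt is not None:
        bounds["lt"] = sortable(lt)
    return {"range": {field: bounds}}

# Callers never see the padded terms.
query = version_range_query(gte="2.0.0", lt="10.0.1")
print(query)
```

That still pushes the logic into every client, which is why I'd prefer something on the Elasticsearch side.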

I suspect I could handle this with a custom analyzer but I'm not exactly
sure where to start with that.

Any suggestions appreciated.

Jeff

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f708ffff-e46c-4447-bd26-808fbbf20211%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

jprante · November 13, 2014, 7:52pm

You can use my plugin if you want to sort on such fields:

https://github.com/jprante/elasticsearch-analysis-naturalsort/blob/master/src/test/java/org/xbib/elasticsearch/naturalsort/NaturalSortKeyTests.java#L164

I haven't tested search yet, though.

Jörg
