[ANN] Denormalization plugin for Elasticsearch analysis


(Jörg Prante) #1

Hi,

I just released this plugin

It uses a reference mechanism for including content from other documents in
the cluster during the analysis field mapping phase.

This process is also known as denormalization.

Denormalization can be defined as the copying of the same data into
multiple documents in order to simplify query processing or to fit the
user’s data into a particular data model.

See the example at the github page how to create a library catalog entry
for the book "Goethe's Faust" and making it searchable by referencing
variant forms of the author's name at indexing time.

As always, all feedback is welcome!

Best,

Jörg

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAKdsXoGWSTZGEJZkySM_%3DabY1nnu7sdY_JrSEtbTj2tH_nXewg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(Ivan Brusic) #2

Extremely interesting. Should be ideal for a fixed set of terms. I am still
hoping for Matt Weber's pull request to be merged (
https://github.com/elasticsearch/elasticsearch/pull/3278) since I would
like to denormalize against a dynamic set of terms, but your approach would
be more efficient if handled at index time.

Pretty cool,

Ivan

On Tue, Jul 1, 2014 at 3:25 PM, joergprante@gmail.com <joergprante@gmail.com

wrote:

Hi,

I just released this plugin

https://github.com/jprante/elasticsearch-analysis-reference

It uses a reference mechanism for including content from other documents
in the cluster during the analysis field mapping phase.

This process is also known as denormalization.

Denormalization can be defined as the copying of the same data into
multiple documents in order to simplify query processing or to fit the
user’s data into a particular data model.

See the example at the github page how to create a library catalog entry
for the book "Goethe's Faust" and making it searchable by referencing
variant forms of the author's name at indexing time.

As always, all feedback is welcome!

Best,

Jörg

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoGWSTZGEJZkySM_%3DabY1nnu7sdY_JrSEtbTj2tH_nXewg%40mail.gmail.com
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoGWSTZGEJZkySM_%3DabY1nnu7sdY_JrSEtbTj2tH_nXewg%40mail.gmail.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CALY%3DcQD1fTCn4_xK-H6%2BQurRQ-1ZLvwvti0mMpmVwYhP7Kqy_w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(system) #3