I'm currently doing a POC on ES and am new to ES.
I would like to find out more about linguistic support in ES. For example, will it be
able to do language detection? Entity detection?
What linguistic support is available out of the box, and what plugins are out
there?
As far as I know, there is nothing ES-specific here. For example, we have
a generic, trainable Language Identifier as well as a Key Phrase Extractor,
and we've used them with both Solr and ES; they are not tied to either one.
Similarly, you could grab OpenNLP and use it to do NER (named entity
recognition). This is typically done in a pipeline of components that
process/prepare documents for indexing, and indexing to ES would just end up
being the last component in the pipeline.
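To make the pipeline idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the stage names, the toy stopword-based language guess, and the capitalized-word "entity" regex are stand-ins for a real language identifier and a real NER model such as OpenNLP, and the final stage appends to a plain list instead of calling ES.

```python
import re

# Toy stopword lists; a stand-in for a real, trained language identifier.
STOPWORDS = {
    "en": {"the", "and", "is", "in", "of", "to"},
    "de": {"der", "die", "und", "ist", "das", "nicht"},
}

def detect_language(doc):
    """Hypothetical stage: guess the language by counting stopword hits."""
    words = set(re.findall(r"[a-zäöüß]+", doc["text"].lower()))
    scores = {lang: len(words & stops) for lang, stops in STOPWORDS.items()}
    best = max(scores, key=scores.get)
    doc["language"] = best if scores[best] > 0 else "unknown"
    return doc

def extract_entities(doc):
    """Hypothetical stage: treat runs of capitalized words as entities.
    A real pipeline would call an NER model (for example OpenNLP) here."""
    doc["entities"] = re.findall(r"[A-Z][a-z]+(?: [A-Z][a-z]+)+", doc["text"])
    return doc

def index_document(doc, sink):
    """Last stage: hand the enriched document to the indexer.
    `sink` is a plain list here; in practice this would be an ES client call."""
    sink.append(doc)
    return doc

def run_pipeline(doc, stages):
    """Pass the document through each stage in order."""
    for stage in stages:
        doc = stage(doc)
    return doc

indexed = []
stages = [detect_language, extract_entities, lambda d: index_document(d, indexed)]
result = run_pipeline({"id": "1", "text": "The office of Neal Ang is in New York."}, stages)
```

The point of the shape is that each enrichment step only adds fields to the document, so the indexing step stays a thin last stage and the same pipeline could feed Solr or ES.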
On Tuesday, May 14, 2013 11:05:09 PM UTC-4, Neal Ang wrote:
Hi All,
I'm currently doing a POC on ES and am new to ES.
I would like to find out more about linguistic support in ES. For example, will it be
able to do language detection? Entity detection?
What linguistic support is available out of the box, and what plugins are out
there?
On Monday, 20 May 2013 09:36:24 UTC+8, Otis Gospodnetic wrote:
Hi,
As far as I know, there is nothing ES-specific here. For example, we have
a generic, trainable Language Identifier as well as a Key Phrase Extractor
and we've used them with both Solr and ES - they are generic. Similarly,
you could grab OpenNLP and use it to do NER. This is typically done in a
pipeline of components that process/prepare documents for indexing, and
indexing to ES would just end up being the last component in the pipeline.
Sorry, didn't see your email from May 23. Nothing in ES can do this
automatically for you, AFAIK. You have to use OpenNLP to do the
extraction and then explicitly store the extracted entities in a field
of your ES index.
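As a rough illustration of "explicitly store the extracted entity in a field", the sketch below serializes documents whose entities were already extracted elsewhere into the newline-delimited JSON that the ES `_bulk` endpoint accepts. The function name `to_bulk_body`, the index name `poc-docs`, and the field name `entities` are made up for this example, and nothing is sent over the network. Older Elasticsearch releases also expected a `_type` in the action line, which is omitted here.

```python
import json

def to_bulk_body(docs, index_name):
    """Serialize docs into newline-delimited JSON for the ES _bulk endpoint.

    Each document becomes two lines: an action line and a source line.
    The extracted entities go into an ordinary field named "entities".
    """
    lines = []
    for doc in docs:
        action = {"index": {"_index": index_name, "_id": doc["id"]}}
        source = {"text": doc["text"], "entities": doc["entities"]}
        lines.append(json.dumps(action))
        lines.append(json.dumps(source))
    # The bulk body must end with a newline.
    return "\n".join(lines) + "\n"

# Entities are assumed to come from an external extraction step.
docs = [
    {"id": "1", "text": "Neal Ang asked about Elasticsearch.", "entities": ["Neal Ang"]},
]
body = to_bulk_body(docs, "poc-docs")
```

Once the entities sit in their own field, they can be searched or aggregated like any other field, which is all ES needs to know about them.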