How to index stop words AND special characters using standard analyzer


(sowjanya) #1

Hi,
I need to index stop words AND Special characters with the features of
Standard Analyzer.

I have the mapping like this
{
"abc_template" : {
"template" : "*",
"order": 0,
"settings" :{

},
"mappings" : {
"default" : {
"_source" : {"enabled" : true },
"_all" : {"enabled" : false },
"_timestamp" : {"enabled" : true, "store" : "yes"},
"abc_record_id" : {"type" : "string", "store" : "yes", "index" :
"not_analyzed"},
"abc_object_name" : {"type" : "string", "store" : "yes", "index" :
"not_analyzed"},
"abc_field_name" : {"type" : "string", "store" : "yes", "index" :
"not_analyzed"}
}
}
}
}

And it automatically considering "text" as default field. We need to index
stop words and special characters for field "text".

Please help me.

Thanks In Advance.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/79176818-bc80-41ec-99e4-0b83616c70af%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Ivan Brusic) #2

You would need to create a custom analyzer (using the elasticsearch syntax)
that is identical to the standard analyzer, but without the stop filter and
extra analysis.

Here is the code for the standard analyzer:
http://grepcode.com/file/repo1.maven.org/maven2/org.apache.lucene/lucene-analyzers-common/4.3.1/org/apache/lucene/analysis/standard/StandardAnalyzer.java#110

IIRC, you should be fine with the standard tokenizer and we just need to
remove the standard and stop filter. Just create a custom analyzer and use
the analysis API to debug it.

Cheers,

Ivan

On Wed, Dec 11, 2013 at 12:07 AM, Elastic Sowjanya <sowjanyachalla@gmail.com

wrote:

Hi,
I need to index stop words AND Special characters with the features of
Standard Analyzer.

I have the mapping like this
{
"abc_template" : {
"template" : "*",
"order": 0,
"settings" :{

},
"mappings" : {
"default" : {
"_source" : {"enabled" : true },
"_all" : {"enabled" : false },
"_timestamp" : {"enabled" : true, "store" : "yes"},
"abc_record_id" : {"type" : "string", "store" : "yes", "index" :
"not_analyzed"},
"abc_object_name" : {"type" : "string", "store" : "yes", "index" :
"not_analyzed"},
"abc_field_name" : {"type" : "string", "store" : "yes", "index" :
"not_analyzed"}
}
}
}
}

And it automatically considering "text" as default field. We need to
index stop words and special characters for field "text".

Please help me.

Thanks In Advance.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/79176818-bc80-41ec-99e4-0b83616c70af%40googlegroups.com
.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CALY%3DcQAixify%3DF63UvwMgUtee9kni6F0C52Qam6%2BN-5zrLp2rA%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(system) #3