How is data stored

Hello, I'm new in using elasticsearch, so maybe this is a basic question...

Is there any way to see how is the text "stored" or at least how it would
look once the filters defined for a field (in the analyzer) are applied?

I know that the actual field is stored "as is", and the filters are
considered for indexing purposes, but i want to see what is the result of
that operation.

Why do I ask this, the reason is that I'm applying filters to a field (for
example removing urls), and performing aggregations on that field, but the
aggregations return "http" as a used word, and guess that is not correct,
as it shall be removed...

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/841f7899-a45f-4b19-bc05-6e156f76777a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Use the analyze API to get a view into how your analysis chain (tokenizer
and filters) affect text.

The index itself is all jumbled together with all the documents and there
isn't a good way to dig the data for a single document out of it.
On Dec 31, 2014 10:36 PM, "Bruno Kamiche" bkamiche@gmail.com wrote:

Hello, I'm new in using elasticsearch, so maybe this is a basic question...

Is there any way to see how is the text "stored" or at least how it would
look once the filters defined for a field (in the analyzer) are applied?

I know that the actual field is stored "as is", and the filters are
considered for indexing purposes, but i want to see what is the result of
that operation.

Why do I ask this, the reason is that I'm applying filters to a field (for
example removing urls), and performing aggregations on that field, but the
aggregations return "http" as a used word, and guess that is not correct,
as it shall be removed...

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/841f7899-a45f-4b19-bc05-6e156f76777a%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/841f7899-a45f-4b19-bc05-6e156f76777a%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAPmjWd0O%3D3YV17kxuRVxYLX3na-CXYFFhW28wtDveNb2LDutSQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Beside analyze API, the explain API in the query can tell why a document
has been included into the result set, and allows conclusions about the
terms how they are stored in the index.

http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-explain.html

Jörg

On Thu, Jan 1, 2015 at 5:26 AM, Nikolas Everett nik9000@gmail.com wrote:

Use the analyze API to get a view into how your analysis chain (tokenizer
and filters) affect text.

The index itself is all jumbled together with all the documents and there
isn't a good way to dig the data for a single document out of it.
On Dec 31, 2014 10:36 PM, "Bruno Kamiche" bkamiche@gmail.com wrote:

Hello, I'm new in using elasticsearch, so maybe this is a basic
question...

Is there any way to see how is the text "stored" or at least how it would
look once the filters defined for a field (in the analyzer) are applied?

I know that the actual field is stored "as is", and the filters are
considered for indexing purposes, but i want to see what is the result of
that operation.

Why do I ask this, the reason is that I'm applying filters to a field
(for example removing urls), and performing aggregations on that field, but
the aggregations return "http" as a used word, and guess that is not
correct, as it shall be removed...

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/841f7899-a45f-4b19-bc05-6e156f76777a%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/841f7899-a45f-4b19-bc05-6e156f76777a%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAPmjWd0O%3D3YV17kxuRVxYLX3na-CXYFFhW28wtDveNb2LDutSQ%40mail.gmail.com
https://groups.google.com/d/msgid/elasticsearch/CAPmjWd0O%3D3YV17kxuRVxYLX3na-CXYFFhW28wtDveNb2LDutSQ%40mail.gmail.com?utm_medium=email&utm_source=footer
.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAKdsXoHEQO1jf-8xxwFd2ZCdm3zrraPUBEBuGXgLwYrje9Jmgg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Thanks for your replies, that gave me the clue for what I was looking for,
and now it is solved!

On Wednesday, December 31, 2014 10:36:27 PM UTC-5, Bruno Kamiche wrote:

Hello, I'm new in using elasticsearch, so maybe this is a basic question...

Is there any way to see how is the text "stored" or at least how it would
look once the filters defined for a field (in the analyzer) are applied?

I know that the actual field is stored "as is", and the filters are
considered for indexing purposes, but i want to see what is the result of
that operation.

Why do I ask this, the reason is that I'm applying filters to a field (for
example removing urls), and performing aggregations on that field, but the
aggregations return "http" as a used word, and guess that is not correct,
as it shall be removed...

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/9196629f-7bf8-4256-8b16-8ecee2fbd020%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

have a look
at https://github.com/polyfractal/elasticsearch-inquisitor#analyzer-testing

On Thursday, 1 January 2015 17:05:25 UTC, Bruno Kamiche wrote:

Thanks for your replies, that gave me the clue for what I was looking for,
and now it is solved!

On Wednesday, December 31, 2014 10:36:27 PM UTC-5, Bruno Kamiche wrote:

Hello, I'm new in using elasticsearch, so maybe this is a basic
question...

Is there any way to see how is the text "stored" or at least how it would
look once the filters defined for a field (in the analyzer) are applied?

I know that the actual field is stored "as is", and the filters are
considered for indexing purposes, but i want to see what is the result of
that operation.

Why do I ask this, the reason is that I'm applying filters to a field
(for example removing urls), and performing aggregations on that field, but
the aggregations return "http" as a used word, and guess that is not
correct, as it shall be removed...

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/59accfbd-46a4-437c-81b1-6978b752ac4c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.