Wikipedia River: Index only title

I'm using the Wikipedia River, is there a way that I can index only the
title of Wikipedia article?

If I do this can I still get access of the entire article text through
ElasticSearch using standard /index/type/id?

Thanks so much,

Matt

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

IIRC, the wikipedia river does not provide a mapping at start up, so you
can supply your own mapping with only one field and dynamic set to false.

http://www.elasticsearch.org/guide/reference/mapping/object-type/

--
Ivan

On Mon, Jun 24, 2013 at 5:35 PM, Matt Arkin arkin@endlessm.com wrote:

I'm using the Wikipedia River, is there a way that I can index only the
title of Wikipedia article?

If I do this can I still get access of the entire article text through
ElasticSearch using standard /index/type/id?

Thanks so much,

Matt

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

So probably a very stupid question (its my 2nd day in Elastic Search), can
I supply a mapping when I make this curl request, or do I need to make the
river first, create a mapping then do the below call?

curl -XPUT localhost:9200/ptwikiss/page/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "
http://dumps.wikimedia.org/ptwiki/20130622/ptwiki-20130622-pages-articles.xml.bz2
"
}
}
'

Thanks,
Matt

On Monday, June 24, 2013 5:43:59 PM UTC-7, Ivan Brusic wrote:

IIRC, the wikipedia river does not provide a mapping at start up, so you
can supply your own mapping with only one field and dynamic set to false.

http://www.elasticsearch.org/guide/reference/mapping/object-type/

--
Ivan

On Mon, Jun 24, 2013 at 5:35 PM, Matt Arkin <ar...@endlessm.com<javascript:>

wrote:

I'm using the Wikipedia River, is there a way that I can index only the
title of Wikipedia article?

If I do this can I still get access of the entire article text through
ElasticSearch using standard /index/type/id?

Thanks so much,

Matt

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Create the index with the correct mapping first, then create the river
using the same index name. The river will attempt to create the index, so
make sure it exists first.

I have never used the Wikipedia river, but that is what I can gather by
looking at the source:

On Mon, Jun 24, 2013 at 5:50 PM, Matt Arkin arkin@endlessm.com wrote:

So probably a very stupid question (its my 2nd day in Elastic Search), can
I supply a mapping when I make this curl request, or do I need to make the
river first, create a mapping then do the below call?

curl -XPUT localhost:9200/ptwikiss/page/_**meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "http://dumps.wikimedia.org/ptwiki/20130622/ptwiki-
20130622-pages-articles.xml.**bz2http://dumps.wikimedia.org/ptwiki/20130622/ptwiki-20130622-pages-articles.xml.bz2
"
}
}
'

Thanks,
Matt

On Monday, June 24, 2013 5:43:59 PM UTC-7, Ivan Brusic wrote:

IIRC, the wikipedia river does not provide a mapping at start up, so you
can supply your own mapping with only one field and dynamic set to false.

http://www.elasticsearch.org/**guide/reference/mapping/**object-type/http://www.elasticsearch.org/guide/reference/mapping/object-type/

--
Ivan

On Mon, Jun 24, 2013 at 5:35 PM, Matt Arkin ar...@endlessm.com wrote:

I'm using the Wikipedia River, is there a way that I can index only the
title of Wikipedia article?

If I do this can I still get access of the entire article text through
ElasticSearch using standard /index/type/id?

Thanks so much,

Matt

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearc...@**googlegroups.com.

For more options, visit https://groups.google.com/**groups/opt_outhttps://groups.google.com/groups/opt_out
.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.