How can i identify? Is it indexed successful or not?


(Sigehere) #1

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'
{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #2

You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com) a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'
{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(Sigehere) #3

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0,
"_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0,
"_source" :
{"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/
192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:

You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com//webmaster.swapnil@gmail.com)
a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'

{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com
.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local
.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #4

So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0, "_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0, "_source" : {"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:
You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com) a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'
{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(Sigehere) #5

Hi David,

My file is still exist at same location file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:

So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0,
"_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0,
"_source" :
{"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/
192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:

You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com//webmaster.swapnil@gmail.com)
a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'

{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com
.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com
.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local
.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #6

Could you try to move your file on a local disk?
I mean that this is the first time I see it used with file:// URL and spaces in pathname.

Yes elasticsearch index bz2 file.

By default, if you create the river without parameters:
curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

We index http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 which is a bz2 file.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 10:32:05, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Hi David,

My file is still exist at same location file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:
So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0, "_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0, "_source" : {"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:
You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com) a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'
{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b4100d.7a6d8d3c.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(Sigehere) #7

Q . If I index on wiki dump then Is it take some same?
Q . is elasticsearch are extract file or not?

Thank You

On Fri, Dec 20, 2013 at 3:08 PM, David Pilato david@pilato.fr wrote:

Could you try to move your file on a local disk?
I mean that this is the first time I see it used with file:// URL and
spaces in pathname.

Yes elasticsearch index bz2 file.

By default, if you create the river without parameters:

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

We index
http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 which
is a bz2 file.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 10:32:05, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

Hi David,

My file is still exist at same location file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:

So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0,
"_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0,
"_source" :
{"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/
192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:

You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com//webmaster.swapnil@gmail.com)
a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'

{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com
.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com
.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b4100d.7a6d8d3c.111%40MacBook-Air-de-David.local
.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAbZLCv2-gPn9Q5m-HWWbX5%2BHB6ahE5jaQhGieMjp%2B3Cw%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #8

Sorry. Not sure I understand your questions.
Elasticsearch is streaming wikipedia content. So it does not really "download" the file somewhere.

Was that your question?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 10:47:18, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Q . If I index on wiki dump then Is it take some same?
Q . is elasticsearch are extract file or not?

Thank You

On Fri, Dec 20, 2013 at 3:08 PM, David Pilato david@pilato.fr wrote:
Could you try to move your file on a local disk?
I mean that this is the first time I see it used with file:// URL and spaces in pathname.

Yes elasticsearch index bz2 file.

By default, if you create the river without parameters:

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

We index http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 which is a bz2 file.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 10:32:05, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Hi David,

My file is still exist at same location file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:
So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0, "_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0, "_source" : {"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:
You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com) a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'
{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b4100d.7a6d8d3c.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAbZLCv2-gPn9Q5m-HWWbX5%2BHB6ahE5jaQhGieMjp%2B3Cw%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b41746.32fff902.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(Sigehere) #9

If it does not download file then every curl request for searching data it
hit on wiki server then it will give result to user?
Is I am right?

On Fri, Dec 20, 2013 at 3:39 PM, David Pilato david@pilato.fr wrote:

Sorry. Not sure I understand your questions.
Elasticsearch is streaming wikipedia content. So it does not really
"download" the file somewhere.

Was that your question?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 10:47:18, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

Q . If I index on wiki dump then Is it take some same?
Q . is elasticsearch are extract file or not?

Thank You

On Fri, Dec 20, 2013 at 3:08 PM, David Pilato david@pilato.fr wrote:

Could you try to move your file on a local disk?
I mean that this is the first time I see it used with file:// URL and
spaces in pathname.

Yes elasticsearch index bz2 file.

By default, if you create the river without parameters:

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

We index
http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 which
is a bz2 file.

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 10:32:05, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

Hi David,

My file is still exist at same location file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:

So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0,
"_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0,
"_source" :
{"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/
192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:

You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com//webmaster.swapnil@gmail.com)
a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'

{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com
.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe
.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b4100d.7a6d8d3c.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRAbZLCv2-gPn9Q5m-HWWbX5%2BHB6ahE5jaQhGieMjp%2B3Cw%40mail.gmail.com
.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b41746.32fff902.111%40MacBook-Air-de-David.local
.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRBoOr7mVq6NN4TBroKmwL2PJn81XU%2BNBUZa0LUVFsjiBQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #10

When you create the river, it streams the content from wikipedia and index all documents locally in elasticsearch.
When you query you only query on local index.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 11:11:37, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

If it does not download file then every curl request for searching data it hit on wiki server then it will give result to user?
Is I am right?

On Fri, Dec 20, 2013 at 3:39 PM, David Pilato david@pilato.fr wrote:
Sorry. Not sure I understand your questions.
Elasticsearch is streaming wikipedia content. So it does not really "download" the file somewhere.

Was that your question?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 10:47:18, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Q . If I index on wiki dump then Is it take some same?
Q . is elasticsearch are extract file or not?

Thank You

On Fri, Dec 20, 2013 at 3:08 PM, David Pilato david@pilato.fr wrote:
Could you try to move your file on a local disk?
I mean that this is the first time I see it used with file:// URL and spaces in pathname.

Yes elasticsearch index bz2 file.

By default, if you create the river without parameters:

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

We index http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 which is a bz2 file.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 10:32:05, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Hi David,

My file is still exist at same location file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:
So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0, "_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0, "_source" : {"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:
You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com) a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'
{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b4100d.7a6d8d3c.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAbZLCv2-gPn9Q5m-HWWbX5%2BHB6ahE5jaQhGieMjp%2B3Cw%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b41746.32fff902.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRBoOr7mVq6NN4TBroKmwL2PJn81XU%2BNBUZa0LUVFsjiBQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b418ae.3dc240fb.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(Sigehere) #11

Oh !!!
How many time required for execution of this curl request?

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

If server had restarted in then index are flash or still it exist?

when I had downloaded wiki dump it takes 7 days on torrent

On Fri, Dec 20, 2013 at 3:45 PM, David Pilato david@pilato.fr wrote:

When you create the river, it streams the content from wikipedia and index
all documents locally in elasticsearch.
When you query you only query on local index.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 11:11:37, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

If it does not download file then every curl request for searching data
it hit on wiki server then it will give result to user?
Is I am right?

On Fri, Dec 20, 2013 at 3:39 PM, David Pilato david@pilato.fr wrote:

Sorry. Not sure I understand your questions.
Elasticsearch is streaming wikipedia content. So it does not really
"download" the file somewhere.

Was that your question?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 10:47:18, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

Q . If I index on wiki dump then Is it take some same?

Q . is elasticsearch are extract file or not?

Thank You

On Fri, Dec 20, 2013 at 3:08 PM, David Pilato david@pilato.fr wrote:

Could you try to move your file on a local disk?
I mean that this is the first time I see it used with file:// URL and
spaces in pathname.

Yes elasticsearch index bz2 file.

By default, if you create the river without parameters:

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

We index
http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 which
is a bz2 file.

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 10:32:05, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

Hi David,

My file is still exist at same location file:///media/Seagate Backup
Plus Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:

So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0,
"_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0,
"_source" :
{"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/
192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:

You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

 --

David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com//webmaster.swapnil@gmail.com)
a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus
Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'

{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com
.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe
.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe
.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b4100d.7a6d8d3c.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRAbZLCv2-gPn9Q5m-HWWbX5%2BHB6ahE5jaQhGieMjp%2B3Cw%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b41746.32fff902.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRBoOr7mVq6NN4TBroKmwL2PJn81XU%2BNBUZa0LUVFsjiBQ%40mail.gmail.com
.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b418ae.3dc240fb.111%40MacBook-Air-de-David.local
.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRBFtdAQyEQmp%2BwBQtq%3DN7pAtT-oKD6dqbNFUD3aABugTA%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #12

Only once.
When you restart ES, documents still existing.

That said regarding your first concern, do you see anything in logs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 11:20:44, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Oh !!!
How many time required for execution of this curl request?

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'
If server had restarted in then index are flash or still it exist?

when I had downloaded wiki dump it takes 7 days on torrent

On Fri, Dec 20, 2013 at 3:45 PM, David Pilato david@pilato.fr wrote:
When you create the river, it streams the content from wikipedia and index all documents locally in elasticsearch.
When you query you only query on local index.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 11:11:37, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

If it does not download file then every curl request for searching data it hit on wiki server then it will give result to user?
Is I am right?

On Fri, Dec 20, 2013 at 3:39 PM, David Pilato david@pilato.fr wrote:
Sorry. Not sure I understand your questions.
Elasticsearch is streaming wikipedia content. So it does not really "download" the file somewhere.

Was that your question?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 10:47:18, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Q . If I index on wiki dump then Is it take some same?
Q . is elasticsearch are extract file or not?

Thank You

On Fri, Dec 20, 2013 at 3:08 PM, David Pilato david@pilato.fr wrote:
Could you try to move your file on a local disk?
I mean that this is the first time I see it used with file:// URL and spaces in pathname.

Yes elasticsearch index bz2 file.

By default, if you create the river without parameters:

curl -XPUT localhost:9200/_river/my_river/_meta -d '{ "type" : "wikipedia" }'

We index http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2 which is a bz2 file.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 10:32:05, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Hi David,

My file is still exist at same location file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2

Can elasticsearch indexed on *.bz2 file?

this enwiki-20131202-pages-articles.xml.bz2 file approx 9GB in size.

On Fri, Dec 20, 2013 at 2:28 PM, David Pilato david@pilato.fr wrote:
So no other document in the cluster. It sounds like it did not work.
May be your URL is incorrect here?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 09:37:22, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

I got following response:

How I know total index count?

$ curl http://192.168.1.13:9200/_search?q=*
{"took":88,"timed_out":false,"_shards":{"total":2,"successful":2,"failed":0},"hits":{"total":2,"max_score":1.0,"hits":[{"_index":"_river","_type":"my_wikipedia","_id":"_meta","_score":1.0, "_source" :
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
},{"_index":"_river","_type":"my_wikipedia","_id":"_status","_score":1.0, "_source" : {"ok":true,"node":{"id":"drlilI2KQHO123LWXwsqEQ","name":"Spitfire","transport_address":"inet[/192.168.1.13:9300]"}}}]}}

On Fri, Dec 20, 2013 at 1:04 PM, David Pilato david@pilato.fr wrote:
You could run a query?

curl http://localhost:9200/_search?q=*

And see if you docs?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 08:10:32, Sigehere (webmaster.swapnil@gmail.com) a écrit:

Hi Friends,

I have installed elasticsearch and elasticsearch wikipedia river plagin

then i have run following curl request and i got this response

$ curl -XPUT 192.168.1.13:9200/_river/my_wikipedia/_meta -d '
{
"type" : "wikipedia",
"wikipedia" : {
"url" : "file:///media/Seagate Backup Plus Drive/enwiki-20131202-pages-articles.xml.bz2"
}
}
'
{"ok":true,"_index":"_river","_type":"my_wikipedia","_id":"_meta","_version":3}

What is the meaning of this message?

this response got me with in 2 sec.

How can I identify is it indexed successful or not?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.

To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d28eca4c-2cbc-4e68-b9c4-bab8d015d861%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b3f316.3006c83e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAWWDHoribYRsMKd%2Bgc5C7KqkYUBbku7WAiXs_xREmPwQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b406d0.7724c67e.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRB4Zwnx19hYUB7prKUCjzuwcDioOCwc-w9ZB0-o8EYnaQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b4100d.7a6d8d3c.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRAbZLCv2-gPn9Q5m-HWWbX5%2BHB6ahE5jaQhGieMjp%2B3Cw%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b41746.32fff902.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRBoOr7mVq6NN4TBroKmwL2PJn81XU%2BNBUZa0LUVFsjiBQ%40mail.gmail.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b418ae.3dc240fb.111%40MacBook-Air-de-David.local.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRBFtdAQyEQmp%2BwBQtq%3DN7pAtT-oKD6dqbNFUD3aABugTA%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b41a6d.12e685fb.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(Sigehere) #13

Now i have change file location and create new curl request

at that time i have check log

[2013-12-20 17:13:01,192][INFO ][river.wikipedia ] [Hermes]
[wikipedia][wikipedia] starting wikipedia stream
[2013-12-20 17:13:01,266][INFO ][cluster.metadata ] [Hermes]
[wikipedia] creating index, cause [api], shards [1]/[0], mappings []
[2013-12-20 17:13:01,301][ERROR][river.wikipedia ] [Hermes]
[wikipedia][wikipedia] failed to parse stream
java.io.FileNotFoundException:
/opt/wiki/enwiki-20131202-pages-articles.xml.bz2 (Permission denied)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.(FileInputStream.java:137)
at java.io.FileInputStream.(FileInputStream.java:96)
at
sun.net.www.protocol.file.FileURLConnection.connect(FileURLConnection.java:87)
at
sun.net.www.protocol.file.FileURLConnection.getInputStream(FileURLConnection.java:178)
at java.net.URL.openStream(URL.java:1031)
at
org.elasticsearch.river.wikipedia.support.WikiXMLParser.getInputSource(WikiXMLParser.java:77)
at
org.elasticsearch.river.wikipedia.support.WikiXMLSAXParser.parse(WikiXMLSAXParser.java:68)
at
org.elasticsearch.river.wikipedia.WikipediaRiver$Parser.run(WikipediaRiver.java:192)
at java.lang.Thread.run(Thread.java:679)
[2013-12-20 17:13:01,362][INFO ][cluster.metadata ] [Hermes]
[_river] update_mapping [wikipedia] (dynamic)

my log status as like this

but i had given all file permission to xxx.bz2 file.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRDoiLtcKJo%2BmVgKWb9AxC-iwRr_XHQPoc%3DYHZG2FrQtuQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #14

So probably a permission issue.
change owner of file to be the same as the one who runs elasticsearch.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 20 décembre 2013 at 12:48:45, swapnil joshi (webmaster.swapnil@gmail.com) a écrit:

Now i have change file location and create new curl request

at that time i have check log

[2013-12-20 17:13:01,192][INFO ][river.wikipedia ] [Hermes] [wikipedia][wikipedia] starting wikipedia stream
[2013-12-20 17:13:01,266][INFO ][cluster.metadata ] [Hermes] [wikipedia] creating index, cause [api], shards [1]/[0], mappings []
[2013-12-20 17:13:01,301][ERROR][river.wikipedia ] [Hermes] [wikipedia][wikipedia] failed to parse stream
java.io.FileNotFoundException: /opt/wiki/enwiki-20131202-pages-articles.xml.bz2 (Permission denied)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.(FileInputStream.java:137)
at java.io.FileInputStream.(FileInputStream.java:96)
at sun.net.www.protocol.file.FileURLConnection.connect(FileURLConnection.java:87)
at sun.net.www.protocol.file.FileURLConnection.getInputStream(FileURLConnection.java:178)
at java.net.URL.openStream(URL.java:1031)
at org.elasticsearch.river.wikipedia.support.WikiXMLParser.getInputSource(WikiXMLParser.java:77)
at org.elasticsearch.river.wikipedia.support.WikiXMLSAXParser.parse(WikiXMLSAXParser.java:68)
at org.elasticsearch.river.wikipedia.WikipediaRiver$Parser.run(WikipediaRiver.java:192)
at java.lang.Thread.run(Thread.java:679)
[2013-12-20 17:13:01,362][INFO ][cluster.metadata ] [Hermes] [_river] update_mapping [wikipedia] (dynamic)

my log status as like this

but i had given all file permission to xxx.bz2 file.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRDoiLtcKJo%2BmVgKWb9AxC-iwRr_XHQPoc%3DYHZG2FrQtuQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b4368d.d34b6a8.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(Sigehere) #15

Hi David,

I am really very happy because, finally i have successfully done indexing
on wiki dump with elasticsearch.

Issue was my elasticsearch executed by "elasticsearch" user and dump file
have "sigehere" user there for he doesn't access dump file and give
(permission issue) in log file.

Q. Is it required 4 to 5 hour for complete indexing on
enwiki-20131202-pages-articles.xml.bz2?
Q. How Can I get status of indexing process?

Thanks for your valuable support !!!
:slight_smile:

On Fri, Dec 20, 2013 at 5:52 PM, David Pilato david@pilato.fr wrote:

So probably a permission issue.
change owner of file to be the same as the one who runs elasticsearch.

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr

Le 20 décembre 2013 at 12:48:45, swapnil joshi (
webmaster.swapnil@gmail.com //webmaster.swapnil@gmail.com) a écrit:

Now i have change file location and create new curl request

at that time i have check log

[2013-12-20 17:13:01,192][INFO ][river.wikipedia ] [Hermes]
[wikipedia][wikipedia] starting wikipedia stream
[2013-12-20 17:13:01,266][INFO ][cluster.metadata ] [Hermes]
[wikipedia] creating index, cause [api], shards [1]/[0], mappings []
[2013-12-20 17:13:01,301][ERROR][river.wikipedia ] [Hermes]
[wikipedia][wikipedia] failed to parse stream
java.io.FileNotFoundException:
/opt/wiki/enwiki-20131202-pages-articles.xml.bz2 (Permission denied)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.(FileInputStream.java:137)
at java.io.FileInputStream.(FileInputStream.java:96)
at
sun.net.www.protocol.file.FileURLConnection.connect(FileURLConnection.java:87)
at
sun.net.www.protocol.file.FileURLConnection.getInputStream(FileURLConnection.java:178)
at java.net.URL.openStream(URL.java:1031)
at
org.elasticsearch.river.wikipedia.support.WikiXMLParser.getInputSource(WikiXMLParser.java:77)
at
org.elasticsearch.river.wikipedia.support.WikiXMLSAXParser.parse(WikiXMLSAXParser.java:68)
at
org.elasticsearch.river.wikipedia.WikipediaRiver$Parser.run(WikipediaRiver.java:192)
at java.lang.Thread.run(Thread.java:679)
[2013-12-20 17:13:01,362][INFO ][cluster.metadata ] [Hermes]
[_river] update_mapping [wikipedia] (dynamic)

my log status as like this

but i had given all file permission to xxx.bz2 file.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAFBccRDoiLtcKJo%2BmVgKWb9AxC-iwRr_XHQPoc%3DYHZG2FrQtuQ%40mail.gmail.com
.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/xwKKS_ltv_0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/etPan.52b4368d.d34b6a8.111%40MacBook-Air-de-David.local
.

For more options, visit https://groups.google.com/groups/opt_out.

--
Regards,
Swapnil K. Joshi

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAFBccRDOctKU%3DXfH%2BkGLJqKFekSfK81DGv93ZypfF_6ZyA9Gfg%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(system) #16