I have ES "0.20.3" .. with a single index replicated on 2 servers .. with
5 shards ..* size: 57.2gb / docs: 36060297 *
I have webpages(the docs) indexed since 2013 ... so I want to delete
everything and keep 1 year worth of documents
how can I do this safely on a production setup
am thinking of the following:
1- make a copy of the current index ... put it next to it (with a different
name ofcourse) ... (how do I accomplish this)
2- delete documents before May 2014 from the copied index
3- Rename the old index and leave / Delete it
OR
1- make an empty index
2- pull documents after May2014 from the current index to the new one
*(how do I accomplish this)*3- Rename the old index and leave / Delete it
I have ES "0.20.3" .. with a single index replicated on 2 servers .. with 5 shards .. size: 57.2gb / docs: 36060297
I have webpages(the docs) indexed since 2013 ... so I want to delete everything and keep 1 year worth of documents
how can I do this safely on a production setup
am thinking of the following:
1- make a copy of the current index ... put it next to it (with a different name ofcourse) ... (how do I accomplish this)
2- delete documents before May 2014 from the copied index
3- Rename the old index and leave / Delete it
OR
1- make an empty index
2- pull documents after May2014 from the current index to the new one (how do I accomplish this)
3- Rename the old index and leave / Delete it
do you know how I perform step (2) pull documents after May2014 from the
current index to the new one
On Monday, May 4, 2015 at 12:15:22 AM UTC+3, David Pilato wrote:
Definitely the second option.
Use scan and scroll (search for reindex on the website).
Instead of renaming, I would use aliases and switch the alias from old to
new index.
Then close or remove the old index.
--
David
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs
Le 3 mai 2015 à 22:24, AALISHE <aal...@gmail.com <javascript:>> a écrit :
Hi,
I have ES "0.20.3" .. with a single index replicated on 2 servers .. with
5 shards ..* size: 57.2gb / docs: 36060297 *
I have webpages(the docs) indexed since 2013 ... so I want to delete
everything and keep 1 year worth of documents
how can I do this safely on a production setup
am thinking of the following:
1- make a copy of the current index ... put it next to it (with a
different name ofcourse) ... (how do I accomplish this)
2- delete documents before May 2014 from the copied index
3- Rename the old index and leave / Delete it
OR
1- make an empty index
2- pull documents after May2014 from the current index to the new one
*(how do I accomplish this)*3- Rename the old index and leave / Delete
it
do you know how I perform step (2) pull documents after May2014 from the current index to the new one
On Monday, May 4, 2015 at 12:15:22 AM UTC+3, David Pilato wrote:
Definitely the second option.
Use scan and scroll (search for reindex on the website).
Instead of renaming, I would use aliases and switch the alias from old to new index.
Then close or remove the old index.
--
David
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs
I have ES "0.20.3" .. with a single index replicated on 2 servers .. with 5 shards .. size: 57.2gb / docs: 36060297
I have webpages(the docs) indexed since 2013 ... so I want to delete everything and keep 1 year worth of documents
how can I do this safely on a production setup
am thinking of the following:
1- make a copy of the current index ... put it next to it (with a different name ofcourse) ... (how do I accomplish this)
2- delete documents before May 2014 from the copied index
3- Rename the old index and leave / Delete it
OR
1- make an empty index
2- pull documents after May2014 from the current index to the new one (how do I accomplish this)
3- Rename the old index and leave / Delete it
I have ES "0.20.3" .. with a single index replicated on 2 servers ..
with 5 shards ..* size: 57.2gb / docs: 36060297 *
I have webpages(the docs) indexed since 2013 ... so I want to delete
everything and keep 1 year worth of documents
how can I do this safely on a production setup
am thinking of the following:
1- make a copy of the current index ... put it next to it (with a
different name ofcourse) ... (how do I accomplish this)
2- delete documents before May 2014 from the copied index
3- Rename the old index and leave / Delete it
OR
1- make an empty index
2- pull documents after May2014 from the current index to the new one
*(how do I accomplish this)*3- Rename the old index and leave / Delete
it
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.