Sizing RAM vs DISK

Stanislas_Polu · January 18, 2012, 11:20am

Hi,

After stuffing a large amount of data into a pre-production cluster made of
7 shards, replication 1 on 3 machines, I have a better understanding of how
my index will grow.
What I am still missing is how much memory vs disk space I'm supposed to
provision for acceptable performance.

Is there any rule of thumb here?

Best,

-stan

--
Stanislas Polu
Mo: +33 6 83 71 90 04 | Tw: @spolu | http://teleportd.com | Realtime Photo
Search

Stanislas_Polu · January 18, 2012, 12:53pm

I'm using a disk-based index with the local gateway.

-stan

Karussell1 · January 18, 2012, 1:09pm

What do you mean by disk size? The disk space you need is driven by your
data, so it depends on the size of your documents and the document
count.

Regarding RAM, try to give Elasticsearch only about half of the system's
total RAM, so that the OS cache can use the rest.

Peter.
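
The "half of the RAM" rule of thumb above can be written down as a small calculation. This is only a sketch: the function name `split_ram` and the `heap_fraction` parameter are made up for illustration, and reading "use only half" as "give the JVM heap half" is an assumption about what Peter meant.

```python
def split_ram(total_ram_gb, heap_fraction=0.5):
    """Split a machine's RAM between the JVM heap and the OS cache.

    heap_fraction defaults to 0.5, the "half of the RAM" rule of thumb
    from the thread. Both names are hypothetical, not an Elasticsearch API.
    """
    if not 0 < heap_fraction < 1:
        raise ValueError("heap_fraction must be strictly between 0 and 1")
    heap_gb = total_ram_gb * heap_fraction
    return {
        "jvm_heap_gb": heap_gb,
        # Whatever is not given to the heap stays available to the OS cache.
        "os_cache_gb": total_ram_gb - heap_gb,
    }
```

For a machine with 16 GB this suggests 8 GB for the heap and leaves 8 GB to the OS. Treat the fraction as a starting point to be checked against real measurements, not a fixed answer.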

Stanislas_Polu · January 18, 2012, 1:34pm

Agreed on the disk. Thanks for the clarification and the advice on RAM.

Does the _status indices.xxx.index.size include everything?
Does more RAM mean better perf?

-stan

kimchy · January 18, 2012, 9:24pm

Use the node stats API to see where memory is spent. Look mainly at the
field data cache (used for sorting and faceting) and the JVM memory used.
That will indicate whether you are running low on memory. The bigdesk
plugin can visualize it.
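
For anyone who prefers to check this from a script, here is a minimal sketch that pulls the two numbers mentioned above out of a node stats response. The endpoint (`/_nodes/stats/jvm,indices`) and the JSON field names are assumptions based on recent Elasticsearch releases; the version discussed in this thread exposed node stats under a different path and may name fields differently. The function names and the default URL are hypothetical.

```python
import json
from urllib.request import urlopen


def summarize_node_memory(stats):
    """Return per-node JVM heap and field data usage from a node stats dict.

    Field names (jvm.mem.heap_used_in_bytes, jvm.mem.heap_max_in_bytes,
    indices.fielddata.memory_size_in_bytes) are assumed from recent
    Elasticsearch versions; missing fields fall back to 0.
    """
    rows = []
    for node_id, node in stats.get("nodes", {}).items():
        mem = node.get("jvm", {}).get("mem", {})
        fielddata = node.get("indices", {}).get("fielddata", {})
        heap_used = mem.get("heap_used_in_bytes", 0)
        heap_max = mem.get("heap_max_in_bytes", 0)
        rows.append({
            "node": node.get("name", node_id),
            "heap_used_bytes": heap_used,
            "heap_max_bytes": heap_max,
            # Percentage is only meaningful when the max heap is known.
            "heap_used_pct": round(100 * heap_used / heap_max, 1) if heap_max else None,
            "fielddata_bytes": fielddata.get("memory_size_in_bytes", 0),
        })
    return rows


def fetch_node_stats(base_url="http://localhost:9200"):
    """Fetch node stats over HTTP (assumed endpoint, see the note above)."""
    with urlopen(base_url + "/_nodes/stats/jvm,indices") as resp:
        return json.load(resp)
```

Calling `summarize_node_memory(fetch_node_stats())` against a running node gives one row per node. A heap percentage that stays high, together with a large field data figure, is the kind of signal kimchy describes.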

Stanislas_Polu · January 18, 2012, 9:38pm

Thanks!

bigdesk is awesome.

Cheers,

-stan
