Elastic search configuration for windows server

saineshwar_bageri · June 26, 2018, 8:00am

If I have a data of 20 Million and I want to push into elastic search single instance.

what will be the configuration of the server, I will be using windows server.
Does single instance is enough.
Creating Elasticsearch cluster is mandatory.

dadoonet · June 26, 2018, 8:33am

May I suggest you look at the following resources about sizing:

https://www.elastic.co/elasticon/conf/2016/sf/quantitative-cluster-sizing

saineshwar_bageri · June 30, 2018, 9:35am

Hi Dadoonet sir,

I have one more query i want to update data in elastic search , what if data to update is 1 Million and which api to use and tools (Logstash) or any other tools.

dadoonet · June 30, 2018, 10:01am

It depends on what tool you used at first I believe.

saineshwar_bageri · June 30, 2018, 10:15am

I am using Logstash with jdbc an it is on windows server.

dadoonet · June 30, 2018, 10:39am

Then use the same tools to update. Note that if you are going to update a lot of documents, it might be better to reindex the whole dataset instead.

saineshwar_bageri · June 30, 2018, 12:03pm

Hi dadoonet sir,

Any documents links for Update API and Re-indexing document.

dadoonet · June 30, 2018, 1:01pm

Updating a document is the same API as creating a document.
When I say reindex, I meant index again as you did the first time.

saineshwar_bageri · July 2, 2018, 8:06am

Hi dadoonet sir,

how to speed document insert into elastic search using logstash.

Can you give me some suggestion on it sir.

I am using JDBC plugin below is the code of it.

input {  
    jdbc {  
        jdbc_driver_library => "D:\sqljdbc_6.4.0.0_enu\sqljdbc_6.4\enu\mssql-jdbc-6.4.0.jre8.jar"  
        jdbc_driver_class => "com.microsoft.sqlserver.jdbc.SQLServerDriver"  
        jdbc_connection_string => "jdbc:sqlserver://SAI-PC;user=sa;password=Pass$123;"  
        jdbc_user => "sa"  
        jdbc_password => "Pass$123"  
        statement => "SELECT * FROM [AdventureWorks2008R2].[HumanResources].[Employee]"  
    }  
}  
filter {}  
output {  
    stdout {  
        codec => json_lines  
    }  
    elasticsearch {  
        hosts => "http://localhost:9200"  
        index => "humanresources"  
    }  
}

dadoonet · July 2, 2018, 8:43am

In my experience, most of time is spent on reading the source database.
In that case, you can may be add a WHERE clause in your query to select only a subset of your documents and then run multiple logstash pipelines at once in parallel?

system · July 30, 2018, 8:43am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Configuration of elasticsearch to index 300 Million documents Elasticsearch	4	5926	July 6, 2017
Cluster recommended for crawling 150 million documents Elasticsearch	6	941	February 27, 2020
Indexing 570 millions rows Elasticsearch	4	3694	July 5, 2017
Infrastructure for Elastissearch Elasticsearch	5	564	February 5, 2018
Need pc configuration for large data Elasticsearch	2	350	July 24, 2020

Elastic search configuration for windows server

Related topics