Writing custom scripts for indexing data in Elasticsearch

Hi,

I'm a newbie to Elasticsearch, and I would like to know about writing
custom scripts in Elasticsearch. Basically, I want to have a script which
takes a table name and the columns in it as parameters, and would then
start a JDBC river plugin and index the corresponding data into my
Elasticsearch. Basically, I'm kind of looking for a mechanism through which
I can automatically index data into my Elasticsearch on just specifying my
required table and columns. And I would very much like to know if this
viable and also any other ideas through which I could implement this?

Thank you.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f2c89e51-f690-433e-b320-f18659958eee%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

If you can set up shell scripting, it should be viable to define a curl
command in a script that copies the table and columns into an SQL statement
and performs something similar to step 7 in

Jörg

On Fri, Jan 9, 2015 at 2:45 PM, Amtul Nazneen amtulnazneen7@gmail.com
wrote:

Hi,

I'm a newbie to Elasticsearch, and I would like to know about writing
custom scripts in Elasticsearch. Basically, I want to have a script which
takes a table name and the columns in it as parameters, and would then
start a JDBC river plugin and index the corresponding data into my
Elasticsearch. Basically, I'm kind of looking for a mechanism through which
I can automatically index data into my Elasticsearch on just specifying my
required table and columns. And I would very much like to know if this
viable and also any other ideas through which I could implement this?

Thank you.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/f2c89e51-f690-433e-b320-f18659958eee%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/f2c89e51-f690-433e-b320-f18659958eee%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAKdsXoEr02XoB%3Dh1bE7qko7HSavJKZm7Zp5EYNg_aifwQMi%2B4g%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Thank you. I have a doubt though, once I run the script, the river plugin
is started and the data gets indexed into Elasticsearch, I want to know, if
the plugin would be running after that, or does it stop once the script
execution comes to an end?

On Sunday, January 11, 2015 at 12:16:05 AM UTC+5:30, Jörg Prante wrote:

If you can set up shell scripting, it should be viable to define a curl
command in a script that copies the table and columns into an SQL statement
and performs something similar to step 7 in
https://github.com/jprante/elasticsearch-river-jdbc/wiki/Quickstart

Jörg

On Fri, Jan 9, 2015 at 2:45 PM, Amtul Nazneen <amtuln...@gmail.com
<javascript:>> wrote:

Hi,

I'm a newbie to Elasticsearch, and I would like to know about writing
custom scripts in Elasticsearch. Basically, I want to have a script which
takes a table name and the columns in it as parameters, and would then
start a JDBC river plugin and index the corresponding data into my
Elasticsearch. Basically, I'm kind of looking for a mechanism through which
I can automatically index data into my Elasticsearch on just specifying my
required table and columns. And I would very much like to know if this
viable and also any other ideas through which I could implement this?

Thank you.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/f2c89e51-f690-433e-b320-f18659958eee%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/f2c89e51-f690-433e-b320-f18659958eee%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/e66698ab-cef9-4e91-bdc4-db0c5e4d3a72%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Thank you. I have a doubt though, once I run the script, the river plugin
is started and the data gets indexed into Elasticsearch, I want to know, if
the plugin would be running after that, or does it stop once the script
execution comes to an end?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/72e9e2e1-20fe-460d-afed-98527b7004ec%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

It executes once. You could consider running that script on a schedule and
doing incremental updates using timestamps.

On Sunday, January 11, 2015 at 9:24:28 PM UTC-8, Amtul Nazneen wrote:

Thank you. I have a doubt though, once I run the script, the river plugin
is started and the data gets indexed into Elasticsearch, I want to know, if
the plugin would be running after that, or does it stop once the script
execution comes to an end?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/e671b48a-a57d-4917-a9ac-c23face41f43%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Ohkay. So the river runs only once when the script starts? And after that
won't it be running in the background to fetch the updates according to a
schedule?

On Monday, January 12, 2015 at 1:23:08 PM UTC+5:30, Ed Kim wrote:

It executes once. You could consider running that script on a schedule and
doing incremental updates using timestamps.

On Sunday, January 11, 2015 at 9:24:28 PM UTC-8, Amtul Nazneen wrote:

Thank you. I have a doubt though, once I run the script, the river plugin
is started and the data gets indexed into Elasticsearch, I want to know, if
the plugin would be running after that, or does it stop once the script
execution comes to an end?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

I guess you need to set interval. See doc plugin on the home page of the JDBC river.

interval - a time value for the delay between two river runs (default: not set)

--
David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 14 janv. 2015 à 06:01, Amtul Nazneen amtulnazneen7@gmail.com a écrit :

Ohkay. So the river runs only once when the script starts? And after that won't it be running in the background to fetch the updates according to a schedule?

On Monday, January 12, 2015 at 1:23:08 PM UTC+5:30, Ed Kim wrote:
It executes once. You could consider running that script on a schedule and doing incremental updates using timestamps.

On Sunday, January 11, 2015 at 9:24:28 PM UTC-8, Amtul Nazneen wrote:
Thank you. I have a doubt though, once I run the script, the river plugin is started and the data gets indexed into Elasticsearch, I want to know, if the plugin would be running after that, or does it stop once the script execution comes to an end?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/BC10D79E-335C-4D89-8F41-EE95FCDAA464%40pilato.fr.
For more options, visit https://groups.google.com/d/optout.

Thank you. Is it the "interval" parameter or "schedule" parameter? If I set
the schedule parameter, then the Elasticsearch will poll the tables
accordingly right?

On Wednesday, January 14, 2015 at 2:31:07 PM UTC+5:30, David Pilato wrote:

I guess you need to set interval. See doc plugin on the home page of the
JDBC river.

interval - a time value for the delay between two river runs (default:
not set)

--
David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 14 janv. 2015 à 06:01, Amtul Nazneen <amtuln...@gmail.com <javascript:>>
a écrit :

Ohkay. So the river runs only once when the script starts? And after that
won't it be running in the background to fetch the updates according to a
schedule?

On Monday, January 12, 2015 at 1:23:08 PM UTC+5:30, Ed Kim wrote:

It executes once. You could consider running that script on a schedule
and doing incremental updates using timestamps.

On Sunday, January 11, 2015 at 9:24:28 PM UTC-8, Amtul Nazneen wrote:

Thank you. I have a doubt though, once I run the script, the river
plugin is started and the data gets indexed into Elasticsearch, I want to
know, if the plugin would be running after that, or does it stop once the
script execution comes to an end?

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/9db75940-2ca3-4140-a681-cba55ac3725a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

"schedule" is triggering the JDBC plugin by wall clock time of the machine,
where "interval" simply waits the given time period between two runs.

Jörg

On Fri, Jan 16, 2015 at 11:12 AM, Amtul Nazneen amtulnazneen7@gmail.com
wrote:

Thank you. Is it the "interval" parameter or "schedule" parameter? If I
set the schedule parameter, then the Elasticsearch will poll the tables
accordingly right?

On Wednesday, January 14, 2015 at 2:31:07 PM UTC+5:30, David Pilato wrote:

I guess you need to set interval. See doc plugin on the home page of the
JDBC river.

interval - a time value for the delay between two river runs (default:
not set)

--
David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 14 janv. 2015 à 06:01, Amtul Nazneen amtuln...@gmail.com a écrit :

Ohkay. So the river runs only once when the script starts? And after that
won't it be running in the background to fetch the updates according to a
schedule?

On Monday, January 12, 2015 at 1:23:08 PM UTC+5:30, Ed Kim wrote:

It executes once. You could consider running that script on a schedule
and doing incremental updates using timestamps.

On Sunday, January 11, 2015 at 9:24:28 PM UTC-8, Amtul Nazneen wrote:

Thank you. I have a doubt though, once I run the script, the river
plugin is started and the data gets indexed into Elasticsearch, I want to
know, if the plugin would be running after that, or does it stop once the
script execution comes to an end?

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/
msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%
40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/9db75940-2ca3-4140-a681-cba55ac3725a%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/9db75940-2ca3-4140-a681-cba55ac3725a%40googlegroups.com?utm_medium=email&utm_source=footer
.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAKdsXoFXKhuNY54R4UAVAw%2Bo8eYVshQiqmuQgHaPjY7qd2f6Zw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Thank you :slight_smile:

On Friday, January 16, 2015 at 6:27:48 PM UTC+5:30, Jörg Prante wrote:

"schedule" is triggering the JDBC plugin by wall clock time of the
machine, where "interval" simply waits the given time period between two
runs.

Jörg

On Fri, Jan 16, 2015 at 11:12 AM, Amtul Nazneen <amtuln...@gmail.com
<javascript:>> wrote:

Thank you. Is it the "interval" parameter or "schedule" parameter? If I
set the schedule parameter, then the Elasticsearch will poll the tables
accordingly right?

On Wednesday, January 14, 2015 at 2:31:07 PM UTC+5:30, David Pilato wrote:

I guess you need to set interval. See doc plugin on the home page of the
JDBC river.

interval - a time value for the delay between two river runs (default:
not set)

--
David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 14 janv. 2015 à 06:01, Amtul Nazneen amtuln...@gmail.com a écrit :

Ohkay. So the river runs only once when the script starts? And after
that won't it be running in the background to fetch the updates according
to a schedule?

On Monday, January 12, 2015 at 1:23:08 PM UTC+5:30, Ed Kim wrote:

It executes once. You could consider running that script on a schedule
and doing incremental updates using timestamps.

On Sunday, January 11, 2015 at 9:24:28 PM UTC-8, Amtul Nazneen wrote:

Thank you. I have a doubt though, once I run the script, the river
plugin is started and the data gets indexed into Elasticsearch, I want to
know, if the plugin would be running after that, or does it stop once the
script execution comes to an end?

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/
msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%
40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/a0997950-4ca4-4036-9550-d1da3816b503%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/9db75940-2ca3-4140-a681-cba55ac3725a%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/9db75940-2ca3-4140-a681-cba55ac3725a%40googlegroups.com?utm_medium=email&utm_source=footer
.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/2c72317f-1d59-4b12-955a-5c990b2491ee%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.