How do I import several files?


(Burleigh) #1
  • deleted -

(Ask Bjørn Hansen) #2

On Sep 16, 10:57 pm, Burleigh inco...@hotmail.com wrote:

I can't believe there isn't a whitepaper for beginners that explains how to
use
this software. Anyway, I have a huge number of PDF files that I need to
either
directly index in a directory, or I can convert them out into Json docs, I
don't
care which; but what I can't find is the methodology used to import several
documents instead of just typing the data into the engine, as in the
examples.

Your question is a little confusing -- it sounds like you know what
you need to do.

Convert the PDFs to text strings and submit them as JSON documents to
the search engine.

  • ask

(Burleigh) #3
  • deleted -

(Shay Banon) #4

There is the REST index API documented here:
http://www.elasticsearch.com/docs/elasticsearch/rest_api/index/. There is
also a built in Java client fully documented on the site:
http://www.elasticsearch.com/docs/elasticsearch/java_api/, and many client
libraries written in different languages:
http://www.elasticsearch.com/products/.

-shay.banon

p.s. I know its very culture specific (one thing is one culture can be
considered very rude in another), and in my several open source years I have
learned not to judge what people say solely based on my cultural bias (and
we all "suffer" from it), but I encourage people not to "shout" (uppercase)
in the mailing list, and try and keep the discussion civil.

On Mon, Sep 20, 2010 at 12:05 PM, SumnerClaridge incomex@hotmail.comwrote:

You still didn't answer the question. HOW do I submit an entire
document, and more importantly, how do I submit thousands
of documents? What do I need to read, or what is the syntax
used to load the docs into the engine?

On Sep 18, 2:12 pm, Ask Bjørn Hansen a...@develooper.com wrote:

On Sep 16, 10:57 pm, Burleigh inco...@hotmail.com wrote:

I can't believe there isn't a whitepaper for beginners that explains
how to

use
this software. Anyway, I have a huge number of PDF files that I need
to

either
directly index in a directory, or I can convert them out into Json
docs, I

don't
care which; but what I can't find is the methodology used to import
several

documents instead of just typing the data into the engine, as in the
examples.

Your question is a little confusing -- it sounds like you know what
you need to do.

Convert the PDFs to text strings and submit them as JSON documents to
the search engine.

  • ask

(system) #5