Is this possible in ES?

The problem I'm trying to solve is:

  1. Users create folders that contain 500 docs on average. These folders
    change every day for active users, who could be 5-30% of all users. A
    typical change adds a few docs.

  2. There will be 10-100 million docs and 100,000 users, each with a dozen
    folders.

  3. I will have a list of around 30-100 query words and a list of
    50-200 folders that are interesting.

  4. I want to search all my docs with this list of words and folders without
    requiring any of them, and have the docs that match the most of them
    rank the highest (topics I'm interested in from people I like). Query time
    needs to be around a minute.

Questions

  1. Is this possible with ES?
  2. How would you do it?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

On Fri, 2013-02-08 at 15:30 -0800, David Hagar wrote:

Questions

  1. Is this possible with ES?

Yes.

  2. How would you do it?

That's a big question :)

I'd start by playing with Elasticsearch, indexing some documents with
fields for user, folder, date and content, then come back with more
specific questions.

clint
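
As a starting point for experimenting, here is a minimal sketch of how the
search in point 4 might be expressed: a bool query made up only of "should"
clauses, so nothing is required and docs that match more words and folders
score higher. The field names "content" and "folder" follow the fields
suggested above but are assumptions, as are the helper name build_query and
the sample words and folder ids. The code only builds the request body and
does not talk to a cluster.

```python
import json


def build_query(words, folders, size=20):
    """Build a search body in which every word and folder is optional."""
    should = []
    # One optional clause per query word ("content" is an assumed field name).
    for word in words:
        should.append({"match": {"content": word}})
    # One optional clause per interesting folder ("folder" is an assumed field name).
    for folder in folders:
        should.append({"term": {"folder": folder}})
    # Only "should" clauses: no single clause is required, and each extra
    # matching clause adds to a doc's score.
    return {"size": size, "query": {"bool": {"should": should}}}


# Hypothetical sample input, for illustration only.
body = build_query(["search", "ranking"], ["folder-17", "folder-42"])
print(json.dumps(body, indent=2))
```

The printed JSON can be sent as the body of a search request. Whether scoring
this way ranks docs the way you want is something to check against real data.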
