Use Graph for document classification


(Michel Mooren) #1

The Graph introduction video mentions that vertices in a graph could be used to construct a percolate query. These queries then could be used to classify a document. The example given is that of names of death metal bands. I’m looking into a use case in which I want to define a number of graphs, each graph representing a topic. Each topic graph will contain vertices from multiple fields. I’ve read that in 5.0 graphs can be stored. 2 questions:

  1. Will it be possible to store a graph as a percolate query?
  2. If so, will only the vertices be part of the query or can (the weights of) the connections somehow also be taken into account to determine how well the document matches the query?

(Mark Harwood) #2

Hi Michel,

Will it be possible to store a graph as a percolate query?

Not directly but it should be easy to implement. Another feature coming in 5.0 is the ability to configure "drill-down urls". These allow selections in the graph workspace to be passed as parameters to other URLs e.g. Kibana visualizations or 3rd party websites. Here's how it is configured in the new UI:

Once configured you have a button that allows users to click to perform that action on a selection.
If you can provide a simple web app that takes the parameters passed in the URL it could make a call to save them as a percolator query (perhaps pausing first to ask the user to configure a choice of tag e.g. "death metal"). We haven't added such a percolator-saving function as yet but it would be easy to implement.

For the existing 2.4 release, there is a more manual set of steps you can follow to get hold of what would be your classifier query which I have documented here:

(system) #3