Visualizing time series counting first time a term appears in index

DigitalMachinist · November 28, 2018, 4:40pm

I'm just spending my first week or so with Kibana and trying to build dashboards to visualize performance indicators that my team is interested in tracking. At the moment, what I'd like to make a line chart time series tracking the number of first-time deposits made by users each week.

I have an index called "event-deposit-finished" that tracks completed deposit events as they occur, but for the sake of my testing right now I'm just working with historical data.

If I were querying this in SQL, I'd probably approach this in roughly the following manner:

Get a unique set of all users who have a completed deposit (event-deposit-finished index)
For each unique user, find their first finished deposit
Count the number of first-time deposits in each week
Plot the resultant series of values

My event-deposit-finished index includes a date field as well as the user ID that made the deposit.

Any ideas?

Edit:
Moving on from here, I'm also interested in a chart that can track the time between a user registering and their first completed deposit, as we'd like to minimize this metric. I have another index "event-user-registered" that I can get the date and user ID of user registrations from. I gather this might be something I have to use timelion for, but I'm only just beginning to scratch the surface of what timelion can do.

Brandon_Kobel · November 28, 2018, 5:28pm

Hey @DigitalMachinist, i assume the complexity that you're running into is how to determine whether or not a deposit is the very first one for a user at query-time?

DigitalMachinist · November 28, 2018, 6:31pm

@Brandon_Kobel Yeah, that's basically right. I just can't seem to understand how to apply the aggregations that I'd need to do this via the visualization tools.

Brandon_Kobel · November 28, 2018, 7:11pm

@DigitalMachinist, this is one of those situations where Elasticsearch differs from traditional SQL. Elasticsearch has really limited join based capabilities, as discussed here and we can use features like pipeline aggregations to fill in some of the gaps as well.

The other option we have is calculating some of this on ingest. How are you currently ingesting your data into Elasticsearch, are you using the ingest node or perhaps logstash?

DigitalMachinist · November 28, 2018, 7:17pm

At the moment we have a fairly naive indexing strategy. We're running a Laravel application in which we index documents into Elastic Cloud using Elasticsearch-PHP when certain events are triggered. I don't believe we have an ingest node configured, and we're not using logstash as of yet (although maybe in the future).

Brandon_Kobel · November 28, 2018, 7:32pm

Gotcha, if you can augment your ingest pipeline to determine whether an event is the first for a specific user it'll make creating the various visualizations inside of Kibana really easy. Otherwise, we're stuck trying to use the pipeline aggregations to try to calculate these, and there will likely be limitations to how we're able to present this data.

DigitalMachinist · November 28, 2018, 7:43pm

Thanks. I'll spend a bit of time reading about the ingest process and see if I can come up with an appropriate way to handle this at that time. If I add a pipeline/processor to handle this data on ingestion, I assume I'll have to reindex the appropriate data so it can be ingested/processed properly?

Is there a particular type of processor that I should look into for this kind of task?

Brandon_Kobel · November 28, 2018, 7:53pm

You will have to reindex your data, using Logstash and the elasticsearch filter should make this not too painful.

system · December 26, 2018, 7:53pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to (pre)filter data used in a visualization? Kibana	4	621	March 11, 2020
Information about users that will become paying Kibana	4	273	October 26, 2020
Time series graph that summarizes the index state for each time point Kibana vega	5	1092	February 7, 2022
How to visualise nested time series data Kibana	3	418	January 11, 2022
Aggregation of existing visualization Kibana	3	689	July 6, 2017

Visualizing time series counting first time a term appears in index

Related topics