Aggregation by time messes up data, need to rethink about a whole new solution

odede · February 6, 2019, 5:06pm

Hi Everyone,

My colleagues and I have run into a problem with one of our analytics data sets.

We are open to any suggestion, even if it means changing the way we collect the data or the database we use.

The case:

Our users create hundreds of EC2 instances per day, across many regions, zones, instance types, etc.

We have an analytics task, written in Python, that collects the number of EC2 instances every 15 minutes, broken down by many types/attributes.

Collected record example:

We store these records in InfluxDB, which of course assigns a timestamp to every record.

Grafana queries the data from InfluxDB and aggregates the values (number of instances) by attributes such as 'User', and by time interval (using the timestamp created by InfluxDB).
Query example:
SELECT sum("count") FROM "system_usage_table" WHERE $timeFilter GROUP BY time($interval), "zone" fill(null)

The problem:

The default aggregation by time can show wrong data. For example, when the time interval is 30 minutes, two records like the one above (stored in InfluxDB with different timestamps) are summed together, so the graph shows user Dave with 16 machines in that time range. That is not true: he never had more than 8.

On top of that, the records sometimes arrive at times that are not exactly 15 minutes apart, so even if I aggregate with the same 15-minute interval the scheduled task runs at, I still get wrong data.
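To make the effect concrete, here is a minimal Python sketch of what time bucketing does to such records. The sample timestamps, the `bucket_start` and `aggregate` helpers, and the simplified (timestamp, user, count) record shape are assumptions made for illustration; they are not taken from the real data, and the bucketing only approximates InfluxDB's `GROUP BY time()`.

```python
from collections import defaultdict
from datetime import datetime

def bucket_start(ts, minutes):
    """Floor a timestamp to the start of its bucket (assumes minutes divides 60)."""
    return ts.replace(minute=ts.minute - ts.minute % minutes, second=0, microsecond=0)

def aggregate(samples, minutes, reducer):
    """Group (timestamp, user, count) samples by bucket and user, then reduce."""
    buckets = defaultdict(list)
    for ts, user, count in samples:
        buckets[(bucket_start(ts, minutes), user)].append(count)
    return {key: reducer(values) for key, values in buckets.items()}

# Hypothetical data: two consecutive 15-minute runs, Dave has 8 instances in both.
on_time = [
    (datetime(2019, 2, 6, 10, 0), "Dave", 8),
    (datetime(2019, 2, 6, 10, 15), "Dave", 8),
]
# Same runs, but the second one is written slightly early (timing jitter).
jittered = [
    (datetime(2019, 2, 6, 10, 1), "Dave", 8),
    (datetime(2019, 2, 6, 10, 14), "Dave", 8),
]

key = (datetime(2019, 2, 6, 10, 0), "Dave")
summed_30 = aggregate(on_time, 30, sum)[key]                            # both runs added
averaged_30 = aggregate(on_time, 30, lambda v: sum(v) / len(v))[key]    # runs averaged
summed_15_jitter = aggregate(jittered, 15, sum)[key]                    # both land in one bucket
```

With these made-up samples, the 30-minute sum counts Dave's 8 instances twice, and with jitter the same double counting happens even at a 15-minute interval, because both writes fall into the 10:00 bucket.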

My naive suggestion:

Move to an SQL database (PostgreSQL or MySQL) and add a new field to each record, 'iteration', which represents the run number of the analytics task. Then, in Grafana, aggregate by iteration and by any other attribute added to the query, and present (not aggregate) the data on a timeline.
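As a rough sketch of that idea, the Python below groups records by a run number instead of by a time bucket. The record layout, the sample values, and the `per_iteration_totals` helper are hypothetical; the choice to plot each run at its earliest timestamp is also just one possible convention.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical records: (iteration, timestamp, user, zone, count).
records = [
    (1, datetime(2019, 2, 6, 10, 0, 5), "Dave", "us-east-1a", 5),
    (1, datetime(2019, 2, 6, 10, 0, 9), "Dave", "us-east-1b", 3),
    (2, datetime(2019, 2, 6, 10, 14, 58), "Dave", "us-east-1a", 5),
    (2, datetime(2019, 2, 6, 10, 15, 2), "Dave", "us-east-1b", 3),
]

def per_iteration_totals(records):
    """Sum counts per (iteration, user); place each run at its earliest timestamp."""
    totals = defaultdict(int)
    first_seen = {}
    for iteration, ts, user, _zone, count in records:
        totals[(iteration, user)] += count
        first_seen[iteration] = min(ts, first_seen.get(iteration, ts))
    return [
        (first_seen[iteration], user, total)
        for (iteration, user), total in sorted(totals.items())
    ]

points = per_iteration_totals(records)
```

Because the grouping key is the run number, a run whose records straddle a bucket boundary (like run 2 here) still yields a single total per user.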

Questions:

Do you think I can find a solution for this with our current InfluxDB and Grafana setup?

If not, would my suggestion work?

Has anybody else encountered the same problem?

Thanks for your help,

Oded

jangaraj · February 6, 2019, 8:11pm

It looks like a wrong aggregation in your InfluxDB query, but nobody will be able to help because you didn’t provide the query you used.

odede · February 7, 2019, 3:48pm

Thanks.
I added an example; I hope it clarifies the problem.

jangaraj · February 7, 2019, 5:31pm

My guess is that you want to see the average (MEAN()) value per period, not the sum (SUM()):

SELECT MEAN("count")
FROM "system_usage_table"
WHERE $timeFilter
GROUP BY time($interval), "zone" fill(null)

See doc: https://docs.influxdata.com/influxdb/v1.7/query_language/functions/
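One caveat worth checking, sketched in Python under assumptions: if each run writes several records per zone (for example one per user), averaging the raw records gives the mean record value rather than the zone total. A two-stage aggregation, summing within each run first and then averaging those totals over the display interval, avoids that; in InfluxQL this might be expressed with a subquery. The rows and the `floor_to` and `two_stage_mean` helpers below are hypothetical, and stage 1 still assumes each run's records fall inside a single 15-minute bucket.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Hypothetical rows: (timestamp, zone, user, count), two users per run.
rows = [
    (datetime(2019, 2, 6, 10, 0), "us-east-1a", "Dave", 5),
    (datetime(2019, 2, 6, 10, 0), "us-east-1a", "Alice", 3),
    (datetime(2019, 2, 6, 10, 15), "us-east-1a", "Dave", 5),
    (datetime(2019, 2, 6, 10, 15), "us-east-1a", "Alice", 3),
]

def floor_to(ts, minutes):
    """Floor a timestamp to its bucket start (assumes minutes divides 60)."""
    return ts.replace(minute=ts.minute - ts.minute % minutes, second=0, microsecond=0)

def two_stage_mean(rows, run_minutes=15, display_minutes=30):
    # Stage 1: total per zone within each collection run.
    per_run = defaultdict(int)
    for ts, zone, _user, count in rows:
        per_run[(floor_to(ts, run_minutes), zone)] += count
    # Stage 2: average the per-run totals within each display bucket.
    per_bucket = defaultdict(list)
    for (run_ts, zone), total in per_run.items():
        per_bucket[(floor_to(run_ts, display_minutes), zone)].append(total)
    return {key: mean(values) for key, values in per_bucket.items()}

raw_mean = mean(count for _ts, _zone, _user, count in rows)   # averages single records
zone_mean = two_stage_mean(rows)[(datetime(2019, 2, 6, 10, 0), "us-east-1a")]
```

With these made-up rows, the zone holds 8 instances in both runs; the raw mean reports 4, while the two-stage result reports 8.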
