Yeah, I understand that @bbarin, but I tried locally and I can’t make it leak - it likely depends on your usage.
Can you share a snippet of code that represents your usage? A full-blown script that leaks and uses https://httpbin.test.k6.io would be best, but even “this is the code we had, with some name changes, and moving to this resolved the leak” will also be very useful.
The code in jslib is actually pulled from a fairly old version of core-js, but I can’t find any suggestion that older versions were leaky :(.
Do you still see the memory growing with this script and --no-thresholds --no-summary?
Do you use any outputs?
Also, I guess you run this through some build step, as optional chaining (?.) currently doesn’t work very well in k6. Can you provide the final script? Maybe in a gist if it’s huge.
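For illustration, a minimal skeleton of what such a reproduction could look like - the endpoint is the httpbin.test.k6.io instance mentioned above, and the body is a placeholder, not your actual code:

```javascript
import http from 'k6/http';
import { check, sleep } from 'k6';

export const options = {
  vus: 100,        // scale towards the VU count where you see the growth
  duration: '30m',
};

export default function () {
  // Replace this body with the code that actually leaks for you.
  const res = http.get('https://httpbin.test.k6.io/get');
  check(res, { 'status is 200': (r) => r.status === 200 });
  sleep(1);
}
```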
@mstoykov, yep, we are experiencing the memory leak even with --no-thresholds --no-summary.
We have also tested without the check and the results are the same.
My k6 gets to around 5.3GB (after a minute or two to stabilize) and then, over 1 hour, inches very slowly towards 6GB. And that is actually the highest it has gone on any run; the rest were closer to 5.8GB.
There might be a leak, but I mostly expect the growth is due to the run being CPU bound and the GC just not being able to keep up.
Your graphs looked a lot steeper.
I also did some quick profiling and … nothing. Between heap dumps things go up and down, but ultimately that is normal. Sometimes there is more http2 machinery still in memory, sometimes more k6 internals, sometimes more JS objects, but ultimately the memory seems to get reused. Actually, between most of my heap dumps it looked like the amount of stuff in memory went down after the first few minutes.
Ultimately, I think that in my case the growing memory is due to 100% CPU usage and the GC just not keeping up every once in a while.
@mstoykov did you run with 8640 VUs? My machine is not able to run it; we run it on a server with 16 cores and 64 GB of RAM. Small numbers of VUs don’t reveal the memory leak - perhaps the object allocation rate is so high that the GC cannot keep up. I wonder how the docs can say a single machine handles ~30k VUs (with bigger resources, of course), yet somehow I don’t see us getting anywhere close to those numbers.
From my local testing, at least with this script on my laptop, the JSON output has problems writing out the huge amount of metrics the script is generating. Which basically means they keep piling up in the process because the JSON output can’t write them out fast enough.
What is your RPS in your real case? And does removing -o json help there?
The real scenario is around 1M rpm. Yes, removing the output helps keep the memory under control, but having no metrics is not an option for now, as we have no other way to check them (error rate, throughput, etc.).
The JSON and CSV outputs aren’t really … great to begin with, and at ~17k RPS I see no way they will be able to keep up without some kind of aggregation.
I would recommend one of the following:
fork the JSON output and try to make it work for you. You can probably keep only http_req_duration and skip all other metrics, which I would think will fix the issue for you. You can also just write to some other format - JSON isn’t known for its encode/decode speed. (There is also a small script-side sketch after this list.)
you can probably use the cloud output and build something that receives metrics from it. It at least supports HTTP request metric compression, which is specific to that output, and you will need to work with it. It is also likely that we will change the format at some point, so you might want to keep that in mind.
Given the above - the k6 cloud will also handle this kind of load ;).
Spreading this across multiple machines is also possible through direct use of execution segments, but then you will need to merge the JSON files at the end, so I’m not really certain this will work great. k6-operator can also be looked at, but it will also need the outputs merged at the end.
Try a different output … I would expect a telegraf+statsd combo like the one I have some configs for here might be able to handle 17k RPS.
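On top of the options above, and purely as a hedged script-side sketch (not something that will fix a 17k RPS output on its own): trimming the system tags reduces how much data every single metric sample carries, which lowers both the encoding work and the output volume. systemTags is a standard k6 option; this particular tag list is just an example, not a recommendation for your workload:

```javascript
import http from 'k6/http';

export const options = {
  // Keep only the tags you actually query on; every dropped tag shrinks
  // each line the JSON output has to serialize.
  systemTags: ['status', 'method', 'name'],
  discardResponseBodies: true,
};

export default function () {
  http.get('https://httpbin.test.k6.io/get');
}
```

It is cheap to try, but whether it is enough at your rate is a separate question.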
We are currently using the Kafka output, and the result is quite similar to JSON. I believe the problem is related to the unbounded nature of the channel that receives the metrics: as the CPU comes under pressure, the data waiting to be sent to the output starts to pile up.
We took the approach of scaling horizontally and reducing the VUs and throughput of each individual pod.
Thank you very much for your support! It was very much appreciated!
@bbarin I have made a PR with some optimizations for the JSON output. It likely still won’t manage this rate, but you can try it by building k6 from source.
If you do, please write back with how much better it actually behaves in your real-case scenario.
I’m testing a k6 script with 2600 concurrent users (VUs) running for 3600s on a machine with 15 GB of RAM, but after about 20 minutes it runs out of memory.
I used discardResponseBodies; my flow has about 30 API calls and uses the Gauge, Counter, Trend and Rate metrics for custom reporting.
Is there any way to reduce the CPU usage?
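Not knowing the actual script, here is a hedged sketch of a couple of patterns that usually keep memory and CPU down in this kind of setup: custom metrics created once in the init context, response bodies discarded globally, and dynamic URLs grouped under a single name tag so each unique URL doesn’t become its own time series. All names below are placeholders, not real endpoints or metrics:

```javascript
import http from 'k6/http';
import { Trend, Counter } from 'k6/metrics';

export const options = {
  discardResponseBodies: true, // keep bodies out of memory unless a check needs them
  vus: 2600,
  duration: '3600s',
};

// Custom metrics belong in the init context, created once, not inside the iteration.
const orderDuration = new Trend('order_duration', true); // hypothetical metric
const orderErrors = new Counter('order_errors');         // hypothetical metric

export default function () {
  // Group dynamic URLs under one `name` tag so each id doesn't create a new time series.
  const res = http.get(`https://test.k6.io/orders/${__VU}`, {
    tags: { name: 'GET /orders/{id}' },
  });
  orderDuration.add(res.timings.duration);
  if (res.status !== 200) {
    orderErrors.add(1);
  }
}
```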