Performance Degradation During Load Testing with Resource-Constrained Tasks (Playwright & Fargate) #2755
Replies: 1 comment 7 replies
-
Resource contention inside the same Fargate task can definitely start happening when the number of concurrent VUs running in that task crosses a threshold. The exact number depends on the app you're testing and what the tests themselves are doing. Are you running your tests on tasks with 16 vCPUs right now? How much memory are you allocating to those tasks? I'd normally expect more than 4 concurrent VUs to run per task, unless the app is unusually memory-heavy or CPU-intensive. From the cost perspective, if you were to allocate 4k vCPUs and 4k GB of RAM, you'd be looking at: 4000 * ($0.012144 + $0.0013335) = $53.91/hour (numbers from https://aws.amazon.com/fargate/pricing/) One way to reduce the amount of resources each VU takes is to not set |
Beta Was this translation helpful? Give feedback.
-
We are conducting a stepped load test using Artillery to simulate approximately 4,000 concurrent users. The test configuration involves ramping up from 1 to 10 virtual users (VUs) in 10-minute steps, using the largest ECS task configuration.
However, we've noticed that as the number of VUs scales beyond about 4, we start to see performance degradation in the metrics being monitored. We suspect that this degradation is due to resource constraints, as each task has limited resources available.
One potential solution we've considered is to create 4,000 individual tasks, each with 1 vCPU allocated. However, this approach becomes expensive and resource-intensive from an infrastructure perspective. It also would mean we would be unable to ramp up the load - @hassy is there a workaround for this, or is there are any relevant features or capabilities within Artillery that could help address this problem, we would appreciate if you could highlight them.
We would appreciate any guidance or recommendations from the Artillery team and community on how to best handle this issue. Specifically, we're looking for strategies or techniques to effectively simulate many resource-constrained users without compromising the accuracy of the load test or incurring significant infrastructure costs.
Thank you in advance for your assistance! :)
Beta Was this translation helpful? Give feedback.
All reactions