Constant CPU spikes by Rocket.Chat on Kubernetes

jordan808 · May 21, 2020, 8:45pm

Description

Over the past couple of weeks we have been seeing massive CPU spikes every 20-30 minutes, and the problem keeps getting worse. The spikes do not correlate with any usage metrics or external bandwidth. They also seem to cause our Apps from the Marketplace to be disabled repeatedly.

Server Setup Information

Version of Rocket.Chat Server: 3.0.9
Operating System: Container-Optimized OS
Kubernetes: GKE master version 1.15.9-gke.24
Deployment Method: Official Helm chart, Official Docker image, GitLab CI/CD deployment
Number of Running Instances: 4 Rocket.Chat instances, 1 large one with ~1000 concurrent users each day
DB Replicaset Oplog: Enabled. We use MongoDB Atlas.
NodeJS Version: v12.14.0
MongoDB Version: 4.2.6
Proxy: Cloudflare, nginx ingress
Firewalls involved: Cloudflare firewall

Any additional Information

For our Kubernetes Helm chart we have:

  requests_memory = "512Mi"
  requests_cpu    = "300m"
  limits_memory   = "2048Mi"
  limits_cpu      = "1500m"

We also have horizontal pod autoscaling (HPA) enabled, and it scales out to the maximum of 15 pods every time there is a spike. I believe the spikes are caused by the database migration that is run each time a new pod is created. The migration eats up much of the CPU, which creates a feedback loop: high usage triggers more pods, and the new pods push usage even higher.
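To illustrate why the autoscaler can jump straight to the maximum, here is a rough sketch of the replica calculation described in the Kubernetes HPA documentation: desired replicas = ceil(current replicas x current utilization / target utilization), where CPU utilization is measured against the pod's CPU request, not its limit. The 80% target is an assumption (our actual HPA target is not listed above), and the function name is made up for illustration. The sketch also ignores the HPA's tolerance band and its handling of pods that are not yet ready.

```python
import math

def desired_replicas(current_replicas, avg_cpu_millicores, request_millicores,
                     target_utilization_pct, max_replicas):
    """Approximate the HPA replica calculation for a CPU utilization target."""
    # Utilization is expressed as a percentage of the CPU *request*.
    utilization_pct = 100.0 * avg_cpu_millicores / request_millicores
    desired = math.ceil(current_replicas * utilization_pct / target_utilization_pct)
    # The HPA never goes below 1 replica or above the configured maximum.
    return min(max(desired, 1), max_replicas)

# 4 pods bursting to the 1500m limit against a 300m request,
# with an assumed (hypothetical) 80% utilization target and max of 15 pods.
print(desired_replicas(4, 1500, 300, 80, 15))  # 15
```

With a 300m request and a 1500m limit, a pod running at its limit reports 500% utilization, so the uncapped result would be 25 replicas and the HPA lands on the 15-pod maximum in a single step.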

Another possible cause is the Apps, which use a lot of CPU when they are uninstalled or reinstalled, to the point of crashing the system. It may be that the Apps somehow eat up CPU whenever new pods are created, causing a spike.

Is there a way to turn off this migration? Or is there a Helm configuration that would give us the right combination of CPU requests and HPA percentage threshold? Any help figuring out the cause would be much appreciated. Let me know if more information is needed.
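As a starting point for comparing combinations, the snippet below prints the average per-pod CPU usage at which a scale-up would begin for a few request and target values. Only the 300m request and 1500m limit come from our chart; the other request values and both target percentages are hypothetical, and the helper name is made up for illustration.

```python
def scale_up_threshold_millicores(request_millicores, target_utilization_pct):
    """Average per-pod CPU usage above which the HPA starts adding pods."""
    return request_millicores * target_utilization_pct / 100

# 300m is our current request; 750m and 1500m are hypothetical alternatives.
for request in (300, 750, 1500):
    # Both target percentages are assumptions for comparison only.
    for target in (50, 80):
        threshold = scale_up_threshold_millicores(request, target)
        print(f"request={request}m target={target}% -> scale-up above {threshold:.0f}m")
```

The closer the request is to what a pod actually uses during startup, the less likely a startup burst alone is to push utilization over the target.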

