UI Slowness, Nodes temporarily showing Offline, or Instances temporarily showing Error

Scenarios

Loading the UI seems to be very slow
Node status all seem to be delayed or alternating between different states randomly
Instance states all seem to be delayed or alternating between different states randomly

Confirm an average_queue_time of > 5 when curling the controller's api/v1/stats endpoint

Common reasons

Lots of VMS, Nodes, and not enough load balancing/instances of the controller, etcd, etc.
ETCD is very sensitive to disk latency; not using SSDs for the etcd storage

Solutions:

Increase the space quota from the default 2GB: [https://etcd.io/docs/current/op-guide/maintenance/#space-quota]
Rejoin your nodes with a higher --heartbeat value (> 20s)
Upgrade the host's disk to an SSD or faster disk

State changes can also be caused by network issues between the Node and the controller. Check the node's /var/log/veertu/anka_agent.ERROR log to confirm you're not seeing timeouts or connection errors.

Still experiencing problems?

Talk to us! we are available via slack or email

Feedback

Was this page helpful?

Glad to hear it! Please tell us how we can improve.

Sorry to hear that. Please tell us how we can improve.

Last modified March 25, 2021 : better UI strangeness guide (aae2bde) by Nathan Pierce