Incident description

ProxySQL keeps crashing and reloading every 5 minutes and client connections keep being dropped continuously.

Incident Severity: CRITICAL

Data Loss: YES

Timeline

Time (UTC)
10 July 2021 7:13an upgrade from version 2.0.7 to version 2.1.1 was triggered.
10 July 2018 11:00we started receiving notifications on Telegram about CAMS not being available.
10 July 2018 17:00Massimiliano Adamo identified an issue with ProxySQL. PRoxySQL was reloaded and logging was increased.
12 July 2018 10:30

Bjarke Madsen and Robert Latta sent a message to Massimiliano Adamo , to notify an issue with the Web UI of BRIAN.

12 July 2018 13:30ProxySQL was pinned to an older version

Total Downtime: it was an intermittent downtime of ~  48 hours

Details of Solution

A parameter in the Puppet module defaults the version number to 'latest': https://gitlab.com/maxadamo/galera_proxysql/-/blob/master/manifests/params.pp#L54

I have enabled version and version pinning in puppet to avoid future updates.

Future mitigation

In general, it's a good practice to always pin critical applications.


  • No labels