Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Incident Description

Dashboard stopped processing traps from Sunday 08:37 UTC until Monday approximately 08:10 UTC

The impact of this service degradation was:

  • The production dashboard stopped processing traps and therefore no alarms were presented to NOC.
  • No alarms or notifications were generated during the outage period.


Incident severity: 

Status
colourRed
titleCritical
 Complete service outage

Data loss:

Status
colourRed
titleYes

Total duration of incident: 1 day


Timeline

All times are in UTC

DateTimeDescription

 

08:36

LON/AMS ... etc

 

08:37rmq partition, correlator stopped proce3ssing

  

time ...

notifications

  

time

oc/josep ... etc (todo)

 

timeerik restarted rmq (todo)

Proposed Solution

  • ....