Incident description

The WordPress site (wordpress1.geant.org) became unreachable at 12:10 CET.

Incident severity: CRITICAL

Data loss: NO

Timeline

Time (CET)

12:10

The Apache server stopped accepting incoming requests.

12:12

Chris Atherton reported on the #it channel that the site aac-project.eu was not working correctly.

12:21

Konstantin Lepikhov confirmed the issue with the wordpress1 site on the #devops channel.

12:23

Dick Visser connected to the VM via console and confirmed that the network was down (router not reachable).

12:29

Massimiliano Adamo restarted the network service inside the VM, after which the network link came up and everything started working again.

12:30

Konstantin Lepikhov announced that the problem was fixed.

Total downtime: 20 minutes.

Current situation

We are currently investigating the nature of the issue. It could be either a VMware network adapter issue or something related to the network configuration in the VMware cluster.

Monitoring alerted: YES