Reddthat Announcements

694 readers

1 users here now

Main Announcements related to Reddthat.

For all support relating to Reddthat, please go to !community@reddthat.com

founded 1 year ago

MODERATORS

ticoombs@reddthat.com

reddthat@reddthat.com

example@reddthat.com

Outage: Feb 14 04:39 to 06:35 ~2 Hours (reddthat.com)

submitted 9 months ago* (last edited 9 months ago) by ticoombs@reddthat.com to c/reddthat@reddthat.com

2 comments fedilink hide all child comments

Unfortunately we had a Valentine's Day outage of around 2 hours.

Incident Timeline: (times in UTC)

04:39 - Our monitoring sees a 50x error.
04:41 - I am alerted via email & phone.
04:48 - I acknowledge the incident and start investigating
04:50 - I cannot access the VM via SSH. I issue a reboot via our control panel.
04:54 - Our server has a load of 12 and an 57% of all IO operations are IOWait.
05:30 - I issue another reboot and can't seem to figure out what's wrong
05:58 - I lodge a ticket with our provider to check the host, and to power off and on again as we still have huge IOWait values, and 100% Memory usage.
06:30 - hosting company hasn't got back to me and I start investigating by rolling back the latest configuration changes I've done & reboot.
06:35 - sites are back online.

Resolution

Latest change included turning on huge pages with a value of 100MB to allow postgres to get some performance gains.
This change was done on Monday morning and I had planned to do a power cycle this week to confirm everything was on the up-and-up. Turns out my host did that for me.

The outage lasted longer than it should have due to some $job and $life.

Until next time,
Cheers,
Tiff

you are viewing a single comment's thread
view the rest of the comments

[–] doctortofu@reddthat.com 2 points 9 months ago (1 children)

Still love ya though - partially thanks to all the transparency and keeping us updated. Thanks! ❤

[–] ticoombs@reddthat.com 1 points 9 months ago

❤️