Server Stability Issues

Jirato
DEV
Posts: 2728
Joined: Sat Apr 14, 2012 12:17 pm

Post by Jirato » Mon Feb 27, 2017 6:36 pm

Good evening,

I felt I needed to explain a bit about what's going on with CLOK, as it's been down for most of the day, and I find that unacceptable.
  • I received an email from our hosting provider at 10:10AM informing me that there was a critical kernel vulnerability affecting my server, and that they had installed the kernel update but needed me to reboot the server to apply it.
  • From work, around 10:12AM, I used my tablet to remote into the server, gracefully shut down the MUD engine, and rebooted the server. I immediately reconnected, ensured everything was running, and had the MUD engine, flash policy listener (for the HMUD web-based client), and Discord bot all back up around 10:40AM.
  • The new version of the CentOS kernel introduced a problem that caused the network interface controller (NIC) to randomly enter power-saving mode. At around 10:49AM, several players were disconnected from the server and were unable to access the MUD or website.
  • This continued to be an issue for the majority of the day, while I was unaware and busy at work. I had some downtime around 1:35PM and checked Discord, only to see talk about the server being down. I also received a call from Rias moments later telling me the same.
  • Upon investigation, I was able to connect at 1:50PM, noting that the server's official uptime was 9 minutes. I restarted the MUD engine at that time. I opened a ticket with the hosting provider and informed them of the unexpected downtime, and then proceeded with work.
  • While I was working, they sent me a reply stating that a technician had connected to the server and "gracefully" rebooted it (without properly shutting down the MUD engine) for the aforementioned security update, not bothering to check that I had already done it. They also informed me of the NIC issue and said that they had applied a fix, but that I would need to restart the server for it to take effect.
  • After I got back from dinner with my girlfriend, around 7:47PM, I checked Discord and saw that the MUD was down again. I logged in and checked the uptime to see that it had been up for about 3 and a half hours, meaning that the server was rebooted again sometime around 4:30PM this afternoon, but the MUD engine had not been started back up (it has to be started manually).
  • I currently have an open ticket with the hosting provider to investigate the second unexpected shutdown, though I'm betting it's going to be another case of them rebooting it "for" me, to fix the aforementioned NIC issue. Right now, everything is back up, and I'll continue to monitor it as closely as my time will allow tonight to ensure it doesn't go down again.
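For anyone curious how the reboot times above were worked out: the server only reports how long it has been up, so the boot time is simply the time of the check minus the uptime. Here is a minimal sketch of that arithmetic in Python. The function name `estimate_boot_time` is made up for illustration and is not part of anything running on the server, and the result is only as precise as the uptime reading.

```python
from datetime import datetime, timedelta


def estimate_boot_time(checked_at: datetime, uptime: timedelta) -> datetime:
    """Return the approximate moment the server last booted.

    Hypothetical helper: subtracts the reported uptime from the time
    the uptime was read.
    """
    return checked_at - uptime


# Figures from the 1:50PM check described above: uptime was 9 minutes.
checked_at = datetime(2017, 2, 27, 13, 50)
boot = estimate_boot_time(checked_at, timedelta(minutes=9))
print(boot.strftime("%H:%M"))  # 13:41
```

The same subtraction applied to the evening check (roughly 3 and a half hours of uptime) is how the late-afternoon reboot was estimated; since both inputs are approximate, so is the result.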
I deeply regret everyone's inability to play CLOK today. One of the things I take pride in is our outstanding uptime record. We frequently see 3000+ hours of MUD uptime and rarely need to reboot for any reason. When we do, it's usually only for 30-60 minutes.
[GMCHAT Uyoku]: Octum is when the octumbunny comes around and lays pumpkins everywhere right?
[GMCHAT Rias]: Dimmes says "oh hai :) u need healz? ill get u dont worry thaum lasers pew pew pew lol"
[CHAT - GameMaster Rias would totally nuke Rooks]: Here's how elemancy works: The freeblegreeble and the zippoflasm have to be combined with the correct ration of himbleplimp, then you add the gargenheimer and adjust the froopulon for the pattern you want, apply some tarratarrtarr, yibble the wantaban, and let 'er rip!


Re: Server Stability Issues

Post by Jirato » Mon Feb 27, 2017 8:59 pm

As compensation for the server issues earlier today, the threshold required to activate the population-based skillgain multiplier has been lowered from 10 to 3 (10% for each player over 3, up to 300%). This will remain in effect for the remainder of the week.
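To make the change concrete, here is a rough sketch of the arithmetic in Python. This is not the actual game code. It assumes the multiplier starts at 100% (normal skillgain), gains 10 percentage points per player over the threshold, and is capped at a total of 300%; the function and parameter names are made up for illustration.

```python
def skillgain_multiplier_percent(players_online: int,
                                 threshold: int = 3,
                                 step_percent: int = 10,
                                 cap_percent: int = 300) -> int:
    """Return the skillgain multiplier as a whole-number percentage.

    Assumption: 100 means normal skillgain, and the cap applies to the
    total multiplier rather than to the bonus alone.
    """
    # Only players beyond the threshold contribute to the bonus.
    extra_players = max(0, players_online - threshold)
    return min(cap_percent, 100 + step_percent * extra_players)


# With 8 players online:
print(skillgain_multiplier_percent(8))                # 150 (new threshold of 3)
print(skillgain_multiplier_percent(8, threshold=10))  # 100 (old threshold of 10)
```

Under these assumptions, 8 players online would give 150% this week, where the old threshold of 10 would have given no bonus at all.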
