NTS Forums

Please login or register.

Login with username, password and session length
 

News:

Welcome to the Newtek Technology Services Forum!


Author Topic: Outage this morning  (Read 36204 times)

Derek [CT]

  • Guest
Outage this morning
« on: August 19, 2009, 08:36:48 AM »
Had a hiccup this morning that affected a large number of dedicated servers and many shared servers as well. More info is coming - we'll send out a notice to affected customers as well, with more details. Things should be coming back online now - total time of the  issue was about 20 minutes.  

UPDATE: One of the UPSs lost power, which caused the outage. Several segments were affected across the board (shared and dedicated, mail, database, etc.) including our phone server, which is why calls are slow getting connected.

Power was restored quickly but servers are coming online slowly as servers tend to do. We're working as fast as we can to get all servers back online...more info as it becomes available.

UPDATE 2: As mentioned, power was restored but we're still seeing issues with some servers recovering from the immediate shut down. A few database servers are being worked on, and there are some dedicated and VPS boxes that are being addressed as well. AND I renamed this forum topic so as to not appear to belittle the severity of the problem.

UPDATE 3: All but about a 5 dedicated servers are back online. Those 5 are getting the full attention of our Network Admins, so they should be back up shortly. No restoration of data was required. We are still working with our datacenter provider on what exactly occurred. I've been here 8 years and this is the only power-related issue I know of, so it's not a common issue.
« Last Edit: August 19, 2009, 11:59:21 AM by Derek [CT] »

Offline ak732

  • Happily Verbose
  • Hero Member
  • *****
  • Posts: 1,512
  • Karma: +219/-39
    • Agile Web Technologies
Re: Hiccup this morning
« Reply #1 on: August 19, 2009, 08:40:34 AM »
Thanks for the info.  Was quite a hiccup.  The CT site and forums were down as well, at least from NJ and FL.  What was the problem?
Andy

Offline ak732

  • Happily Verbose
  • Hero Member
  • *****
  • Posts: 1,512
  • Karma: +219/-39
    • Agile Web Technologies
Re: Hiccup this morning
« Reply #2 on: August 19, 2009, 08:44:17 AM »
My client's sites seem to be back up.  But my site is still down.
Andy

Offline justin.b

  • Hosting Newbie
  • *
  • Posts: 12
  • Karma: +0/-0
Re: Hiccup this morning
« Reply #3 on: August 19, 2009, 08:45:22 AM »
I've still got a server down in the dedE segment - have submitted a support ticket just so its logged.

Offline Toddski07

  • Full Member
  • ***
  • Posts: 732
  • Karma: +13/-13
    • http://www.brightsideinteractive.com/
Re: Hiccup this morning
« Reply #4 on: August 19, 2009, 08:45:41 AM »
Yep...this stinks...all mail services are not working for my clients and Crystaltech says they don't have an ETA on when it will be working.  A little bit of info would help even if you are still working on the problem and don't exactly know when it will be up again.  Plus...it took several tries to get through the phone service only to learn that there is no additional info.


Offline cdeelsny

  • Hosting Newbie
  • *
  • Posts: 38
  • Karma: +0/-0
Re: Hiccup this morning
« Reply #5 on: August 19, 2009, 08:49:04 AM »
Sorry, a hiccup is not 20 minutes and to this degree.

Offline Astralis

  • Full Member
  • ***
  • Posts: 438
  • Karma: +9/-4
Re: Hiccup this morning
« Reply #6 on: August 19, 2009, 08:51:52 AM »
This is the second time this has happened in a couple of months (not counting the Comcast issue, of course).

My sites are still down.

And like the last time, your phones are also offline.

Now that I read the forum, I won't call again nor submit a ticket.
« Last Edit: August 19, 2009, 08:55:28 AM by Astralis »

Offline dani_17

  • Hosting Newbie
  • *
  • Posts: 5
  • Karma: +0/-0
Re: Hiccup this morning
« Reply #7 on: August 19, 2009, 08:53:59 AM »
For what I see it was a power related issue, since our dedicated servers were rebooted. I hope you guys can fix it as soon as possible. I didn't even call or send a ticket, as this is a general issue CT is well aware of, plus, they're most probably have their phone lines busy and ticket system flooded.

Derek [CT]

  • Guest
Re: Hiccup this morning
« Reply #8 on: August 19, 2009, 08:55:45 AM »
Looks like a UPS lost power, which in turn caused the outage. It's back online now. Servers are coming back online as well, but some servers are taking longer than other. NOC and Server Ops Teams are working as quickly as possible to get servers online.

Still looking at how and why we had an issue with the UPS.

« Last Edit: August 19, 2009, 09:13:18 AM by Derek [CT] »

Offline Corobori

  • Hero Member
  • *****
  • Posts: 2,587
  • Karma: +43/-11
    • Corobori Sistemas
Re: Hiccup this morning
« Reply #9 on: August 19, 2009, 08:57:16 AM »
On my side the websites with problems appear to be back online

Offline jamesedmunds

  • Full Member
  • ***
  • Posts: 680
  • Karma: +21/-0
    • http://jamesedmunds.com/poorclio
Re: Hiccup this morning
« Reply #10 on: August 19, 2009, 08:59:07 AM »
Thanks for the update, Derek. About half of my two dozen or so sites are still down so I'll keep checking.

Also, can't connect to one or two MySQL servers through data administrator, presumably this is the same issue.

-James
James Edmunds
New Iberia, LA

Offline kurogers

  • Hosting Newbie
  • *
  • Posts: 3
  • Karma: +0/-0
Re: Hiccup this morning
« Reply #11 on: August 19, 2009, 09:00:50 AM »
I think hiccup is an understatement.  20 Minutes is a lifetime when your customers can not reach you by web, email or phone.  I've have used your hosting service for many years and have recommended customers to CrystalTech (as recent as today actually).   Whatever the cause, it's a hot mess (hot mess > hiccup).


Offline HotBottleGuy

  • Hosting Newbie
  • *
  • Posts: 5
  • Karma: +1/-0
Re: Hiccup this morning
« Reply #12 on: August 19, 2009, 09:11:56 AM »
 :o :o :o ??? ??? ::) ::) >:( >:( :o :o :o ??? ???

Uh, pardon me... This was not a hiccup. This was full-on projectile vomiting. Some shared sites are still not up.

I don't understand how "UPS lost power" could cause an outage. I thought CT had bulletproof power backup capabilities. Guess not. You should have your prez redo the video to say that CT has total redundancy--oops, well except for power...

I love CT, don't get me wrong but someone's head should roll for this one.

Offline HotBottleGuy

  • Hosting Newbie
  • *
  • Posts: 5
  • Karma: +1/-0
Re: Hiccup this morning
« Reply #13 on: August 19, 2009, 09:16:28 AM »
At 9:15am all my sites were back up. Not good guys. Fix this please.

Offline ak732

  • Happily Verbose
  • Hero Member
  • *****
  • Posts: 1,512
  • Karma: +219/-39
    • Agile Web Technologies
Re: Hiccup this morning
« Reply #14 on: August 19, 2009, 09:19:33 AM »
While I agree that "hiccup" is a bit of a euphemism in this case, I think Derek's choice of terms is far outweighed by the goodness that is CT having this forum and posting about the problem quickly.  Given the choice between CT employees understandably wanting to employ a euphemism and not having them post anything at all, I'll live with the euphemism.

While I can easily overlook the terminology, I'm pretty curious about why a single failed UPS took down so many servers.  That seems kind of like a design flaw to me.  If I'm going to bitch about anything, it would be that.
Andy