![]() |
|
|||||||
| Network / Server Status Please check often for network / Server updates here! |
![]() |
|
|
Thread Tools | Display Modes |
|
#1
|
|||||||||||
|
|||||||||||
|
2003-08-23 unix5.hsphere.cc
Unix5 went down around 05:50 LT. I'm downtown and attempting to boot it right about now.
Service was restored around 06:45 LT. Last edited by alexc : 08-23-2003 at 07:31 AM. |
|
#2
|
|||||||||||
|
|||||||||||
|
Thanks Alex, does it normally take 55 minutes for a unix box to reboot ? I'm sure my Windows ones don't take that long
![]() |
|
#3
|
||||
|
||||
|
I suppose legwork etc included
I don't think ANY machine takes that long to boot. But then it certainly throws some light on how long b4 a reboot is decided. I think this should be much faster.
Other factors not considered, I would go for reboot by default and cause minimal disprution rather than opting for longer fault finding exercise. If reboot doesn't solve problem, then heavy analysis can follow. this way we may avoid lengthy investigation, when a short reboot suffices. Just my suggestion, which i consider worh considering. I dunno if you can innitiate a reboot from remote location?
__________________
::: Ashvin ::: www.klas5.net ~ Learn about psyllium husk and how it can help your colon health Excuse the misspellings, two-finger typist at work and no help from antique-ed keyboard! |
|
#4
|
|||||||||||
|
|||||||||||
|
Yes we can do reboots from a remote location easily, that is normally what is done FIRST, and then if things don't work the techs continue on with a fault finding "mission" and repair it, I do not know what the issue was at this time so I have no more details on that right now.
|
|
#5
|
||||
|
||||
|
That expalins it for now
Surely that explains the 55 minutes taken to get the machine reboot. So much for our hot-trail to fault finding. keep up the good work guys, and thanks again.
I suggest support setup a template with log of steps to be taken. This way, support just has to fill in time and comments (also serves as a checklist) so we can put these actions into perspective and not go wondering into fault-finding missions of our own. Again just a suggestion to making tech-support more transparent. It goes something like : 18:00:05 GMT fault detected 18:00:50 1-sound alarm bells 18:00:55 2-dump coffee and get on it 18:00:59 3.......5 reviving measures 19:00:59 9OKAY remote reboot 19:02:45 10Reboot not working 19:03:00 11Despatch personell ALEX to site ........so no and so on like those log thingy stuff. Maybe some handheld application to assist in logging process, post-mortem really good like this.....I am in fanatasy mode. Keep this in mind for future huh? Good going
__________________
::: Ashvin ::: www.klas5.net ~ Learn about psyllium husk and how it can help your colon health Excuse the misspellings, two-finger typist at work and no help from antique-ed keyboard! |
|
#6
|
|||||||||||
|
|||||||||||
|
Quote:
Personally I'd rather have a machine take a little longer to fix properly, than have the tech have to drive back to the datacentre becuase it's fallen over again 5 minutes after they left the building which just wastes more time. |
|
#7
|
|||||||||||
|
|||||||||||
|
In general it takes 15-20 minutes of downtime for someone to decide that it isn't going to recover nearly as spontaneously as it crashed (no such luck), take the datacentre key off the hook and make their way downtown. You have to allow that much time for the box to recover normally from a crash and do its filesystem check and all that.
Unix 5 didn't actually go down as far as I can tell but was slowly dying until I took it down. Had to cold boot it in the end, which is pretty much the last thing we try. And physically pulling the plug is not something you can do remotely without a degree in advanced telekinetics, so a corporeal entity has to be present to do the job. We're working on that too though. ![]() |
|
#8
|
||||||||||||
|
||||||||||||
|
You don't have remote reboot switches?
|
|
#9
|
|||||||||||
|
|||||||||||
|
Sure we do. As I said though, in this case it needed to be booted as cold as a cold boot can be. I ended up having to physically disconnect the power supply before it would boot up all the way and start services. No sitting comfortably in the office pushing RPC buttons for this one...
|
|
#10
|
|||||||||||
|
|||||||||||
|
Getting out of the office and seeing sunlight wouldn't have helped your daemon complextion at all Alex!
|
![]() |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | |
| Display Modes | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| IP migration: unix5 | alexc | News and Announcements | 0 | 04-27-2004 09:16 PM |
| 2003-12-11 cp.hsphere.cc | alexc | Network / Server Status | 1 | 12-11-2003 06:36 PM |
| 11.29.2003 and 11.30.2003 - ns, mail, mysql, cp and pg.hsphere.cc | bigdave | Network / Server Status | 48 | 12-01-2003 08:21 AM |
| 07/18/2003: unix5.hsphere.cc | MichaelB | Network / Server Status | 1 | 07-18-2003 03:34 PM |