MVSFORUMS.com
A Community of and for MVS Professionals

Why Mainframes rarely crash?

Himesh
CICS Forum Moderator


Joined: 20 Dec 2002
Posts: 80
Topics: 21
Location: Chicago

Posted: Tue Jan 21, 2003 1:09 am    Post subject: Why Mainframes rarely crash?

Why Mainframes rarely crash?

An excerpt from the same article.

Quote:

Mainframes can achieve "four nines" or "five nines" availability: 99.99 or 99.999 percent uptime. That translates into only 5 to 53 minutes of downtime per year. In fact, IBM's Server Group claims that the mean time between critical failures (MTBCF) for its System/390 mainframes -- that is, the average time between failures that force a reboot and an initial program load -- is 20 to 30 years.
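The downtime figures in the quote can be checked with a little arithmetic. A minimal sketch, assuming a 365-day year (leap years ignored); the function name is just an illustration, not something from the article:

```python
# Convert an availability percentage into the downtime it allows per year.
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes; assumes a non-leap year


def downtime_minutes_per_year(availability_percent: float) -> float:
    """Return the minutes of downtime per year implied by an uptime percentage."""
    unavailable_fraction = (100.0 - availability_percent) / 100.0
    return unavailable_fraction * MINUTES_PER_YEAR


for label, pct in [("four nines", 99.99), ("five nines", 99.999)]:
    minutes = downtime_minutes_per_year(pct)
    print(f"{label} ({pct}%): {minutes:.2f} minutes/year")
```

That works out to roughly 52.56 minutes for four nines and 5.26 minutes for five nines, which matches the "5 to 53 minutes" in the excerpt.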


regards,
Himesh
Grant
Beginner


Joined: 02 Dec 2002
Posts: 45
Topics: 1
Location: Sydney, NSW, Australia

Posted: Tue Jan 21, 2003 5:11 pm

That's very interesting, Himesh.
Our overnight incident report is full of server crashes/freezes/reboots, but the mainframe just keeps on keeping on...
zatlas
Beginner


Joined: 17 Dec 2002
Posts: 43
Topics: 4

Posted: Thu Jan 23, 2003 12:57 am

Hi
Like the old Volkswagen after 40 years of production of virtually the same model, all the kinks have been worked out. On the other hand, the thing was designed as an 18-wheeler from the start. It is not a desktop that was put on its side and rebranded as a server (hmmm... built-in scalability, what a concept).
ZA
DaveyC
Moderator


Joined: 02 Dec 2002
Posts: 151
Topics: 3
Location: Perth, Western Australia

Posted: Tue Jan 28, 2003 10:07 am

It wasn't always that way. I remember the dreaded MVS/XA spin loop like it was only yesterday. ESA with PAF and auto ACR took care of that nasty. The old machines would crash all the time because of environmental problems like a chiller unit failure (pop goes the TCM). Those environmentals cost a fortune to run, which is probably why companies wanted a cheaper alternative. Thank god for CMOS.

I used to be an MVS operator back in the '80s. If I had a dollar for each time we lost the machine, I could retire.
_________________
Dave Crayford
Himesh
CICS Forum Moderator


Joined: 20 Dec 2002
Posts: 80
Topics: 21
Location: Chicago

Posted: Thu Jan 30, 2003 4:26 am

Dave,

What you said has taken me by surprise.
I would really appreciate it if you could explain a little more about the problems you mentioned (especially the "spin loop").


regards,
Himesh
DaveyC
Moderator


Joined: 02 Dec 2002
Posts: 151
Topics: 3
Location: Perth, Western Australia

Posted: Thu Jan 30, 2003 6:52 am

Spin loops occur when one CPU holds a lock on a resource that another CPU needs: the waiting CPU spins until the resource becomes available. When the spin threshold is exceeded, CPU recovery is required. Spin loops are still quite common, but now the CPU recovery is automated. Back then a WTOR (write-to-operator-with-reply) was issued to the operators. If they were quick and switched on, they could answer the message and invoke CPU recovery; they had about 120 seconds to answer. Most of the time the operators were watching TV and the machine crashed.
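For anyone who wants to see the idea in code, here is a toy model of a spin loop with a threshold. It is only a sketch and not how MVS actually implements it: the `Resource` class, the `spin_for` function, and the tick-based timing are all made up for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Resource:
    """A lock-like resource held by at most one CPU at a time (hypothetical)."""
    holder: Optional[int] = None


def spin_for(resource: Resource, waiting_cpu: int,
             release_at_tick: Optional[int], spin_threshold: int) -> str:
    """Spin until the resource is free or the spin threshold is exceeded.

    release_at_tick is the tick at which the current holder frees the
    resource, or None if it never does (for example, the holder is hung).
    Returns "acquired" on success, or "recovery" when recovery is needed.
    """
    tick = 0
    while True:
        # Model the holding CPU releasing the resource at the given tick.
        if release_at_tick is not None and tick >= release_at_tick:
            resource.holder = None
        if resource.holder is None:
            resource.holder = waiting_cpu
            return "acquired"
        # Threshold exceeded: stop spinning and ask for CPU recovery.
        if tick > spin_threshold:
            return "recovery"
        tick += 1


# The holder releases in time, so the waiting CPU gets the resource.
print(spin_for(Resource(holder=0), waiting_cpu=1,
               release_at_tick=3, spin_threshold=10))
# The holder never releases, so the threshold trips and recovery is needed.
print(spin_for(Resource(holder=0), waiting_cpu=1,
               release_at_tick=None, spin_threshold=5))
```

In the old days the "recovery" branch is where the WTOR went out and the 120-second clock started; with automated recovery the system takes that step itself.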

As for environmentals, before CMOS, MVS mainframes were water-cooled. We had 3 chiller units: 2 active and 1 spare. If you had a chiller failure the spare was supposed to kick in, but the spares were not reliable and often needed manual intervention. By the time you got to the chiller units, the machine had crashed due to a thermal trip.

IBM hardware was also prone to nasty failures. We had 18 3390/3 HDA crashes in a month due to a chemical corruption in the manufacturing plant in Germany... EMC was the choice after that fiasco.

However, the last 10 years have seen mainframes deliver the five nines mentioned in that article.
_________________
Dave Crayford
Himesh
CICS Forum Moderator


Joined: 20 Dec 2002
Posts: 80
Topics: 21
Location: Chicago

Posted: Thu Jan 30, 2003 7:06 am

Dave,

Thanks for your valuable input.
It must have been quite interesting to have worked on the mainframes "of the past".

regards,
Himesh
All times are GMT - 5 Hours