Jimmie L Green
POS/355
July 10, 2014
James Johnsen
Four Failures That Matter
Introduction: Operating systems has come a long way and a much improvement in the way each system built. This paper will discuss the four common types of distributed computer system failures, which are crash failure also known as operating system failure. Hardware failure. Omission failures and byzantine failures. Included in the discussion are failures, which can also occur in a centralized computer system.
A failure described as the difference of service it delivers to the user when it deviates from compliance with the system specification for a specified period. Failures known as a system not working properly. A fault can lead to other faults, or to a failure, or neither.
Crash failures normally associated which a server fault in a typical distributed system. Inherently crash failures are interrupt operations of the server and can halt operation for a considerable time. Operating system or software failures come in many more varieties than hardware failures. Software bugs in distributed systems can be difficult to replicate and, consequently, repair and or debug. Corresponding fault tolerant systems are developed and employed with respect to these affects. An operating system or software failure can also occur in a centralized system such as a database this is why it is highly recommended to back up a database using stable mass storage media.
Omission failures either due to process crash or either due to communication link failures detected via timeouts. However, in an asynchronous system a timeout is an indication only that a process is not responding. The process crashed or just slow down due to heavy processing load. As usually as a timeout, we pick a maximum period. The maximum period allows to account for either the network became congested and therefore slower on the response return/ the slower processing in the process and the
References: Agbaria, A., & Friedman, R. (oct.2001). Overcoming Byzantine Failures Using Checkpointing. Retrieved from http://www.perform.csl.illinois.edu/papers/USAN_papers/03AGB02.PDF stallings, W. (2012). operating systems internal and design principles (7th ed.). Upper Saddle River, NJ: prentice hall.