1 INTRODUCTION 2 BASIC DEFINITIONS 3 FAULTS, ERRORS, AND FAILURES 4 FAULT DURATION 5 DESIGN TECHNIQUES 6 FAULT-TOLERANT TECHNIQUES 7 TYPES OF REDUNDANCY 8 FAULT-TOLERANT ARCHITECTURE 9 REAL-TIME FAULT-TOLERANT SYSTEMS 10 THE LATENCY PROBLEM 11 APPLICATION AREAS 12 SOFTWARE FAULTS 13 DEPENDABILITY MODELLING 2 5 11 15 19 21 25 33 54 58 62 75 85
1
1 INTRODUCTION
Welcome to the , CSE42RFS Real-Time and Fault-Tolerant Systems!
Course Objectives Historical Background
2
INTRODUCTION
3
COURSE OBJECTIVES
It is assumed that students in this course have not been exposed previously to the terminology and techniques used in the fault-tolerant and real-time computing eld. Henceforth the principal aim of this course is to provide the students an introduction to the design and analysis of fault-tolerant and real-time systems. After completing this course, a student will be able to: Comprehend the existing fault-tolerant and real-time computing literature. Describe, explain, generalise, classify, adapt and assess those techniques, which are currently available for designing and analyzing reliable faulttolerant and real-time computer systems. Outline the methodologies that are available to combat system failures, caused by hardware and/or software. Recognise the analysis techniques, which can be used to verify that a system has met its requirements. Discuss the system design fundamentals of a fault-tolerant and real-time system used by Australia 's leading companies.
INTRODUCTION
4
HISTORICAL BACKGROUND
Through constant technological innovation, the vacuum tubes of the early computer systems have been replaced by chips with very large scale integration (VLSI) consisting of many thousands of gates. This has resulted in dramatic changes in the scale and complexity of computer systems, in both hardware and software aspects. Such changes have enabled certain tasks that were previously performed manually, or were even impossible, to be carried out by computers:
References: Dordiecht, 1999, pp. 361-374. 7. Object Management Group, “Fault Tolerance Request for Proposals,” 1999; available online at http://www.omg.org/techprocess/meetings/schedule/ Fault_Tolerance_RFP.htm.