The objective of this course is to become familiar with general and state-of-the-art techniques used in the design and analysis of fault-tolerant digital systems, and to study and investigate existing fault-tolerant systems. Both hardware and software methods will be studied, and new research topics will be investigated.
This course gives an overview of reliable (fault-tolerant) computing and of the design and evaluation of dependable systems, and provides a base for research in reliable systems. It teaches the models and methods used in the analysis and design of fault-tolerant and highly reliable computer systems. Topics include faults and their manifestations, fault/error modeling, reliability, availability, and maintainability analysis, system evaluation, performance-reliability trade-offs, system-level fault diagnosis, hardware and software redundancy techniques, and fault-tolerant system design methods.