Information RedundancyInformation can be defined a group of bits organised in a well defined manner. In real time system large amount is information has to be transferred in a very short time in a precisely secure manner. There are many casualties during the transfer of information. For example the information can be corrupted due to [...]
Continue reading Information Redundancy
Hardware Redundancy
Use of additional hardware to compensate for failures
This can be done in two ways
Fault detection, correction and Masking. Multiple hardware units may be assigned to do the same task in parallel and their results compared. If one or more units are faulty, we can express this to show up as a disagreement in the [...]
Continue reading Hardware Redundancy
A Fault in one part of the system cause large voltage swings in the other parts of the system. So it is necessary to prevent from spreading through the system. This is called as containment.
This can be divided into
Fault Containment Zone (FCZ) and
A failure of some part of the computer outside an FCZ cannot cause [...]
Continue reading Fault and Error Containment
Fault Tolerance Techniques
Introduction
Hardware Faults – Occurs due to a physical defect of a system like a broken wire or a logic struck at 0 in a gate.
Software faults – occurs due to a bug introduced in a system so the software misbehaves for a given set of inputs
Error – the manifestation of a fault is [...]
Continue reading Introduction to Fault Tolerance