Tolerancia a fallas
Tolérance aux pannes
Tolleranza ai guasti
Tolerância às avarias
Since it is impossible to totally prevent breakdowns, one solution consists in setting up redundancy mechanisms by duplicating critical resources.
The ability of a system to operate despite the failure of one of its components is called fault tolerance.
When one of the resources breaks down, the other resources take over in order to give system administrators the time to find a solution to the problem. This is called "Fail-Over Service" (FOS).
Ideally, in the case of material failures, the faulty material elements should be hot swappable, i.e. capable of being extracted and replaced without service interruption.