metrics for high availability
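- MTTF (mean time to failure): average time the system runs before it fails.
- MTBF (mean time between failures): average time from one failure to the next in a repairable system. The system fails, is repaired, runs, and then fails again.
- MTTR (mean time to recover / repair /…): average time needed to bring the system back into service after a failure.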
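
As a rough illustration of how the three metrics above relate, the sketch below computes them from a handful of made-up observations. It assumes the common convention that MTBF = MTTF + MTTR and that steady-state availability is MTTF / (MTTF + MTTR); some sources define MTBF as uptime only, so treat this as one reading rather than the definitive one. The names `uptimes_h` and `repairs_h` and all the numbers are hypothetical.

```python
def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)

# Hypothetical observations in hours (invented for illustration only).
uptimes_h = [100.0, 140.0, 120.0]  # how long the system ran before each failure
repairs_h = [2.0, 4.0, 3.0]        # how long each repair/recovery took

mttf = mean(uptimes_h)             # mean time to failure: 120.0 h
mttr = mean(repairs_h)             # mean time to recover/repair: 3.0 h
mtbf = mttf + mttr                 # assumed convention: MTBF = MTTF + MTTR = 123.0 h

# Assumed steady-state availability: fraction of time the system is up.
availability = mttf / mtbf

print(f"MTTF={mttf:.1f}h MTTR={mttr:.1f}h MTBF={mtbf:.1f}h "
      f"availability={availability:.4f}")
```

Under these assumptions, availability improves either by making failures rarer (larger MTTF) or by recovering faster (smaller MTTR).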