Providing fault tolerance in extreme scale parallel applications What can the HPC community learn from the Database community Providing fault tolerance in extreme scale parallel…