Zhiming Liu, Modelling Checkpointing and Recovery within UNITY (August 1, 1989).
Following the method for developing programs proposed in UNITY [CM88], this paper presents a general model for checkpointing and recovery within which the consistency of checkpoints, error propagation and the domino effect of recovery from faults are formally characterized. Based on this model and UNITY, a solution is developed for checkpointing and recovery which describes what should be done but imposes no restrictions on when it should be done, where it should be done (e.g. on which processors), or how it should be done. This supports the idea that the introduction of checkpointing and recovery can be treated systematically as transformations which convert a program into a fault-tolerant program. Such fault-tolerant transformations and their correctness are defined in this paper.
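
As an informal illustration of two notions named in the abstract, consistency of checkpoints and the domino effect, the following Python sketch may help. It is not the paper's UNITY formulation. The message-log representation, the names `Message`, `is_consistent` and `recovery_line`, and the two-process example are all assumptions made here for illustration. A set of checkpoints (one per process) is treated as consistent when no message is recorded as received without also being recorded as sent.

```python
from typing import Dict, List, NamedTuple


class Message(NamedTuple):
    # Hypothetical log entry; times are local event indices (>= 1).
    sender: str
    send_time: int
    receiver: str
    recv_time: int


def is_consistent(line: Dict[str, int], messages: List[Message]) -> bool:
    """line maps each process to the local time of its chosen checkpoint.
    A checkpoint taken at time t records every local event with index <= t.
    The line is inconsistent if some message is recorded as received
    but not recorded as sent (an "orphan" message)."""
    return not any(
        m.recv_time <= line[m.receiver] and m.send_time > line[m.sender]
        for m in messages
    )


def recovery_line(checkpoints: Dict[str, List[int]],
                  messages: List[Message]) -> Dict[str, int]:
    """Start from each process's latest checkpoint and roll receivers of
    orphan messages back until the line is consistent.
    Assumes each checkpoint list is sorted and starts with 0 (initial state)."""
    pos = {p: len(cps) - 1 for p, cps in checkpoints.items()}
    changed = True
    while changed:
        changed = False
        for m in messages:
            sent_before = m.send_time <= checkpoints[m.sender][pos[m.sender]]
            recv_before = m.recv_time <= checkpoints[m.receiver][pos[m.receiver]]
            if recv_before and not sent_before:
                pos[m.receiver] -= 1  # receiver must undo the receive
                changed = True
    return {p: checkpoints[p][pos[p]] for p in checkpoints}


# Assumed example: every checkpoint interval is crossed by a message,
# so one rollback forces the next, all the way back to the initial state.
checkpoints = {"P": [0, 3, 6], "Q": [0, 3, 6]}
messages = [
    Message("P", 7, "Q", 5),
    Message("Q", 4, "P", 5),
    Message("P", 4, "Q", 2),
    Message("Q", 1, "P", 1),
]

print(is_consistent({"P": 6, "Q": 6}, messages))
print(recovery_line(checkpoints, messages))
```

In this example the latest checkpoints are not consistent, and each rollback exposes a new orphan message until both processes are back at their initial states. This cascading is what the domino effect refers to.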