Please use this identifier to cite or link to this item:
Title: Checkpointing in hybrid distributed systems
Authors: Cao, J 
Chen, Y
Zhang, K
He, Y
Keywords: Algorithms
Communication systems
Computer system recovery
Data processing
Fault tolerant computer systems
Issue Date: 2004
Publisher: IEEE Computer Society
Source: ISPAN 2004 : 7th International Symposium on Parallel Architectures, Algorithms and Networks : 10-12 May 2004, Hong Kong, SAR, China, p. 136-141 How to cite?
Abstract: To provide fault tolerance to computer systems suffering from transient faults, checkpointing and rollback recovery is one of the widely-used techniques. Among others, two primary checkpointing schemes have been proposed: independent and coordinated schemes. However, most existing works address only the need of employing a single checkpointing and rollback recovery scheme to a target system. In this paper, issues are discussed and a new algorithm is developed to address the need of integrating independent and coordinated checkpointing schemes for applications running in a hybrid distributed environment containing multiple heterogeneous subsystems. The required changes to the original checkpointing schemes for each subsystem and the overall prevented unnecessary rollbacks for the integrated system are presented. Also described is an algorithm for collecting garbage checkpoints in the combined hybrid system.
ISBN: 0-7695-2135-5
ISSN: 1087-4089
Rights: © 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
checkpointing_04.pdf187.55 kBAdobe PDFView/Open
View full-text via PolyU eLinks SFX Query
Show full item record
PIRA download icon_1.1View/Download Contents


Last Week
Last month
Citations as of Jan 12, 2019


Last Week
Last month
Citations as of Jan 15, 2019

Page view(s)

Last Week
Last month
Citations as of Jan 13, 2019


Citations as of Jan 13, 2019

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.