Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/805
PIRA download icon_1.1View/Download Full Text
Title: Checkpointing in hybrid distributed systems
Authors: Cao, J 
Chen, Y
Zhang, K
He, Y
Issue Date: 2004
Source: ISPAN 2004 : 7th International Symposium on Parallel Architectures, Algorithms and Networks : 10-12 May 2004, Hong Kong, SAR, China, p. 136-141
Abstract: To provide fault tolerance to computer systems suffering from transient faults, checkpointing and rollback recovery is one of the widely-used techniques. Among others, two primary checkpointing schemes have been proposed: independent and coordinated schemes. However, most existing works address only the need of employing a single checkpointing and rollback recovery scheme to a target system. In this paper, issues are discussed and a new algorithm is developed to address the need of integrating independent and coordinated checkpointing schemes for applications running in a hybrid distributed environment containing multiple heterogeneous subsystems. The required changes to the original checkpointing schemes for each subsystem and the overall prevented unnecessary rollbacks for the integrated system are presented. Also described is an algorithm for collecting garbage checkpoints in the combined hybrid system.
Keywords: Algorithms
Communication systems
Computer system recovery
Data processing
Demodulation
Fault tolerant computer systems
Optimization
Publisher: IEEE Computer Society
ISBN: 0-7695-2135-5
ISSN: 1087-4089
Rights: © 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
checkpointing_04.pdf187.55 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

88
Last Week
1
Last month
Citations as of May 5, 2024

Downloads

91
Citations as of May 5, 2024

SCOPUSTM   
Citations

12
Last Week
0
Last month
Citations as of Apr 26, 2024

WEB OF SCIENCETM
Citations

7
Last Week
0
Last month
0
Citations as of May 2, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.