Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/805
Title: Checkpointing in hybrid distributed systems
Authors: Cao, J 
Chen, Y
Zhang, K
He, Y
Keywords: Algorithms
Communication systems
Computer system recovery
Data processing
Demodulation
Fault tolerant computer systems
Optimization
Issue Date: 2004
Publisher: IEEE Computer Society
Source: ISPAN 2004: 7th International Symposium on Parallel Architectures, Algorithms and Networks, 10-12 May 2004, Hong Kong, SAR, China, p. 136-141
Abstract: Checkpointing and rollback recovery is a widely used technique for providing fault tolerance to computer systems that suffer from transient faults. Two primary checkpointing schemes have been proposed: independent and coordinated. However, most existing work considers applying only a single checkpointing and rollback recovery scheme to a target system. This paper discusses the issues involved in integrating independent and coordinated checkpointing schemes for applications running in a hybrid distributed environment containing multiple heterogeneous subsystems, and develops a new algorithm for doing so. It presents the changes required to the original checkpointing scheme of each subsystem and the unnecessary rollbacks that are prevented in the integrated system. It also describes an algorithm for collecting garbage checkpoints in the combined hybrid system.
URI: http://hdl.handle.net/10397/805
ISBN: 0-7695-2135-5
ISSN: 1087-4089
Rights: © 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Appears in Collections: Conference Paper

Files in This Item:
File: checkpointing_04.pdf
Size: 187.55 kB
Format: Adobe PDF

Scopus citations: 12 (as of Sep 16, 2017)

Page view(s): 349 (last week: 2), checked on Sep 24, 2017

Download(s): 555, checked on Sep 24, 2017

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.