Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/844
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Computing | - |
dc.creator | Cao, J | - |
dc.creator | Li, Y | - |
dc.creator | Guo, M | - |
dc.date.accessioned | 2014-12-11T08:24:13Z | - |
dc.date.available | 2014-12-11T08:24:13Z | - |
dc.identifier.isbn | 0-7695-2281-5 | - |
dc.identifier.uri | http://hdl.handle.net/10397/844 | - |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.rights | © 2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | en_US |
dc.rights | This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. | en_US |
dc.subject | Distributed computer systems | en_US |
dc.subject | Knowledge based systems | en_US |
dc.subject | Parallel processing systems | en_US |
dc.subject | Telecommunication networks | en_US |
dc.title | Process migration for MPI applications based on coordinated checkpoint | en_US |
dc.type | Conference Paper | en_US |
dcterms.abstract | A lot of research has been done on fault-tolerance for MPI applications, some on checkpoint/restart, and some on network fault-tolerance. Process migration, however, has not gained widespread use due to the additional complexity of the requirement that the knowledge about the new location of a migrated process has to be made known to every other process in the application. Here we present a simple yet effective method of process migration based on coordinated checkpointing of MPI applications. Migration is achieved by checkpointing the application, modifying the process location information in the checkpoint files, and restarting the application. Checkpoint/restart and migration are transparent to MPI applications. Performance evaluation results showed that the additional checkpoint/restart capability has little impact on application performance, and the migration method scales well on a large number of nodes. | - |
dcterms.accessRights | open access | en_US |
dcterms.bibliographicCitation | ICPADS 2005 : 11th International Conference on Parallel and Distributed Systems, 20-22 July 2005, Fukuoka, Japan, p. 306-312 | - |
dcterms.issued | 2005 | - |
dc.identifier.isi | WOS:000231815200045 | - |
dc.identifier.scopus | 2-s2.0-23944489879 | - |
dc.relation.ispartofbook | ICPADS 2005 : 11th International Conference on Parallel and Distributed Systems, 20-22 July 2005, Fukuoka, Japan | - |
dc.relation.conference | International Conference on Parallel and Distributed Systems [ICPADS] | - |
dc.identifier.rosgroupid | r25659 | - |
dc.description.ros | 2005-2006 > Academic research: refereed > Refereed conference paper | - |
dc.description.oa | Version of Record | en_US |
dc.identifier.FolderNumber | OA_IR/PIRA | en_US |
dc.description.pubStatus | Published | en_US |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
coordinated-checkpoint_05.pdf | 224.82 kB | Adobe PDF | View/Open |
Page views
77
Last Week
0
0
Last month
Citations as of Mar 24, 2024
Downloads
71
Citations as of Mar 24, 2024
SCOPUSTM
Citations
21
Last Week
0
0
Last month
0
0
Citations as of Mar 29, 2024
WEB OF SCIENCETM
Citations
8
Last Week
0
0
Last month
0
0
Citations as of Mar 28, 2024
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.