Title
Checkpointing facility on a metasystem
Date Issued
01 January 2001
Access level
open access
Resource Type
conference paper
Author(s)
Cardinale Y.
Hernández E.
Publisher(s)
Springer Verlag
Abstract
A metasystem allows seamless access to a collection of distributed computational resources. Checkpointing is an important service in high throughput computing, especially for process migration and recovery after system crash. This article describes the experiences on incorporating checkpointing and recovery facilities in a Java-based metasystem. Our case study is suma, a metasystem for execution of Java bytecode, both sequential and parallel. This paper also shows preliminary results on checkpointing and recovery overhead for single-node applications.
Start page
75
End page
79
Volume
2150
Language
English
OCDE Knowledge area
Ciencias de la computación
Ingeniería de sistemas y comunicaciones
Scopus EID
2-s2.0-84858918997
ISSN of the container
03029743
ISBN of the container
9783540424956
Conference
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): 7th European Conference on Parallel Computing, Euro-Par 2001
Sources of information:
Directorio de Producción Científica