Multi-site Jobs Management System (MJMS): A tool to manage multi-site MPI Applications Execution in Grid Environment
Contributo in Atti di convegno
Data di Pubblicazione:
2006
Abstract:
Multi-site parallel applications consist of multiple distributed processes running on one or more potentially heterogeneous resources in different locations. This class of applications can effectively use the Grid for high-performance computing. We propose a multi-site job management system (MJMS) based on the Globus Toolkit, the MPICH-G2 library and Condor-G for the effective, reliable and secure execution of multi-site parallel applications in the Grid environment.
This system allows the user to submit execution requests specifying application requirements and preferences in a high-level language (following Condor submit
file syntax) freeing her from the tasks of resource discovery and co-allocation: it efficiently selects available computing resources for execution according to the
application requirements and interacts with the local management systems through the Condor-G system.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
HPC; Grid; MPI; Multi-Site Applications
Elenco autori:
Gregoretti, Francesco; Oliva, Gennaro
Link alla scheda completa: