This paper discusses the process of building an environment where large-scale, complex, scientific analysis can be scheduled onto a heterogeneous collection of computational and storage resources. The example application is the Southern California Earthquake Center (SCEC) CyberShake project, an analysis designed to compute probabilistic seismic hazard curves for sites in the Los Angeles area. We explain which software tools were used to build to the system, describe their functionality and interactions. We show the results of running the CyberShake analysis that included over 250,000 jobs using resources available through SCEC and the TeraGrid. ?? 2006 IEEE.
Additional publication details
Managing large-scale workflow execution from resource provisioning to provenance tracking: The CyberShake example
Larger Work Title:
e-Science 2006 - Second IEEE International Conference on e-Science and Grid Computing
e-Science 2006 - 2nd IEEE International Conference on e-Science and Grid Computing