VECPAR'06 - Seventh International Meeting on High Performance Computing for Computational Science
Experiencing Data Grids
Nicolaas Ruberg (COPPE/UFRJ)
Nelson Kotowski (COPPE/UFRJ)
Amanda Mattos (COPPE/UFRJ)
Luciana Matos (COPPE/UFRJ)
Melissa Machado (COPPE/UFRJ)
Daniel Oliveira (COPPE/UFRJ)
Rafael Monclar (COPPE/UFRJ)
Cláudio Ferraz (COPPE/UFRJ)
Talitta Sanchotene (COPPE/UFRJ)
Vanessa Braganholo (COPPE/UFRJ)
Many scientific experiments involve data-intensive applications and the orchestration of computational workflow activities. These experiments can benefit from the data parallelism exploited in parallel systems to minimize execution time. Due to their complexity, robustness and efficiency in exploiting data parallelism, grid infrastructures are widely used in some e-Science areas such as bioinformatics. Workflow techniques are very important to in-silico bioinformatics experiments, allowing the e-scientist to describe and enact experimental processes in a structured, repeatable and verifiable way. The main purpose of this paper is to describe our experience with the Taverna Workbench and PeDRo, which are part of the myGrid project. Taverna provides a workflow toolset and enactor, allowing the specification of processing units, data transfers and execution constraints. As a data entry tool, PeDRo provides a model, a controlled vocabulary and field validations for Web Services descriptions, leveraging the knowledge associated with the workflows. The main contribution of this work is a summary of considerations drawn from our experience with these tools, emphasizing their advantages and drawbacks, together with proposals for future improvements.
Data Grids, Web services and P2P, Transaction and workflow management in Grids
Universidade Federal do Rio de Janeiro - Coordenação dos Programas de Pós-graduação de Engenharia | Instituto Nacional de Matemática Pura e Aplicada | Rio de Janeiro, Brazil | July 10-13, 2006