Synopses & Reviews
Scientific Workflow has seen massive growth in recent years as science becomes increasingly reliant on the analysis of massive data sets and the use of distributed resources. The workflow programming paradigm is seen as a means of managing the complexity in defining the analysis, executing the necessary computations on distributed resources, collecting information about the analysis results, and providing means to record and reproduce the scientific analysis. Workflows for e-Science presents an overview of the current state of the art in the field. It brings together research from many leading computer scientists in the workflow area and provides real world examples from domain scientists actively involved in e-Science. The computer science topics addressed in the book provide a broad overview of active research focusing on the areas of workflow representations and process models, component and service-based workflows, standardization efforts, workflow frameworks and tools, and problem solving environments and portals. The topics covered represent a broad range of scientific workflow and will be of interest to a wide range of computer science researchers, domain scientists interested in applying workflow technologies in their work, and engineers wanting to develop workflow systems and tools. As such Workflows for e-Science is an invaluable resource for potential or existing users of workflow technologies and a benchmark for developers and researchers.
This collection of articles on Work?ows for e-Science is very timely and - portant. Increasingly, to attack the next generation of scienti?c problems, multidisciplinary and distributed teams of scientists need to collaborate to make progress on these new Grand Challenges . Scientists now need to access and exploit computational resources and databases that are geographically distributed through theuseof high speed networks. Virtual Organizations or VOs must be established that span multiple administrative domains and/or institutions and which can provide appropriate authentication and author- ation services and access controls to collaborating members. Some of these VOsmayonlyhavea?eetingexistencebutthelifetimeofothersmayrun into many years. The Grid community is attempting to develop both sta- ards and middleware to enable both scientists and industry to build such VOs routinely and robustly. This, of course, has been the goal of research in distributed computing for many years; but now these technologies come with a new twist service orie- ation. By specifying resources in terms of a service description, rather than allowing direct access to the resources, the IT industry believes that such an approach results in the construction of more robust distributed systems. The industry has therefore united around web services as the standard technology toimplementsuchserviceorientedarchitecturesandtoensureinteroperability between di?erent vendor systems."
This is a timely book presenting an overview of the current state-of-the-art within established projects, presenting many different aspects of workflow from users to tool builders. It provides an overview of active research, from a number of different perspectives. It includes theoretical aspects of workflow and deals with workflow for e-Science as opposed to e-Commerce. The topics covered will be of interest to a wide range of practitioners.
About the Author
Dr. Ian Taylor has been a Lecturer at Cardiff University's School of Computer Science since 2002. He concurrently holds an adjunct Assistant Professorship at the Center for Computation & Technology at Louisiana State University and regularly offers consultations in the USA. He has a Ph.D. in Physics and Music and is the co-ordinator of Triana activities at Cardiff (http://www.trianacode.org). Through this he has been active in many major projects including GridLab, CoreGrid and GridOneD. His research interests include distributed techniques and workflow for Grid and P2P computing, which take in applications ranging from astrophysics and healthcare to distributed audio. Ian has previously written a professional book for Springer on P2P, Web Services and Grids, and has published over 50 scientific papers. He has also co-edited a special edition for Journal of Grid Computing on Scientific Workflow. Dr. Matthew Shields has been a research associate at Cardiff University, jointly in the Schools of Computer Science, Physics and Astronomy, since 2001. He gained his Ph.D. in Computer Science from Cardiff University in the area of problem solving environments. Dr Shields is one of two lead developers for the Triana project and has been responsible for helping broaden its adoption within new application domains including biodiversity. His interests include problem solving environments, workflow, component and service based computing, Grid and high-performance computing. Ewa Deelman is an Research Assistant Professor at the USC Computer Science Department and a Research Team Leader at the Center for Grid Technologies at the USC Information Sciences Institute. Dr. Deelman's research interests include the design and exploration of collaborative scientific environments based on Grid technologies, with particular emphasis on workflow management as well as the management of large amounts of data and metadata. At ISI, Dr. Deelman is leading the Pegasus project, which designs and implements workflow mapping techniques for large-scale workflows running in distributed environments. Pegasus is being used day-to-day by scientists in a variety of disciplines including astronomy, gravitational-wave physics, earthquake science and many others. Prior to joining ISI in 2000, she was a Senior Software Developer at UCLA conducting research in the area of performance prediction of large-scale applications on high performance machines. Dr. Deelman received her PhD from Rensselaer Polytechnic Institute in Computer Science in 1997 in the area of parallel discrete event simulation. Dr. Deelman is an Associate Editor responsible for Grid Computing for the Scientific Programming Journal and a chair of the GGF Workflow Management Research Group. Dennis Gannon, Department of Computer Science, Lindley Hall, Indiana University, Bloomington, IN 47401 (firstname.lastname@example.org) Dr. Gannon is a professor of Computer Science in the School of Informatics at Indiana University. He is also Science Director for the Indiana Pervasive Technology Labs. He received his Ph.D. in Computer Science from the University of Illinois in 1980 and his Ph.D. in Mathematics from the University of California in 1974. From 1980 to 1985, he was on the faculty at Purdue University. From 1997-2004 he was Chairman of the Indiana Computer Science Department. His research interests include software tools for high performance parallel and distributed systems and problem solving environments for scientific computation.
Table of Contents
From the contents
Introduction.- Scientific Versus Business Workflow.- Applications and User Perspectives.- Generating Complex Astronomy Workflows.- Workflows in Pulsar Astronomy.- Workflow and Biodiversity e-Science.- Ecological Niche Modeling Using the Kepler Workflow System.- Case Studies on the Use of Workflow Technologies for Scientific Analysis.- Dynamic Adaptive Workflows for Mesoscale Meteorology.- SCEC CyberShake Workflows.- Workflow Representations and Common Structure.- Control versus Data Driven Workflow.- Components Architectures and Services.- Petri Nets.- Adapting BPEL to Scientific Workflows.- Workflow Composition.- Virtual Data Language.- Frameworks and Tools.- Workflow Level Parametric Study Support by MOTEUR and the P_GRADE portal.- Taverna / myGrid.- The Triana Workflow Environment.- Java CoG Kit Workflow.- Workflow Management in Condor.- Pegasus.- ICENI.- Expressing Workflow in the Cactus Framework.- Sedna.- ASKALON.- Future Requirements.- Looking into the Future of Workflows.