------------------------------------------------------------------------
                    *** Call for Participation ***

          Workshop on Data-Intensive Distributed Computing
         In conjunction with HPDC 2011, June 8, San Jose, CA
         http://www.cse.buffalo.edu/faculty/tkosar/didc2011
------------------------------------------------------------------------

The Fourth International Workshop on Data Intensive Distributed
Computing (DIDC 2011) will be held in conjunction with the 20th
International Symposium on High Performance Distributed Computing
(HPDC 2011), in San Jose, California.

The data needs of scientific as well as commercial applications from a
diverse range of fields have been increasing exponentially in recent
years. This increase in the demand for large-scale data processing has
necessitated collaboration and sharing of data collections among the
world's leading education, research, and industrial institutions, and
the use of distributed resources owned by collaborating parties. In a
widely distributed environment, data is often not locally accessible
and must therefore be retrieved and stored remotely. While traditional
distributed systems work well for computation that requires limited
data handling, they may fail in unexpected ways when the computation
accesses, creates, and moves large amounts of data, especially over
wide-area networks. Further, data accessed and created is often poorly
described, lacking both metadata and provenance. Scientists,
researchers, and application developers are often forced to solve basic
data-handling issues, such as physically locating data, accessing it,
and moving it to visualization or compute resources for further
analysis.

This workshop will focus on the challenges imposed by data-intensive
applications on distributed systems, and on the different
state-of-the-art solutions proposed to overcome these challenges.
It will bring together the collaborative and distributed computing
community and the data management community in an effort to generate
productive conversations on the planning, management, and scheduling of
data handling tasks and data storage resources.

Workshop Program:

SESSION - I

1:30 pm : Opening Remarks
          Tevfik Kosar, University at Buffalo, USA

1:40 pm : Keynote Address: "Wide-Area Clouds"
          Jon Weissman, University of Minnesota, USA

2:30 pm : Wide Area Placement of Data Replicas for Fast & Highly
          Available Data Access
          Fan Ping, University at Albany, USA
          Xiaohu Li, University at Albany, USA
          Christopher McConnell, University at Albany, USA
          Rohini Vabbalareddy, University at Albany, USA
          Jeong-Hyon Hwang, University at Albany, USA

3:00 pm : Preference Driven Server Selection in Peer-2-Peer Data
          Sharing Systems
          Abdelhamid Elwaer, Cardiff University, UK
          Ian Taylor, Cardiff University, UK
          Omer Rana, Cardiff University, UK

3:30 pm : Break

SESSION - II

4:00 pm : Early Experience with the Distributed Nebula Cloud
          Pradeep Sundarrajan, University of Minnesota, USA
          Abhishek Gupta, University of Minnesota, USA
          Matthew Ryden, University of Minnesota, USA
          Rohit Nair, University of Minnesota, USA
          Abhishek Chandra, University of Minnesota, USA
          Jon Weissman, University of Minnesota, USA

4:30 pm : Towards Transparent Integration of Heterogeneous Cloud
          Storage Platforms
          Ilja Livenson, KTH Royal Institute of Technology, Sweden
          Erwin Laure, KTH Royal Institute of Technology, Sweden

5:00 pm : Clause-Iteration with Map-Reduce to Scalably Query Data
          Graphs in the SHARD Graph-store
          Kurt Rohloff, BBN Technologies, USA
          Richard E. Schantz, BBN Technologies, USA

5:30 pm : Integrated Data Placement and Task Assignment for Scientific
          Workflows in Clouds
          Umit V. Catalyurek, Ohio State University, USA
          Kamer Kaya, CERFACS, France
          Bora Ucar, CNRS & ENS, France

6:00 pm : Closing Remarks
          Tevfik Kosar, University at Buffalo, USA