© 2007 Open Grid Forum Cloud Computing BOF OGF22 Birds of a Feather Session Hyatt Regency Cambridge February 27 2008 Geoffrey Fox Indiana University

Presentation transcript:

Slide 1: Cloud Computing BOF
  OGF22 Birds of a Feather Session
  Hyatt Regency Cambridge, February 27, 2008
  Geoffrey Fox, Indiana University

Slide 2: Cloud Agenda
  Geoffrey Fox (Indiana U.): Remarks on Cloud Computing
  Martin Swany (Internet2): Clouds and Dynamic Networking
  Steven Newhouse (Microsoft): Personal View on Clouds
  Kate Keahey (Argonne, Chicago): First Steps in the Clouds
  Next Steps

Slide 3: What are Clouds?
  Clouds are Virtual Clusters (Virtual Grids) of possibly Virtual Machines
    They may cross administrative domains or may just be a single cluster; the user cannot and does not want to know
  Clouds support access to (lease of) computer instances
    Instances accept data and job descriptions (code) and return results that are data and status flags
  Each Cloud is a Narrow (perhaps internally proprietary) Grid
  When does the Cloud concept work?
    Parameter searches, LHC-style data analysis ...
    Common case (most likely success case for clouds) versus corner case?
  Clouds can be built from Grids
  Grids can be built from Clouds
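The lease model described on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `Cloud`, `Instance`, `Result`, and `parameter_search` are hypothetical and do not correspond to any real cloud API. The sketch shows a user leasing instances, handing each one a job description (code) plus data, and getting back data and a status flag, without knowing where the instance runs.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable, List


@dataclass
class Result:
    data: Any    # output data returned by the instance
    status: str  # status flag: "ok" or "failed"


class Instance:
    """Hypothetical leased compute instance (virtual or physical)."""

    def run(self, job: Callable[[Any], Any], data: Any) -> Result:
        # The job description is modeled as a plain Python callable.
        try:
            return Result(data=job(data), status="ok")
        except Exception as exc:
            return Result(data=str(exc), status="failed")


class Cloud:
    """Hypothetical cloud; the user never sees where instances live."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.leased = 0

    def lease(self) -> Instance:
        if self.leased >= self.capacity:
            raise RuntimeError("no instances available")
        self.leased += 1
        return Instance()

    def release(self, instance: Instance) -> None:
        self.leased -= 1


def parameter_search(cloud: Cloud, job: Callable[[Any], Any],
                     params: Iterable[Any]) -> List[Result]:
    """Run one independent job per parameter value (sequentially here)."""
    results = []
    for value in params:
        instance = cloud.lease()
        try:
            results.append(instance.run(job, value))
        finally:
            cloud.release(instance)
    return results
```

A parameter search is used as the example workload because the slide names it as a case where the cloud concept works: each job is independent, so the caller needs nothing beyond lease, run, and release.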

Slide 4: Cloud References
  Includes references to Amazon, Apple, Dell, Enomalism, Globus, Google, IBM, KnowledgeTreeLive, Nature, New York Times, Zimdesk
  Others like Microsoft Windows Live Skydrive important
    sk=view&id=2589&Itemid=1
  Policy Issues
    sk=view&id=2589&Itemid=1
  Hadoop (MapReduce) and Data Intensive Computing
  See Data intensive computing minitrack at HICSS-42 January
  OGF Thought Leadership blog
  OGF22 talks by Charlie Catlett and Irving Wladawsky-Berger

Slide 5: Big-Data Computing Study Group
  CCC Role Versus OGF?
  Hadoop and MapReduce are just workflow?

Slide 6: Google MapReduce: Simplified Data Processing on Clusters/Clouds
  This is a dataflow model between services, where services can do useful document-oriented data-parallel applications including reductions
  The decomposition of services onto cluster engines (clouds) is automated
  The large I/O requirements of datasets change the efficiency analysis in favor of dataflow
  Services (count words in the example) can obviously be extended to general parallel applications
  There are many alternative languages for expressing dataflow and/or parallel operations and/or workflow
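The word-count example mentioned on this slide can be sketched in plain Python to show the dataflow between the map and reduce stages. This is a single-process illustration, not Google's or Hadoop's implementation; the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are chosen here for clarity. In a real system, the framework would distribute the map and reduce calls across the cluster automatically.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def map_phase(document: str) -> List[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key (the word)."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups: Dict[str, List[int]]) -> Dict[str, int]:
    """Reduce: sum the grouped values to get a count per word."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(documents: Iterable[str]) -> Dict[str, int]:
    """Chain map, shuffle, and reduce over a collection of documents."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    return reduce_phase(shuffle(pairs))
```

Because each `map_phase` call touches only one document and each reduction touches only one key, both stages can run in parallel, which is what lets the decomposition onto cluster engines be automated.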

Slide 7: Technical Questions about Clouds I
  What is the performance overhead?
    On individual CPU
    On system including data and program transfer
  What is the cost gain?
    From size efficiency; green location (rumor that Google has purchased the Niagara Falls including Canada!)
  Is Cloud Security adequate: can clouds be trusted?
  Can one do parallel computing on clouds?
    Looking at capacity not capability, i.e. lots of modest-sized jobs
    Marine corps will use Petaflop machines – they just need ssh and a.out

Slide 8: Technical Questions about Clouds II
  How is data-compute affinity tackled in clouds?
    Co-locate data and compute clouds?
    Lots of optical fiber, i.e. just move the data?
  What happens in clouds when demand for resources exceeds capacity: is there a multi-day job input queue?
    Are there novel cloud scheduling issues?
  Do we want to link clouds (or ensembles as atomic clouds); if so, how and with what protocols?
  Is there an intranet cloud, e.g. "cloud in a box" software to manage a personal (cores on my future 128-core laptop), department or enterprise cloud?
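The question of what happens when demand exceeds capacity can be made concrete with a toy backlog model. The sketch below assumes a fixed number of jobs completed per day and a simple first-come, first-served queue; both are simplifying assumptions introduced here, and `simulate_backlog` is a hypothetical helper, not part of any scheduler.

```python
from typing import List


def simulate_backlog(arrivals_per_day: List[int],
                     capacity_per_day: int) -> List[int]:
    """Return the number of jobs still queued at the end of each day.

    Assumes every job takes one unit of capacity and that unused
    capacity cannot be carried over to later days.
    """
    backlog = 0
    history = []
    for arrivals in arrivals_per_day:
        # Jobs waiting plus new arrivals, minus what the cloud can finish.
        backlog = max(0, backlog + arrivals - capacity_per_day)
        history.append(backlog)
    return history
```

For example, with a capacity of 10 jobs per day and arrivals of 5, 12, 15, 3, and 0 jobs, the end-of-day backlog is 0, 2, 7, 0, 0: a burst on days two and three creates a queue that takes until day four to drain.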

Slide 9: Standards for Compute and Storage Clouds
  We no longer need interoperability of services and messages (SOAP) but rather interoperability of clouds
    Maybe each cloud is so big that interoperability between clouds is not so critical
  Interoperability certainly for application-specific data and perhaps also for job specifications
    WFS, GML for Geo-data; IVOA standards; DST LHC experiment formats
    JSDL, BES etc.
  Each Cloud will be proprietary, but they might want raw infrastructure standards so they can easily swap in and out different vendors' disk drives
  Clouds very very loosely coupled; services loosely coupled

Slide 10: MSI Challenge Problem
  There are > 330 MSIs (Minority Serving Institutions)
  2 examples
    ECSU is a small state university in North Carolina
      HBCU with 4000 students
      Working on PolarGrid (Sensors in Arctic/Antarctic linked to TeraGrid)
    Navajo Tech in Crown Point NM is community college with technology leadership for Navajo Nation
      Internet to the Hogan and Dine Grid links Navajo communities by wireless
      Wish to integrate TeraGrid science into Navajo Nation education curriculum
  Current Grid technology too complicated if you are not an R1 institution
    Hard to deploy campus grids broadly into MSIs
  Clouds provide virtual campus resources?

Slide 11: Next Steps at OGF
  Clouds are just starting and build on/are related to Grids
  Clear need for best practice in use and technology
  Likely to be need for new standards and novel use of existing/projected standards
  New Cloud Community Group?
    Chairs, participants?
  Workshop? OGF23 activity?
  Identify key players not currently involved with OGF?