A Swift Talk about Globus Technology: What Can It Do for Me? OOI Cyberinfrastructure Design Meeting, San Diego, 17-19 October 2007. The Globus Team (presented by Kate Keahey)

Presentation transcript:

A Swift Talk about Globus Technology: What Can It Do for Me?
OOI Cyberinfrastructure Design Meeting, San Diego, 17-19 October 2007
The Globus Team (presented by Kate Keahey)

10/18/07, ORION meeting Virtual Workspaces:
Overview
- Globus Toolkit overview
- What is WSRF and how does it work?
- Defining community: membership and laws
- Workflow management with Swift
- Globus as project: how we work

What Is the Globus Toolkit?
- The Globus Toolkit is a collection of solutions to problems that frequently come up when trying to build collaborative distributed applications
- Heterogeneity
  - To date (v1.0 - v4.0), the Toolkit has focused on simplifying heterogeneity for application developers
  - We are increasingly including more “vertical solutions” that implement typical application patterns
- Security
  - The Grid Security Infrastructure (GSI) allows collaborators to share resources without blind trust
- Standards
  - Our goal has been to capitalize on and encourage use of existing standards (IETF, W3C, OASIS, GGF)
  - The Toolkit also includes reference implementations of new/proposed standards in these organizations

What’s In the Globus Toolkit?
- A Grid development environment
  - Develop new OGSA-compliant Web Services
  - Develop applications using Java or C/C++ Grid APIs
  - Secure applications using basic security mechanisms
- A set of basic Grid services
  - Job submission/management
  - File transfer (individual, queued)
  - Database access
  - Data management (replication, metadata)
  - Monitoring/Indexing system information
- Tools and Examples
- The prerequisites for many Grid community tools

Globus Technology Areas
- Core runtime
  - Infrastructure for building new services
  - WSRF = Web Services Resource Framework
- Security
  - Apply uniform policy across distinct systems
- Execution management
  - Provision, deploy, & manage services
- Data management
  - Discover, transfer, & access large data
- Monitoring
  - Discover & monitor dynamic services

Globus Toolkit: Globus Projects and Incubator Projects
[Diagram: projects grouped under the headings Security, Execution Mgmt, Info Services, Common Runtime, Data Mgmt, Other, and Incubator Projects]
Project labels in the diagram: MPICH G2, GridWay, Incubation Mgmt, Cog WF, LRMA, GAARDS, OGRO, GDTE, UGP, HOC-SA, PURSE, GridShib, Introduce, Dyn Acct, WEEP, Gavia JSC, Gavia MS, DDM, Virt WkSp, SGGC, Metrics, ServMark, GridFTP, Reliable File Transfer, OGSA-DAI, GRAM, MDS4, CAS, Data Rep, Delegation, Replica Location, Java Runtime, C Runtime, Python Runtime, C Sec, GT4 Docs, MEDICUS, GSI-OpenSSH, MyProxy, Swift, MonMan, NetLogger, GEMLCA

Web Services Resource Framework (WSRF)

“Stateless” vs. “Stateful” Services
- Without state, how does client:
  - Determine what happened (success/failure)?
  - Find out how many files completed?
  - Receive updates when interesting events arise?
  - Terminate a request?
- Few useful services are truly “stateless”, but WS interfaces alone do not provide built-in support for state
[Diagram: Client calls move (A to B) on the FileTransfer Service]

FileTransferService (without WSRF)
- Developer reinvents wheel for each new service
  - Custom management and identification of state: transferID
  - Custom operations to inspect state synchronously (whatHappen) and asynchronously (tellMeWhen)
  - Custom lifetime operation (cancel)
[Diagram: Client calls move (A to B) on the FileTransfer Service and gets back a transferID; the service holds the move state and exposes whatHappen, tellMeWhen, cancel]
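
To make the "reinvented wheel" concrete, here is a minimal in-memory sketch of such a service in Python. It is not Globus code: the class name and the snake_case method names are hypothetical stand-ins for the slide's move, whatHappen, tellMeWhen, and cancel, and no transfer actually takes place.

```python
import itertools


class AdHocFileTransferService:
    """Hypothetical service that hand-rolls its own state handling."""

    def __init__(self):
        self._ids = itertools.count(1)
        self._transfers = {}  # transferID -> state of that transfer
        self._watchers = {}   # transferID -> callbacks to notify

    def move(self, src, dst):
        """Start a transfer and return a service-specific transferID."""
        transfer_id = next(self._ids)
        self._transfers[transfer_id] = {"src": src, "dst": dst, "status": "active"}
        self._watchers[transfer_id] = []
        return transfer_id

    def what_happen(self, transfer_id):
        """Custom synchronous inspection (the slide's whatHappen)."""
        return self._transfers[transfer_id]["status"]

    def tell_me_when(self, transfer_id, callback):
        """Custom asynchronous inspection (the slide's tellMeWhen)."""
        self._watchers[transfer_id].append(callback)

    def cancel(self, transfer_id):
        """Custom lifetime operation (the slide's cancel)."""
        self._set_status(transfer_id, "cancelled")

    def _set_status(self, transfer_id, status):
        self._transfers[transfer_id]["status"] = status
        for callback in self._watchers[transfer_id]:
            callback(transfer_id, status)
```

Every operation besides move exists only to manage state, and a second service written this way would need its own copies under different names.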

WSRF in a Nutshell
- Service
- State representation
  - Resource
  - Resource Property
- State identification
  - Endpoint Reference
- State Interfaces
  - GetRP, QueryRPs, GetMultipleRPs, SetRP
- Lifetime Interfaces
  - SetTerminationTime
  - ImmediateDestruction
- Notification Interfaces
  - Subscribe
  - Notify
- ServiceGroups
[Diagram: a Service fronting a Resource with RPs, addressed by an EPR; operations shown are GetRP, GetMultRPs, SetRP, QueryRPs, Subscribe, SetTermTime, Destroy]

FileTransferService (w/ WSRF)
- Developer specifies custom method to createResource and leaves the rest to WSRF standards:
  - State exposed as Resource + Resource Properties and identified by Endpoint Reference (EPR)
  - State inspected by standard interfaces (GetRP, QueryRPs)
  - Lifetime management by standard interfaces (Destroy)
[Diagram: Client calls createResource (A to B) on FileTransferService and gets back an EPR; the Transfer resource exposes its RPs through getRP, queryRPs, destroy]
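
The division of labor on this slide can be sketched as an in-memory analogy in Python. Real WSRF is defined over SOAP/XML messages, so this only mirrors the shape of the pattern. ResourceHome, the dictionary-based EPR, the property names, and the example.org address are all assumptions made for illustration, not GT4 APIs.

```python
import uuid


class ResourceHome:
    """Generic state handling, written once and reused by any service."""

    def __init__(self, service_address):
        self.service_address = service_address
        self._resources = {}  # resource key -> resource properties

    def create(self, properties):
        key = str(uuid.uuid4())
        self._resources[key] = dict(properties)
        # The EPR pairs the service address with a key naming one resource.
        return {"address": self.service_address, "resource_key": key}

    def get_rp(self, epr, name):
        """Analogue of GetRP: read one resource property."""
        return self._resources[epr["resource_key"]][name]

    def query_rps(self, epr, predicate):
        """Analogue of QueryRPs: select the properties matching a predicate."""
        props = self._resources[epr["resource_key"]]
        return {n: v for n, v in props.items() if predicate(n, v)}

    def destroy(self, epr):
        """Analogue of Destroy: end the resource's lifetime."""
        del self._resources[epr["resource_key"]]


class FileTransferService:
    """The only custom operation is the one that creates the resource."""

    def __init__(self):
        # Placeholder address, not a real endpoint.
        self.home = ResourceHome("https://example.org/FileTransferService")

    def create_resource(self, src, dst):
        return self.home.create(
            {"source": src, "destination": dst, "status": "active", "filesCompleted": 0}
        )
```

A client holding the returned EPR inspects and destroys the transfer through the generic operations, so a different service could reuse ResourceHome unchanged.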

The Introduce Authoring Tool
- Define service
- Create skeleton
- Discover types
- Add operations
- Configure security
- Modify service
Generates GT4-compatible Web Services
Introduce: Hastings, Saltz, et al., Ohio State University

Defining Community: Membership and Laws
- Identify VO participants and roles
  - For people and services
- Specify and control actions of members
  - Empower members → delegation
  - Enforce restrictions → federate policy
[Diagram labels: Access granted by community to user; Site admission-control policies; Effective Access Policy of site to community]
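
One reading of the diagram is that a user's effective rights at a site are the overlap between what the community grants the user and what the site allows the community. A minimal Python sketch under that assumption follows; the right names are invented for illustration.

```python
def effective_access(community_grants, site_policy):
    """Rights a user can exercise: granted by the community AND allowed by the site."""
    return community_grants & site_policy


# Hypothetical rights, for illustration only.
site_policy = frozenset({"read", "write", "submit-job"})       # site -> community
community_grants = frozenset({"read", "submit-job", "admin"})  # community -> user

allowed = effective_access(community_grants, site_policy)
```

Here "admin" is dropped because the site never offered it to the community, and "write" is dropped because the community did not pass it on to this user.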

Courtesy: DOE report, LBNL: Authorization & Attribute Certificates for Widely Distributed Access Control
[Diagram labels: XACML, SAML]

Globus Authorization Framework
[Diagram labels: VOMS, Shibboleth, LDAP, PERMIS, …; GT4 Client; GT4 Server; PIP; Attributes; PDP; Authorization Decision]
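
The general pattern behind the PIP and PDP labels can be sketched as follows: policy information points (PIPs) gather attributes about the caller, and policy decision points (PDPs) turn those attributes into a decision. This Python sketch is not the GT4 API. The function names, the attribute values, and the rule that every PDP must permit are assumptions chosen for illustration.

```python
from typing import Callable, Dict, List

Attributes = Dict[str, str]
PIP = Callable[[str], Attributes]        # subject -> attributes about the subject
PDP = Callable[[Attributes, str], bool]  # (attributes, action) -> permit?


def authorize(subject: str, action: str, pips: List[PIP], pdps: List[PDP]) -> bool:
    """Collect attributes from every PIP, then require every PDP to permit."""
    attributes: Attributes = {"subject": subject}
    for pip in pips:
        attributes.update(pip(subject))
    return all(pdp(attributes, action) for pdp in pdps)


def vo_membership_pip(subject: str) -> Attributes:
    """Hypothetical attribute source standing in for a VO membership service."""
    return {"vo": "ocean-obs", "role": "analyst"}


def role_pdp(attributes: Attributes, action: str) -> bool:
    """Hypothetical policy: analysts may read and nothing else."""
    return attributes.get("role") == "analyst" and action == "read"
```

Calling authorize("alice", "read", [vo_membership_pip], [role_pdp]) permits the request, while the same call with "delete" is denied.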

Swift System
- Clean separation of logical/physical concerns
  - XDTM specification of logical data structures
+ Concise specification of parallel programs
  - SwiftScript, with iteration, etc.
+ Efficient execution on distributed resources
  - Karajan threading, Falkon provisioning, Globus interfaces, pipelining, load balancing
+ Rigorous provenance tracking and query
  - Virtual data schema & automated recording
→ Improved usability and productivity
  - Demonstrated in numerous applications

Workflow Language - SwiftScript
- Goal: Natural feel to expressing distributed applications
  - Variables (basic, data structures)
  - Conditional operators (if, foreach, …)
  - Functions (atomic / compound)
- Used to connect outputs to inputs
- It does not specify invocation order, only dependencies
- It can be seen as metadata for expressing experiments
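
The point about dependencies versus invocation order can be illustrated outside SwiftScript. The Python sketch below is not SwiftScript and not how Swift is implemented. It assumes a hypothetical four-step workflow in which each step declares only the data it consumes and produces, and uses the standard-library graphlib module (Python 3.9+) to derive which steps may run together.

```python
from graphlib import TopologicalSorter

# Hypothetical workflow: each step names its inputs and outputs.
# Nothing here says which step runs first.
steps = {
    "report": {"inputs": ["stats", "figure"], "outputs": ["summary"]},
    "plot": {"inputs": ["clean"], "outputs": ["figure"]},
    "analyze": {"inputs": ["clean"], "outputs": ["stats"]},
    "reformat": {"inputs": ["raw"], "outputs": ["clean"]},
}


def dependency_graph(steps):
    """Map each step to the steps that produce its inputs."""
    producer = {out: name for name, s in steps.items() for out in s["outputs"]}
    return {
        name: {producer[i] for i in s["inputs"] if i in producer}
        for name, s in steps.items()
    }


def execution_waves(steps):
    """Group steps into waves; steps in one wave have no mutual dependencies."""
    sorter = TopologicalSorter(dependency_graph(steps))
    sorter.prepare()
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        sorter.done(*ready)
    return waves
```

For this workflow, execution_waves(steps) yields reformat first, then analyze and plot together, then report, even though the steps were written in the opposite order.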

Execution Engine
- Karajan engine (event-based execution)
- Has a scheduler to map tasks to resources
  - Score-based planning
  - Recovers from failures (retries)
- Falkon resource manager creates a “private cluster”
  - Uses Globus GRAM4 (PBS/Condor/Fork) to acquire resources from Grid systems
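
The slide does not say how scores are computed, so the following Python sketch only illustrates the general idea of score-based site selection with retries. The class, the starting score, the penalty and reward values, and the two example sites are all assumptions. None of it is taken from Karajan or Swift.

```python
class ScoreScheduler:
    """Send each task to the best-scoring site; penalize failures and retry."""

    def __init__(self, sites, max_attempts=3):
        self.sites = sites  # site name -> callable(task), may raise RuntimeError
        self.scores = {name: 1.0 for name in sites}
        self.max_attempts = max_attempts

    def run(self, task):
        tried = []
        for _ in range(self.max_attempts):
            site = max(self.scores, key=self.scores.get)
            tried.append(site)
            try:
                result = self.sites[site](task)
            except RuntimeError:
                self.scores[site] *= 0.5  # arbitrary penalty, then retry
                continue
            self.scores[site] += 0.1      # arbitrary reward for success
            return result, tried
        raise RuntimeError(f"{task!r} failed after {self.max_attempts} attempts")


def flaky_site(task):
    """Hypothetical site that always fails."""
    raise RuntimeError("site unavailable")


def steady_site(task):
    """Hypothetical site that always succeeds."""
    return f"{task} done"
```

With ScoreScheduler({"flaky": flaky_site, "steady": steady_site}), the first task is tried on the flaky site, fails, and is retried on the steady site. Later tasks go straight to the steady site because the flaky site's score has dropped.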

Dynamic Provisioning: Swift Architecture
[Diagram with stages Specification, Scheduling, Execution, and Provisioning. Labels: SwiftScript; Abstract computation; Virtual Data Catalog; SwiftScript Compiler; Execution Engine (Karajan w/ Swift Runtime); Swift runtime callouts; Status reporting; Virtual Node(s); launcher; App F1; App F2; file1; file2; file3; Provenance data; Provenance collector; Resource Provisioners; Falkon/VWS; Amazon EC2]
Yong Zhao, Mihael Hategan, Ioan Raicu, Mike Wilde, Ben Clifford

Globus Philosophy
- Globus was first established as an open source project in 1996
- The Globus Toolkit is open source to:
  - Allow for inspection
    - for consideration in standardization processes
  - Encourage adoption
    - in pursuit of ubiquity and interoperability
  - Encourage contributions
    - harness the expertise of the community
- The Globus Toolkit is distributed under the (BSD-style) Apache License version 2

dev.globus
- Governance model based on Apache Jakarta
  - Consensus based decision making
- Globus software is organized as several dozen “Globus Projects”
  - Each project has its own “Committers” responsible for their products
  - Cross-project coordination through shared interactions and committers meetings
- A “Globus Management Committee”
  - Overall guidance and conflict resolution

[Screenshot labels: Guidelines (Apache Jakarta); Infrastructure (CVS, , bugzilla, Wiki); Projects Include …]

Open Source != “Free time”
- Globus development is well-funded
  - The open source model facilitates contributions
  - NSF and DOE sponsor Globus development at several institutions via multiple grants, totaling >$5M/yr
  - Non-U.S. science agencies also contribute to Globus development
  - Corporations also sponsor developers
- NSF explicitly funds Globus improvements
  - CDIGS: Community-Driven Improvements to Globus Software

Parting Thoughts
- There is a wealth of available Grid technology that can be leveraged for this project
  - Security mechanisms, workflow, and other services
- The Globus Toolkit is funded by NSF to support scientific communities.