GCE06, Tampa, FL November 12-13, 2006 Science Gateways on the TeraGrid Charlie Catlett, Sebastien Goasguen, Jim Marsteller, Stuart Martin, Don Middleton,

Slides:



Advertisements
Similar presentations
GridShib Tom Barton, U Chicago. 2 Grid Computing Distributed computing and/or data resources Heterogeneous computing & storage environments Interfaces.
Advertisements

Scaling TeraGrid Access A Testbed for Attribute-based Authorization and Leveraging Campus Identity Management
1 US activities and strategy :NSF Ron Perrott. 2 TeraGrid An instrument that delivers high-end IT resources/services –a computational facility – over.
Earth System Curator Spanning the Gap Between Models and Datasets.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Education in the Science 2.0 Era.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Authorizing Grid Resource Access and Consumption Erik Elmroth, Michał.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
TeraGrid Science Gateway AAAA Model: Implementation and Lessons Learned Jim Basney NCSA University of Illinois Von Welch Independent.
Network, Operations and Security Area Tony Rimovsky NOS Area Director
NOS Objectives, YR 4&5 Tony Rimovsky. 4.2 Expanding Secure TeraGrid Access A TeraGrid identity management infrastructure that interoperates with campus.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
CCSM Portal/ESG/ESGC Integration (a PY5 GIG project) Lan Zhao, Carol X. Song Rosen Center for Advanced Computing Purdue University With contributions by:
GIG Software Integration: Area Overview TeraGrid Annual Project Review April, 2008.
SAN DIEGO SUPERCOMPUTER CENTER Science Gateways on the TeraGrid Nancy Wilkins-Diehr TeraGrid Area Director for Science Gateways SDSC Director of Consulting,
April 2006 Science Gateways on the TeraGrid Nancy Wilkins-Diehr Area Director for Science Gateways San Diego Supercomputer Center
TeraGrid Information Services John-Paul “JP” Navarro TeraGrid Grid Infrastructure Group “GIG” Area Co-Director for Software Integration and Information.
Open Science Grid For CI-Days Internet2: Fall Member Meeting, 2007 John McGee – OSG Engagement Manager Renaissance Computing Institute.
Science Gateways on the TeraGrid Nancy Wilkins-Diehr Area Director for Science Gateways San Diego Supercomputer Center
TeraGrid Science Gateways: Scaling TeraGrid Access Aaron Shelmire¹, Jim Basney², Jim Marsteller¹, Von Welch²,
Advancing Scientific Discovery through TeraGrid Scott Lathrop TeraGrid Director of Education, Outreach and Training University of Chicago and Argonne National.
August 2007 Advancing Scientific Discovery through TeraGrid Scott Lathrop TeraGrid Director of Education, Outreach and Training University of Chicago and.
CoG Kit Overview Gregor von Laszewski Keith Jackson.
TeraGrid VO Support and Plans for AAA Testbed Dane Skow, Deputy Director TeraGrid University of Chicago / Argonne National Laboratory Internet2 Member.
August 2007 Advancing Scientific Discovery through TeraGrid Adapted from S. Lathrop’s talk in SC’07
SAN DIEGO SUPERCOMPUTER CENTER NUCRI Advisory Board Meeting November 9, 2006 Science Gateways on the TeraGrid Nancy Wilkins-Diehr TeraGrid Area Director.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
TeraGrid Overview Cyberinfrastructure Days Internet2 10/9/07 Mark Sheddon Resource Provider Principal Investigator San Diego Supercomputer Center
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Open Science Grid For CI-Days Elizabeth City State University Jan-2008 John McGee – OSG Engagement Manager Manager, Cyberinfrastructure.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
GridShib: Grid/Shibboleth Interoperability September 14, 2006 Washington, DC Tom Barton, Tim Freeman, Kate Keahey, Raj Kettimuthu, Tom Scavo, Frank Siebenlist,
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
10/24/2015OSG at CANS1 Open Science Grid Ruth Pordes Fermilab
Federated Environments and Incident Response: The Worst of Both Worlds? A TeraGrid Perspective Jim Basney Senior Research Scientist National Center for.
TeraGrid CTSS Plans and Status Dane Skow for Lee Liming and JP Navarro OSG Consortium Meeting 22 August, 2006.
Tutorial: Building Science Gateways TeraGrid 08 Tom Scavo, Jim Basney, Terry Fleury, Von Welch National Center for Supercomputing.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Leveraging the InCommon Federation to access the NSF TeraGrid Jim Basney Senior Research Scientist National Center for Supercomputing Applications University.
SC06, Tampa FL November 11-17, 2006 Science Gateways on the TeraGrid Powerful Beyond Imagination! Nancy Wilkins-Diehr TeraGrid Area Director for Science.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Riding the Crest: High-End Cyberinfrastructure Experiences and Opportunities on the NSF TeraGrid A Panel Presentation by Laura M c GinnisRadha Nandkumar.
1 NSF/TeraGrid Science Advisory Board Meeting July 19-20, San Diego, CA Brief TeraGrid Overview and Expectations of Science Advisory Board John Towns TeraGrid.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
1 Overall Architectural Design of the Earth System Grid.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
2005 GRIDS Community Workshop1 Learning From Cyberinfrastructure Initiatives Grid Research Integration Development & Support
Network, Operations and Security Area Tony Rimovsky NOS Area Director
TeraGrid Overview John-Paul “JP” Navarro TeraGrid Area Co-Director for Software Integration University of Chicago/Argonne National Laboratory March 25,
Attribute-based Authentication for Gateways Jim Basney Terry Fleury Stuart Martin JP Navarro Tom Scavo Nancy Wilkins-Diehr.
AT LOUISIANA STATE UNIVERSITY CCT: Center for Computation & Technology Introduction to the TeraGrid Daniel S. Katz Lead, LONI as a TeraGrid.
October 2007 TeraGrid : Advancing Scientific Discovery and Learning Diane A. Baxter, Ph.D. Education Director San Diego Supercomputer Center University.
PEER 2003 Meeting 03/08/031 Interdisciplinary Framework Major focus areas Structural Representation Fault Systems Earthquake Source Physics Ground Motions.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
SAN DIEGO SUPERCOMPUTER CENTER Science Gateways on the TeraGrid Nancy Wilkins-Diehr TeraGrid Area Director for Science Gateways SDSC Director of Consulting,
TeraGrid’s Process for Meeting User Needs. Jay Boisseau, Texas Advanced Computing Center Dennis Gannon, Indiana University Ralph Roskies, University of.
Gateways security Aashish Sharma Security Engineer National Center for Supercomputing Applications (NCSA) University of Illinois at Urbana-Champaign.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
TeraGrid Software Integration: Area Overview (detailed in 2007 Annual Report Section 3) Lee Liming, JP Navarro TeraGrid Annual Project Review April, 2008.
Accessing the VI-SEEM infrastructure
Clouds , Grids and Clusters
Data Management Components for a Research Data Archive
Presentation transcript:

GCE06, Tampa, FL November 12-13, 2006 Science Gateways on the TeraGrid Charlie Catlett, Sebastien Goasguen, Jim Marsteller, Stuart Martin, Don Middleton, Kevin J. Price, Anurag Shankar, Von Welch, Nancy Wilkins-Diehr

GCE06, Tampa, FL November 12-13, 2006 Today’s Topics TeraGrid Background – 5 min. Gateway integration issues – 15 min. –Accounting GRAM –Security Commsh Attribute-based authentication –Metrics Future work – 10 min. –Gateway primer –Best practices

GCE06, Tampa, FL November 12-13, 2006 What is the TeraGrid? NSF-funded facility to offer high end compute, data and visualization resources to the nation’s academic researchers

GCE06, Tampa, FL November 12-13, 2006 TeraGrid Technology Data 18.8 Petabytes Storage Memory Intensive Resources Computation Visualization 100+ Teraflops Computation 40gigabit/second cross-country network

GCE06, Tampa, FL November 12-13, 2006 Over 100 Tflops in Computing Power

GCE06, Tampa, FL November 12-13, 2006 TeraGrid Resources Available to Academic Researchers at No Cost TeraGrid creates integrated, persistent, and pioneering computational resources that significantly improve our nation’s ability and capacity to gain new insights into our most challenging research questions and societal problems. Proposal-based access, researchers can use resources at no cost –Collaborative opportunities, but Principal Investigators must be from the U.S.

GCE06, Tampa, FL November 12-13, 2006 Gateways are part of TeraGrid’s 3-pronged strategy to further science DEEP Science: Enabling Terascale Science –Make science more productive through an integrated set of very- high capability resources Advanced Support for TeraGrid Applications (ASTA) projects WIDE Impact: Empowering Communities –Bring TeraGrid capabilities to the broad science community Science Gateways OPEN Infrastructure, OPEN Partnership –Provide a coordinated, general purpose, reliable set of services and resources Grid interoperability working group

GCE06, Tampa, FL November 12-13, 2006 Science Gateways A new initiative for the TeraGrid Increasing investment by communities in their own cyberinfrastructure, but heterogeneous: Resources Users – from expert to K-12 Software stacks, policies Science Gateways –Provide “TeraGrid Inside” capabilities –Leverage community investment Three common forms: –Web-based Portals –Application programs running on users' machines but accessing services in TeraGrid –Coordinated access points enabling users to move seamlessly between TeraGrid and other grids. Workflow Composer

GCE06, Tampa, FL November 12-13, 2006 Gateways are growing in numbers 10 initial projects as part of TG proposal >20 Gateway projects today No limit on how many gateways can use TG resources –Prepare services and documentation so developers can work independently Open Science Grid (OSG) Special PRiority and Urgent Computing Environment (SPRUCE) National Virtual Observatory (NVO) Linked Environments for Atmospheric Discovery (LEAD) Computational Chemistry Grid (GridChem) Computational Science and Engineering Online (CSE- Online) GEON(GEOsciences Network) Network for Earthquake Engineering Simulation (NEES) SCEC Earthworks Project Network for Computational Nanotechnology and nanoHUB GIScience Gateway (GISolve) Biology and Biomedicine Science Gateway Open Life Sciences Gateway The Telescience Project Grid Analysis Environment (GAE) Neutron Science Instrument Gateway TeraGrid Visualization Gateway, ANL BIRN Gridblast Bioinformatics Gateway Earth Systems Grid Astrophysical Data Repository (Cornell) Many others interested –SID Grid –HASTAC

GCE06, Tampa, FL November 12-13, 2006 TeraGrid Background – 5 min. Gateway integration issues – 15 min. –Accounting GRAM –Security Commsh Attribute-based authentication –Metrics Future work – 10 min. –Gateway primer –Best practices

GCE06, Tampa, FL November 12-13, 2006 What Did We Learn About Common Gateway Requirements? Accounting –Support for accounts with differing capabilities –Ability to associate compute job to a individual portal user –Scheme for portal registration and usage tracking –Dynamic accounts Security –Community account privileges –Need to identify human responsible for a job for incident response –Acceptance of other grid certificates Web Services –Many will build on the Globus Toolkit, but additional interfaces may be needed –Web Service security –Interfaces to scheduling and account management are common requirements Software –Interoperability of software stacks between TeraGrid and peer grids –Software installations for gateways across all TG sites –Community software areas –Management (pacman, other options)

GCE06, Tampa, FL November 12-13, 2006 In Today’s Talk Per job accounting Secured community accounts Federated identity management Metrics for Success Futures –Primer –Gateway Best Practices

GCE06, Tampa, FL November 12-13, 2006 “Per Job” Accounting is Key Functionality for Gateways Common gateway structure – Web front end, users log on to gateway –Jobs run as single user on TeraGrid –Need to tie usage to individual users Globus used by many gateway developers to access TeraGrid resources GRAM operates in a fire and forget mode –When a job finishes there is no straightforward way to determine how many CPU hours the job consumed –That information is critical to attributing usage to individual users using a Science Gateway account on the TeraGrid

GCE06, Tampa, FL November 12-13, 2006 GRAM Audit Extension Provides Need Accountability GRAM2 and GRAM4 services were enhanced to create audit records that are written to a database local to the GRAM services Enhancements provide a persistent link between the grid service’s job id and the local resource manager’s (LRM) job id Open Grid Services Architecture-Data Access and Integration (OGSA-DAI) provide a service interface for TeraGrid’s audit and accounting information

GCE06, Tampa, FL November 12-13, 2006 Individual Usage Tracking Now Possible Gateways can remotely submit jobs to TeraGrid and –Account for usage on a per job basis without needing to understand the details of the various local resource managers chosen by TeraGrid resource providers. Capability will be very useful for other projects using Globus where per-job usage information is needed. Enhancements reduce the complexity for gateways to interface with TeraGrid’s computational resources –Allows TeraGrid to simultaneously support an increasing number of gateways.

GCE06, Tampa, FL November 12-13, 2006 Linked Environments for Atmospheric Discovery Providing tools that are needed to make accurate predictions of tornados and hurricanes Meteorological data Forecast models Analysis and visualization tools Data exploration and Grid workflow

GCE06, Tampa, FL November 12-13, 2006 Securing Community Accounts Additional risks arise when providing community account and web interfaces to high performance resources. TeraGrid security working group analyzing risks developing mitigation approaches. –Sites may take independent approaches to risk mitigation One approach being developed at NCSA is the Community Shell, or Commsh Commsh allows for two methods of account restriction: –a configuration file is created that defines which commands (or sets of commands) a given account can execute. –commands can be specified using wildcards and regular expressions for flexibility –change-root (or chroot) jailing. Change-root jailing effectively creates a filesystem-based "sandbox" for the account, only allowing commands to be executed from within this sandbox

GCE06, Tampa, FL November 12-13, 2006 NanoHub Harnesses TeraGrid for Education Nanotechnology education Used in dozens of courses at man universities Teaching materials Collaboration space Research seminars Modeling tools Access to cutting edge research software And much more

GCE06, Tampa, FL November 12-13, 2006 Federated Identity Management Traditionally each resource or resource-providing site was responsible for the management of their users identities The science gateway model brought an out-sourcing of identity management from the resource to the gateway For maximum scalability, the goal is to shift identity management all the way back to the user’s home institution and leverage the existing identity management infrastructure Mechanisms to achieve this based on Shibboleth, GridShib, myVocs are other technologies are currently being evaluated by TeraGrid

GCE06, Tampa, FL November 12-13, 2006 Individual User Environment Resource TGCDB uid project (G)Id Grant Process Use cases: Traditional users, Development O(10) O(1) O(10) O(1000) O(1)

GCE06, Tampa, FL November 12-13, 2006 Authenticated User Environment Resource TGCDB project (G)Id Grant Process Use cases: Grid-savvy user communities, Production runs, user managed services uid O(10) ? O(10) O(1) O(10) O(1)

GCE06, Tampa, FL November 12-13, 2006 Gateway Gateway Environment Resource TGCDB project Grant Process Use cases: Large communities of users, novice users, public uid GId ComId O(1) ? O(10) O(1) O(10) O(1) ? O(1)O(100) ? O(1000) O(100) O(1000) O(100) O( ) ? O(1)

GCE06, Tampa, FL November 12-13, 2006 Community Gateway Accounts Shift authentication and authorization from RP to the Science Gateway Whole community then appears as “one” user to the RP in terms of authorization –One grid-mapfile and /etc/password entry or perhaps (a mapped set of) virtual machine images –Except accounting and troubleshooting. We still need an individual identifier

GCE06, Tampa, FL November 12-13, 2006 The Proposal Plan for a world where users can be authenticated via their home campus identity management system Enable attribute-based authorization of users by RP site –Allow for user authentication with authorization by community Prototype system in testbed, with involvement of interested parties to work out issues All usage still billed to an allocation –Community or individual

GCE06, Tampa, FL November 12-13, 2006 Metrics Metrics of success are commonly requested for government funded programs Successful gateway design will allow principal investigators to highlight gateway usage as well as science accomplishments due to the gateway –Some gateways may set up a mechanism for researchers to cite the use of the gateway in publications Success both in funding the gateway and in requesting TeraGrid resources can be traced to scientific accomplishments and a history of publications

GCE06, Tampa, FL November 12-13, 2006 Sample Metrics Collected by ESG The DOE-sponsored Earth System Grid (ESG) project includes a Metrics Service that tracks –logins –file and aggregation downloads –browse and search requests –total volume of activity conducted via its portal. This information is very useful to principal investigators and sponsors in terms of determining the overall impact of the project As ESG begins to utilize TeraGrid resources, it will need to track computational and data services that are delivered to it as a Science Gateway

GCE06, Tampa, FL November 12-13, 2006 NCAR Earth System Grid Science Gateway for climate research –Enabling analysis and understanding gained from global Earth System computational models ESG originally a distributed data management/access system but it has evolved into more. User registration, authorization controls, and metrics tracking CCSM model source, initialization datasets, post-processing codes, and analysis and visualization tools. Prototypes of model- submission environments –Eventually real-time tracking of model status along with references to available output datasets. Expect to see more model runs at higher- resolution and with greater component scope.

GCE06, Tampa, FL November 12-13, 2006

GCE06, Tampa, FL November 12-13, 2006 TeraGrid Background – 5 min. Gateway integration issues – 15 min. –Accounting GRAM –Security Commsh Attribute-based authentication –Metrics Future work – 10 min. –Gateway primer –Best practices

GCE06, Tampa, FL November 12-13, 2006 Science Gateway Primer Primer components –TeraGrid resources and services available to Science Gateways –Requirements for using TeraGrid resources –Best practices when designing a gateway –Software contribution area Wiki-based –Very dynamic development community –Counting on them for contributions –

GCE06, Tampa, FL November 12-13, 2006 TeraGrid Resources and Services Compute, data and visualization resources Software –Common TeraGrid Software Stack (CTSS) –Third party applications on a variety of platforms –Community Software Area for user-maintained software –Software packaging and distribution mechanisms Accounting services –developer accounts –community accounts –in the future, dynamic accounts External relations staff available to help publicize successes

GCE06, Tampa, FL November 12-13, 2006 Gateway Requirements Additional information to be provided when requesting community accounts –IP address of portal –Data and compute expectations Recommended audit trails for usage tracking Mechanisms to restrict problem jobs

GCE06, Tampa, FL November 12-13, 2006 Lots of Best Practices! Thanks Anurag Planning Assess If Gateway-ing Adds Any Value Create a Precise List of Requirements the GW Must Meet Plan for the Long Term (= GW Lifetime) Design Use Formal Design Principles Involve Users in the Design Use Mockups to Perform Usability Testing Design a Focused and Uncluttered UI Implementation Choose Technologies Based on Resources/Time Hire/Use Developers with UI Experience Develop in Stages Use Reusable Components Operation Monitor Gateway Components 24x7 Institute Help Desk ProcessesMonitor & Implement New Technologies Keep Content Current/Relevant Desirable Gateway Characteristics Universal, Secure Access Ability to Personalize Based on Open Standards (JSR 168/286, OGSA, etc.) Use of Modular, Reusable Design (Use Portlets) Use of Technologies With a Rich API/Abstraction Layer Platform Independence (Web, Java, XML, etc.) Rapid Development Capability Ease of Integration into Existing Infrastructure Availability of Workflows Use of Commodity Software Airtight Security Extensibility Maintainability Scalability Extensive Help and documentation

GCE06, Tampa, FL November 12-13, 2006 Would development of a gateway help your research? Think about your current bottlenecks –What would you like to explore if only you had Lots of disk Lots of compute resources Powerful analysis capabilities A nice interface to information

GCE06, Tampa, FL November 12-13, 2006 Gateways in Tampa Gateway BOF 11/14, 5:30pm rooms Gateway talks at many TG booths – Nancy Wilkins-Diehr,