OSG Operations and Interoperations Rob Quick Open Science Grid Operations Center - Indiana University EGEE Operations Meeting Stockholm, Sweden - 14 June.

Slides:



Advertisements
Similar presentations
LCG WLCG Operations John Gordon, CCLRC GridPP18 Glasgow 21 March 2007.
Advertisements

Jan 2010 Current OSG Efforts and Status, Grid Deployment Board, Jan 12 th 2010 OSG has weekly Operations and Production Meetings including US ATLAS and.
SCD FIFE Workshop - GlideinWMS Overview GlideinWMS Overview FIFE Workshop (June 04, 2013) - Parag Mhashilkar Why GlideinWMS? GlideinWMS Architecture Summary.
OSG Area Coordinators Meeting Operations Rob Quick 2/22/2012.
MyOSG: A user-centric information resource for OSG infrastructure data sources Arvind Gopu, Soichi Hayashi, Rob Quick Open Science Grid Operations Center.
Open Science Grid Software Stack, Virtual Data Toolkit and Interoperability Activities D. Olson, LBNL for the OSG International.
Rsv-control Marco Mambelli – Site Coordination meeting October 1, 2009.
OSG Services at Tier2 Centers Rob Gardner University of Chicago WLCG Tier2 Workshop CERN June 12-14, 2006.
Integration and Sites Rob Gardner Area Coordinators Meeting 12/4/08.
OSG Middleware Roadmap Rob Gardner University of Chicago OSG / EGEE Operations Workshop CERN June 19-20, 2006.
Publication and Protection of Site Sensitive Information in Grids Shreyas Cholia NERSC Division, Lawrence Berkeley Lab Open Source Grid.
Open Science Grid The OSG Accounting System: GRATIA by Philippe Canal (FNAL) & Matteo Melani (SLAC) Mumbai, India CHEP2006.
May 8, 20071/15 VO Services Project – Status Report Gabriele Garzoglio VO Services Project – Status Report Overview and Plans May 8, 2007 Computing Division,
G RID M IDDLEWARE AND S ECURITY Suchandra Thapa Computation Institute University of Chicago.
Apr 30, 20081/11 VO Services Project – Stakeholders’ Meeting Gabriele Garzoglio VO Services Project Stakeholders’ Meeting Apr 30, 2008 Gabriele Garzoglio.
SAMGrid as a Stakeholder of FermiGrid Valeria Bartsch Computing Division Fermilab.
Use of Condor on the Open Science Grid Chris Green, OSG User Group / FNAL Condor Week, April
J OINING OSG Suchandra Thapa Computation Institute University of Chicago.
Overview of Monitoring and Information Systems in OSG MWGS08 - September 18, Chicago Marco Mambelli - University of Chicago
OSG Software and Operations Plans Rob Quick OSG Operations Coordinator Alain Roy OSG Software Coordinator.
INFSO-RI Enabling Grids for E-sciencE EGEE 1 st EU Review – 9 th to 11 th February 2005 CERN.
Enabling Grids for E-sciencE EGEE-II INFSO-RI OSG-doc-498 Maite Barroso: Grid Operations LHCC review, CERN,25 th September Operations EGEE.
Incident Response Plan for the Open Science Grid Grid Operations Experience Workshop – HEPiX 22 Oct 2004 Bob Cowles – Work.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Job and Data Accounting on the Open Science Grid Ruth Pordes, Fermilab with thanks to Brian Bockelman, Philippe Canal, Chris Green, Rob Quick.
OSG Tier 3 support Marco Mambelli - OSG Tier 3 Dan Fraser - OSG Tier 3 liaison Tanya Levshina - OSG.
July 25, 20071/21 OSG Information Services Gabriele Garzoglio, Rob Quick, Chris Green OSG Information Services, VO Monitoring Services and Resource Selection.
BNL Tier 1 Service Planning & Monitoring Bruce G. Gibbard GDB 5-6 August 2006.
Grid Operations Lessons Learned Rob Quick Open Science Grid Operations Center - Indiana University.
Meeting Minutes and TODOs TG has no distributed monitoring. During incident response, use a manual twiki page to distribute information TG monitors the.
Microsoft Management Seminar Series SMS 2003 Change Management.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Site Validation Session Report Co-Chairs: Piotr Nyczyk, CERN IT/GD Leigh Grundhoefer, IU / OSG Notes from Judy Novak WLCG-OSG-EGEE Workshop CERN, June.
Status Organization Overview of Program of Work Education, Training It’s the People who make it happen & make it Work.
The OSG and Grid Operations Center Rob Quick Open Science Grid Operations Center - Indiana University ATLAS Tier 2-Tier 3 Meeting Bloomington, Indiana.
DTI Mission – 29 June LCG Security Ian Neilson LCG Security Officer Grid Deployment Group CERN.
RSV: OSG Grid Fabric Monitoring and Interoperation with WLCG Monitoring Systems Rob Quick, Arvind Gopu, and Soichi Hayashi Computing in High Energy and.
INFSO-RI Enabling Grids for E-sciencE An overview of EGEE operations & support procedures Jules Wolfrat SARA.
Operations Activity Doug Olson, LBNL Co-chair OSG Operations OSG Council Meeting 3 May 2005, Madison, WI.
Auditing Project Architecture VERY HIGH LEVEL Tanya Levshina.
April 25, 2006Parag Mhashilkar, Fermilab1 Resource Selection in OSG & SAM-On-The-Fly Parag Mhashilkar Fermi National Accelerator Laboratory Condor Week.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
Area Coordinator Report for Operations Rob Quick 4/10/2008.
Open Science Grid OSG Resource and Service Validation and WLCG SAM Interoperability Rob Quick With Content from Arvind Gopu, James Casey, Ian Neilson,
INFSO-RI Enabling Grids for E-sciencE Operations Parallel Session Summary Markus Schulz CERN IT/GD Joint OSG and EGEE Operations.
Operations Area Coordinator Report. 31 Jan Overview Operations Current Initiatives  RSV Version 2  New Probes, Easier Configuration, Improved.
User Support of WLCG Storage Issues Rob Quick OSG Operations Coordinator WLCG Collaboration Meeting Imperial College, London July 7,
Opensciencegrid.org Operations Interfaces and Interactions Rob Quick, Indiana University July 21, 2005.
Integration TestBed (iTB) and Operations Provisioning Leigh Grundhoefer.
OSG Status and Rob Gardner University of Chicago US ATLAS Tier2 Meeting Harvard University, August 17-18, 2006.
Grid Deployment Technical Working Groups: Middleware selection AAA,security Resource scheduling Operations User Support GDB Grid Deployment Resource planning,
RSV: OSG Grid Monitoring and User Customizable Views Rob Quick, Arvind Gopu, and Soichi Hayashi High Performance Distributed Computing Location: Munich,
March 2014 Open Science Grid Operations A Decade of HTC Infrastructure Support Kyle Gross Operations Support Lead Indiana University / Research Technologies.
OSG Facility Miron Livny OSG Facility Coordinator and PI University of Wisconsin-Madison Open Science Grid Scientific Advisory Group Meeting June 12th.
Open Science Grid Configuring RSV OSG Resource & Service Validation Thomas Wang Grid Operations Center (OSG-GOC) Indiana University.
Monitoring Working Group Update Grid Deployment Board 5 th December, CERN Ian Neilson.
Grid Colombia Workshop with OSG Week 2 Startup Rob Gardner University of Chicago October 26, 2009.
OSG Operations – Lessons Learned CHEP 2010, 18 October 15:10 (Asia/Taipei) – Room 2, BHSS OSG Operations – Lessons Learned CHEP 2010, 18 October 15:10.
James Casey, CERN IT-GD WLCG Workshop 1st September, 2007
Regional Operations Centres Core infrastructure Centres
Operations Interfaces and Interactions
Open Science Grid Progress and Status
Monitoring and Information Services Technical Group Report
POW MND section.
Incident Response Plan for the Open Science Grid
EGEE VO Management.
Grid Service Monitoring Working Group
Leigh Grundhoefer Indiana University
Presentation transcript:

OSG Operations and Interoperations Rob Quick Open Science Grid Operations Center - Indiana University EGEE Operations Meeting Stockholm, Sweden - 14 June 2007

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Outline How We Operate How We Interoperate What We Still Need to Do

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 The Open Science Grid Operations Center (GOC)

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 The Open Science Grid Operations Center (GOC) Critical Service Support Communication Hub Security Incident Response Provide Software Caches Coordinate Grid Wide Policy Problem Tracking and Resolution

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Critical Services Virtual Organization Resource Selector (VORS)

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Other Existing Information Services CEMon/BDII Integrated Server VOMS Monitor GIP Validator Gratia Account Data (FNAL) GridCat (Deprecated with Next Release) MonALISA (Deprecated with Next Release) Duplicated for Integration Test Bed

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Other Infrastructure Services VOMS (Infrastructure and Small VOs) Site Maintenance Tool Registration Database Critical Service Monitoring (Nagios)

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 OSG Communication Hub Trouble Ticketing System (Footprints) 24x7 Trouble Reporting and Ticket Creation OSG Twiki RSS Operations News Feed GOC Information Web Pages ( Weekly Operations Meeting (WLCG and OSG) Various Mailing Lists (osg-

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Security Response Technician on-call 24/7/365 to evaluate security incidents. Critical Incidents are Immediately Addressed with OSG Security Officer opensciencegrid.org 24/7/365 phone availability

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 OSG Software Cache OSG and ITB Caches Compute Element Configuration of Condor, PBS, LSF, SGE Worker Node Client Client VOMS GUMS Patches and Optional Components Coming Soon GOC Developed Packages Including Monitoring Probes

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Coordinate OSG Wide Policy Standard Operating Procedures Administrative Registration Information Policy Enforcement

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Problem Tracking and Solution OSG Ticketing System All Problems that cross out of a VO get ticketed. This includes peering grids (EGEE). GOC Operators follow up on all tickets to assure acceptable solution is found. Automated Exchange of tickets with some Larger VOs, Service Providers, and Peering Grids.

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 The Near Future for the OSG GOC Site Availability and Validation Project Focus on Getting Site Administrators Involved and Feeling Responsible for Maintaining a “Good Site” Series of probes based on standards of Grid Monitoring Working Group Probe data will eventually feed VORS and SAM Infrastructure being developed

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 OSG Information Management Project Consolidating Information Within the OSG Schema Being Developed Data Will Feed OSG Monitoring Tools (VORS, Information and Accounting Services) Project Includes Dashboards for Site Admins, Operations, VO Admins, and Others Views yet to be defined

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007

Other Projects Redundancy of Critical Services (Indianapolis and Bloomington) Ticket Metrics and Trending Smooth VO Additions Defining a “Good Site” and getting Site Admins interested in maintaining one Syslog-ng central log collection

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Interoperability Trouble Ticket Exchange with GGUS is in place. This has been in place ~1 year, time to revisit and add more functionality? Automatic routing to proper OSG Support Center Increased Reliability Weekly WLCG Operations Call Joint WLCG Operations Meeting

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 OSG Site Reporting to WLCG SAM Probe data exchange being discussed by Grid Monitoring Working Group Testing of Data Gathering and Exchange to be tested on next OSG ITB Release Reverse Flow of Status Data (Display in VORS?)

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Critical Services Virtual Organization Resource Selector (VORS)

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 What Next? We communicate on an administrative and operational level… We exchange trouble tickets… We will exchange status data… None of these things make us interoperable! Do we want jobs running cross grid boundaries?

R. Quick "WLCG-OSG-EGEE Interop" 26 Jan 2007 Thank You Special Thanks GOC Team: John Rosheck, Tim Silvers, Kyle Gross, and Arvind Gopu