Monitoring and Accounting on the NGS Guy Warner NeSC TOE Team.

Slides:



Advertisements
Similar presentations
The National Grid Service Mike Mineter.
Advertisements

NGS computation services: API's,
The National Grid Service and OGSA-DAI Mike Mineter
LeadManager™- Internet Marketing Lead Management Solution May, 2009.
IWay Service Manager 6.1 Product Update Scott Hathaway iWay Software Copyright 2010, Information Builders. Slide 1.
Page 1 More information at; gaddsoftware.comgaddsoftware.com.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
Monitoring and performance measurement in Production Grid Environments David Wallom.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 11: Monitoring Server Performance.
Chapter 14 Chapter 14: Server Monitoring and Optimization.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
Distributed Systems Management What is management? Strategic factors (planning, control) Tactical factors (how to do support the strategy practically).
Maintaining and Updating Windows Server 2008
Barracuda Networks Confidential1 Barracuda Backup Service Integrated Local & Offsite Data Backup.
16.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 16: Examining Software Update.
M ONITORING SERVER PERFORMANCE Unit objectives Use Task Manager to monitor server performance and resource usage Use Event Viewer to identify and troubleshoot.
Module 15: Monitoring. Overview Formulate requirements and identify resources to monitor in a database environment Types of monitoring that can be carried.
CNJohnson & Associates, Inc An Overview of Chargeback Best Practices.
Hall D Online Data Acquisition CEBAF provides us with a tremendous scientific opportunity for understanding one of the fundamental forces of nature. 75.
Hands-On Microsoft Windows Server 2008
1 Guide to Novell NetWare 6.0 Network Administration Chapter 13.
OSG Public Storage and iRODS
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.
Next Steps Guy Warner
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
Guide to Linux Installation and Administration, 2e1 Chapter 2 Planning Your System.
Monitoring the Grid at local, national, and Global levels Pete Gronbech GridPP Project Manager ACAT - Brunel Sept 2011.
1 Network Monitoring Mi-Jung Choi Dept. of Computer Science KNU
The National Grid Service Guy Warner.
Maintaining and Updating Windows Server Monitoring Windows Server It is important to monitor your Server system to make sure it is running smoothly.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
SAN DIEGO SUPERCOMPUTER CENTER Inca TeraGrid Status Kate Ericson November 2, 2006.
SFC User Management USM: the new user management tool.
Next Steps: becoming users of the NGS Mike Mineter
1 Visalia Unified School District SRTS User Training November 21, 2005 By SRTS Support
Evolving Interfaces to Impacting Technology: The Mobile TeraGrid User Portal Rion Dooley, Stephen Mock, Maytal Dahan, Praveen Nuthulapati, Patrick Hurley.
Next Steps.
Creating and running an application.
SAN DIEGO SUPERCOMPUTER CENTER Inca Control Infrastructure Shava Smallen Inca Workshop September 4, 2008.
Grid Monitoring and Information Services: Globus Toolkit MDS4 & TeraGrid Inca Jennifer M. Schopf Argonne National Lab UK National eScience Center (NeSC)
Portal Update Plan Ashok Adiga (512)
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
1 Computer Maintenance Software Configuration: Evaluating Software Packages, Software Licensing, and Computer Protection through the Installation and Maintenance.
EGEE-0 / LCG-2 middleware Practical.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
Module 14 Monitoring and Maintaining Windows Server® 2008 Servers.
Rob Allan Daresbury Laboratory NW-GRID Training Event 26 th January 2007 Next Steps R.J. Allan CCLRC Daresbury Laboratory.
SAN DIEGO SUPERCOMPUTER CENTER Welcome to the 2nd Inca Workshop Sponsored by the NSF September 4 & 5, 2008 Presenters: Shava Smallen
The National Grid Service Mike Mineter.
EGEE is a project funded by the European Union under contract IST Information and Monitoring Services within a Grid R-GMA (Relational Grid.
Quality Assurance (QA) Working Group Update July 1, 2010 Kate Ericson (SDSC) Shava Smallen (SDSC)
Accounting in LCG Dave Kant CCLRC, e-Science Centre.
The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.
1 Chapter Overview Monitoring Access to Shared Folders Creating and Sharing Local and Remote Folders Monitoring Network Users Using Offline Folders and.
Charaka Palansuriya EPCC, The University of Edinburgh An Alarms Service for Federated Networks Charaka.
Monitoring Guy Warner NeSC Training.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Maintaining and Updating Windows Server 2008 Lesson 8.
Monitoring and Information Services Core Infrastructure (MIS-CI) Service Description Mark L. Green OSG Integration Workshop at UC Feb 15-17, 2005.
ATLAS Computing Wenjing Wu outline Local accounts Tier3 resources Tier2 resources.
Managing and Monitoring Windows 7 Performance
Next Steps.
Migration Strategies – Business Desktop Deployment (BDD) Overview
The Application Lifecycle
STATEL an easy way to transfer data
Presentation transcript:

Monitoring and Accounting on the NGS Guy Warner NeSC TOE Team

Policy for re-use This presentation can be re-used for academic purposes. However if you do so then please let know. We need to gather statistics of re-use: no. of events, number of people trained. Thank you!!

Acknowledgements The slides in this presentation are taken from presentations by: S. Pickering at the e-Science All Hands Meeting – J. Schopf – Talks/mds4Inca_lcg_nov2004.ppthttp://www-unix.mcs.anl.gov/~schopf/ Talks/mds4Inca_lcg_nov2004.ppt S. Smallen and K. Ericson at Super Computing 05. – K. Weeks – presentation at All Hands Meeting 2007

NGS Grid Monitoring Service Reliability Performance Monitoring Benchmarking Site Interoperability Certification Software Stack Validation Customisations Archiving Integration – PBS, GITS, Ganglia, INCA

Collecting Information System Administration –Operating System –Disk –Network –Problem detection User Information –Software/Modules –Queues –Resources

What is monitoring? Discovery and expression of data Discovery: –Registry service –Contains descriptions of data that is available –Sometimes also where last value of data is kept (caching) Expression of data –Access to sensors, archives, etc. –Producer (in consumer producer model)

What is Grid monitoring? Grid level monitoring concerns data that is: –Shared between administrative domains –For use by multiple people –Often summarized –(think scalability) Different levels of monitoring needed: –Application specific –Node level –Cluster/site Level –Grid level Grid monitoring may contain summaries of lower level monitoring

Grid Monitoring Does Not Include… All the data about every node of every site Years of utilization logs to use for planning next hardware purchase Low-level application progress details for a single user Application debugging data Point-to-point sharing of all data over all sites

INCA & Ganglia INCA –a framework for the automated testing, benchmarking and monitoring of Grid resources –INCA on the NGS - Ganglia –Each node broadcasts information (UDP Multicast) –One node listens –Good for current CPU/Memory usage –Ganglia on the NGS - Only the front page is available to users. You will get "Page not found" or equivalent errors if you try and drill down into ganglia.

Grid Accounting Accounting for any production grid is an important part of the monitoring process –Pricing policies may be introduced to grids in the future –To uphold policies relating to grid use and allocated hours –To monitor systems – particularly important for funding and future planning –To have an overview of the system – how much are we allocating? How much is being used? How much spare capacity do we have? How much are our biggest users using? It’s an issue many grids now face

Grid policing Users are allocated limited resources Important to know how much of those resources have been consumed Users tend to go over quota even when monitored Need to ‘lock-out’ users who go over quota There is an important distinction between accounting and policing Retain integrity of application and peer-review process

Policing the NGS User Accounting System (UAS) queries the RUS every day for total CPU and disk space for every user A warning is sent out when you reach 90% of your CPU allocation The account is automatically locked and an sent when you reach 100% of your CPU allocation

Policing the NGS (2) When an account is locked, you can apply for more resources –Via application form –Via your account details When your application is successful, your account is automatically updated with your new allocation and account is ‘active’ again An is sent to you letting you know you’re back within your limits Your account will be active within the hour.

Accessing your details Users wanted to know how much of their allocation they had used Certificate access to account details –Not supported by Oracle Apex –Needed a workaround to take certificate details from browser Also provides ability to change contact details Renewals can be done through their own account