OSG Storage VDT Support and Troubleshooting Concerns Tanya Levshina.

Support Challenges
Complicated, highly distributed services
Huge variety of configuration options (software and hardware)
Widely diverse utilization patterns
Poor error diagnostics, exception handling, and error propagation
Lack of monitoring and diagnostic tools
Support team does not have access to the service. Support personnel:
– often are not authorized to use the service as users
– cannot access site logs and configuration
– often cannot access storage monitoring pages on the site

Why do we need help from ET?
We need guidance on how to organize the support effort once Tier-2/Tier-3 sites are in “production” mode, and on how to:
Get access to
– the logs (all of them)
– configuration
– the dCache admin interface
Understand utilization patterns
Establish working relationships with site storage administrators as well as with the VOs using storage
Provide periodic training for site storage administrators to decrease the support and troubleshooting load
Increase pressure on developers
– to improve logging and error diagnostics
– to provide service monitoring

Potential Solutions
Maintain centralized logs (syslog-ng)
– Site keeps centralized logs and allows secured web access
– Site forwards logs to the central OSG log collector
Maintain centralized configuration
– Site provides secured web access to most configuration files
Give OSG Support access to each site's dCache administrative interface
– Could be done now by providing password or ssh-key based access
– The admin interface allows control of the system, so storage administrators are reluctant to allow that
Encourage sites to turn on Gratia storage probes, which could help us understand usage patterns better
Work with developers on improving logging
– Started to work with the dCache team
– Try to apply the troubleshooting framework provided by CEDPS to dCache logs
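
Illustration (not from the original slides): the "site forwards logs to the central OSG log collector" option can be sketched in a few lines. The slides name syslog-ng for this; the sketch below instead uses Python's standard-library SysLogHandler as a stand-in, simply to show the forwarding idea. The function name, the logger name "dcache-pool-demo", and the local UDP socket that plays the role of the collector are all hypothetical and chosen only for the demo.

```python
import logging
import logging.handlers
import socket


def make_forwarding_logger(name, collector_host, collector_port):
    """Return a logger that forwards records to a syslog collector over UDP.

    Hypothetical helper for illustration; a real site would more likely
    configure syslog-ng itself rather than forward from application code.
    """
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep the demo from also writing to the root logger
    handler = logging.handlers.SysLogHandler(
        address=(collector_host, collector_port),
        facility=logging.handlers.SysLogHandler.LOG_LOCAL0,
        socktype=socket.SOCK_DGRAM,
    )
    # Prefix each record with the logger name so the collector can tell sites apart.
    handler.setFormatter(logging.Formatter("%(name)s: %(levelname)s %(message)s"))
    logger.addHandler(handler)
    return logger


if __name__ == "__main__":
    # A local UDP socket stands in for the central collector (assumption for the demo).
    collector = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    collector.bind(("127.0.0.1", 0))
    collector.settimeout(2.0)
    port = collector.getsockname()[1]

    log = make_forwarding_logger("dcache-pool-demo", "127.0.0.1", port)
    log.error("transfer failed: pool offline")

    data, _ = collector.recvfrom(4096)
    print(data.decode(errors="replace"))
    collector.close()
```

Plain UDP is used here only because it keeps the sketch short; it gives no delivery guarantee and no encryption, so a production setup would need a secured transport, in line with the "secured access" requirement above.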