5/25/2001 Monitoring panel, Monitoring session, LCCWS. Olof Bärring, CERN.


1 5/25/2001 Monitoring panel, LCCWS@FNAL
Monitoring session, LCCWS
Olof Bärring, CERN

2 Talks
–Tony Chan, BNL: ???
–Tanya Levshina, FNAL: NGOP
–Olof Bärring, CERN: PEM
Do you monitor services or servers?

3 BNL (Tony Chan)
Using VACM tools to monitor power and cooling. Recovery actions
LSF tools to monitor the queues
AFS and NFS
Built a layer on top
–Web interface
–Archive

4 FNAL (Tanya Levshina)
Used various tools to monitor different things (Xfalive, Patrol, NOC, FBSNG, ENSTORE)
A survey found no satisfactory tool
Launched the NGOP project in 1999:
–Focus on alarms
–Centrally configured. XML, MathML
–All events stored in Oracle
–Supports actuators
–In production. Good experience

5 CERN (Olof Bärring)
Used various tools up to now
Started the PEM project in 1999
–Monitor and alarm on service rather than server
–Alarms and performance. All measurements stored in Oracle
–Central configuration
–First prototype running: 400 nodes (1000 soon)
–~1 GB/day, initially some problems with JDBC
–Hierarchical structure allows local event-action decisions
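The two PEM points about alarming on the service rather than the server, and about local event-action decisions, can be illustrated with a minimal sketch. Nothing below is taken from PEM itself: the `Node` and `Service` classes, the `min_healthy` threshold, and the `local_actions` helper are all hypothetical names chosen for illustration, and the slides do not describe how PEM actually implements this.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Node:
    """A single server (hypothetical model, not PEM's)."""
    name: str
    healthy: bool


@dataclass
class Service:
    """A service backed by several nodes.

    min_healthy is an assumed threshold: the service counts as OK
    as long as at least this many nodes are healthy.
    """
    name: str
    nodes: List[Node]
    min_healthy: int

    def status(self) -> str:
        # Alarm on the service, not on each individual server.
        up = sum(1 for n in self.nodes if n.healthy)
        return "OK" if up >= self.min_healthy else "ALARM"


def local_actions(service: Service) -> List[str]:
    """Decide locally what to do about one service.

    Failed nodes are handled at this level; only a degraded
    service is escalated to the next level of the hierarchy.
    """
    actions = [f"restart {n.name}" for n in service.nodes if not n.healthy]
    if service.status() == "ALARM":
        actions.append(f"escalate {service.name}")
    return actions
```

With three nodes and `min_healthy=2`, one failed node produces only a local restart action, while a second failure also produces an escalation for the service as a whole.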

6 Key points
Apart from tools that come with products (VACM, LSF), everybody builds their own
Today: monitoring mostly covers objects, not services

7 Panel discussion
Enterprise type systems
–Home built: don’t forget ongoing costs
–Commercial tools: don’t forget substantial effort for adaptation and integration. Footprint?
Alarm vs. performance monitoring
–Sysadmins want alarms
–Users want performance

8 Panel discussion
Other tools
–NERSC using NetSaint (netsaint.org). Active support
–CERN looked at SiteAssure (Platform)
Possible collaboration
–Shared sensor repository?

