May 2005 Iosif Legrand 1 Iosif Legrand California Institute of Technology ICFA WORKSHOP Daegu, May 2005 Daegu, May 2005 An Agent Based, Dynamic Service.

Slides:



Advertisements
Similar presentations
Caltech Proprietary Videoconferencing Security in VRVS 3.0 and Future Videoconferencing Security in VRVS 3.0 and Future Kun Wei California Institute of.
Advertisements

CPSCG: Constructive Platform for Specialized Computing Grid Institute of High Performance Computing Department of Computer Science Tsinghua University.
Chapter 3: Planning a Network Upgrade
Data Management Expert Panel - WP2. WP2 Overview.
May 2005 Iosif Legrand 1 Iosif Legrand California Institute of Technology May 2005 An Agent Based, Dynamic Service System to Monitor, Control and Optimize.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
Notes to the presenter. I would like to thank Jim Waldo, Jon Bostrom, and Dennis Govoni. They helped me put this presentation together for the field.
June 2003 Iosif Legrand MONitoring Agents using a Large Integrated Services Architecture Iosif Legrand California Institute of Technology.
Monitoring and controlling VRVS Reflectors Catalin Cirstoiu 3/7/2003.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
Managing Agent Platforms with the Simple Network Management Protocol Brian Remick Thesis Defense June 26, 2015.
October 2003 Iosif Legrand Iosif Legrand California Institute of Technology.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
© DSRG 2001www.cs.agh.edu.pl Cross Grid Workshop - Kraków Krzysztof Zieliński, Sławomir Zieliński University of Mining and Metallurgy {kz,
L. Granado Cardoso, F. Varela, N. Neufeld, C. Gaspar, C. Haen, CERN, Geneva, Switzerland D. Galli, INFN, Bologna, Italy ICALEPCS, October 2011.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Understanding and Managing WebSphere V5
Grid Monitoring By Zoran Obradovic CSE-510 October 2007.
BMC Software confidential. BMC Performance Manager Will Brown.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
September 2005 Iosif Legrand 1 End User Agents: extending the "intelligence" to the edge in Distributed Service Systems Iosif Legrand California Institute.
KARMA with ProActive Parallel Suite 12/01/2009 Air France, Sophia Antipolis Solutions and Services for Accelerating your Applications.
Users’ Authentication in the VRVS System David Collados California Institute of Technology November 20th, 2003TERENA - Authentication & Authorization.
Online Monitoring with MonALISA Dan Protopopescu Glasgow, UK Dan Protopopescu Glasgow, UK.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
ACAT 2003 Iosif Legrand Iosif Legrand California Institute of Technology.
Ramiro Voicu December Design Considerations  Act as a true dynamic service and provide the necessary functionally to be used by any other services.
1 Ramiro Voicu, Iosif Legrand, Harvey Newman, Artur Barczyk, Costin Grigoras, Ciprian Dobre, Alexandru Costan, Azher Mughal, Sandor Rozsa Monitoring and.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Installation and Development Tools National Center for Supercomputing Applications University of Illinois at Urbana-Champaign The SEASR project and its.
Monitoring, Accounting and Automated Decision Support for the ALICE Experiment Based on the MonALISA Framework.
February 2006 Iosif Legrand 1 Iosif Legrand California Institute of Technology February 2006 February 2006 An Agent Based, Dynamic Service System to Monitor,
1 VINCI : Virtual Intelligent Networks for Computing Infrastructures An Integrated Network Services System to Control and Optimize Workflows in Distributed.
1 Iosif Legrand, Harvey Newman, Ramiro Voicu, Costin Grigoras, Catalin Cirstoiu, Ciprian Dobre An Agent Based, Dynamic Service System to Monitor, Control.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
ALICE, ATLAS, CMS & LHCb joint workshop on
DYNES Storage Infrastructure Artur Barczyk California Institute of Technology LHCOPN Meeting Geneva, October 07, 2010.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
LISHEP 2004 Iosif Legrand Iosif Legrand California Institute of Technology DISTRIBUTED SERVICES.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
Jian Gui WANG New Implementation of Agriculture Models APAN19---Jan New Implementations of Agriculture Models Using Mediate Architecture.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
A Collaborative Framework for Scientific Data Analysis and Visualization Jaliya Ekanayake, Shrideep Pallickara, and Geoffrey Fox Department of Computer.
ABone Architecture and Operation ABCd — ABone Control Daemon Server for remote EE management On-demand EE initiation and termination Automatic EE restart.
Overview of ALICE monitoring Catalin Cirstoiu, Pablo Saiz, Latchezar Betev 23/03/2007 System Analysis Working Group.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
1 MonALISA Team Iosif Legrand, Harvey Newman, Ramiro Voicu, Costin Grigoras, Ciprian Dobre, Alexandru Costan MonALISA capabilities for the LHCOPN LHCOPN.
Xrootd Monitoring and Control Harsh Arora CERN. Setting Up Service  Monalisa Service  Monalisa Repository  Test Xrootd Server  ApMon Module.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
April 2003 Iosif Legrand MONitoring Agents using a Large Integrated Services Architecture Iosif Legrand California Institute of Technology.
PPDG February 2002 Iosif Legrand Monitoring systems requirements, Prototype tools and integration with other services Iosif Legrand California Institute.
+ AliEn site services and monitoring Miguel Martinez Pedreira.
October 2006 Iosif Legrand 1 Iosif Legrand California Institute of Technology An Agent Based, Dynamic Service System to Monitor, Control and Optimize Distributed.
03/09/2007http://pcalimonitor.cern.ch/1 Monitoring in ALICE Costin Grigoras 03/09/2007 WLCG Meeting, CHEP.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
January 2006 Iosif Legrand 1 Iosif Legrand California Institute of Technology January 2006 An Agent Based, Dynamic Service System to Monitor, Control and.
Netscape Application Server
California Institute of Technology
Open Source distributed document DB for an enterprise
GGF15 – Grids and Network Virtualization
AWS Cloud Computing Masaki.
AIMS Equipment & Automation monitoring solution
STATEL an easy way to transfer data
Presentation transcript:

May 2005 Iosif Legrand 1 Iosif Legrand California Institute of Technology ICFA WORKSHOP Daegu, May 2005 Daegu, May 2005 An Agent Based, Dynamic Service System to Monitor, Control and Optimize Distributed Systems Control and Optimize Distributed Systems

May 2005 Iosif Legrand 2 MonALISA is A Dynamic, Distributed Service Architecture   Real-time monitoring is an essential part of managing distributed systems. The monitoring information gathered is necessary for developing higher level services, and components that provide automated decisions, to help operate and optimize the workflow in complex systems.   The MonALISA system is designed as an ensemble of autonomous multi-threaded, self-describing agent-based subsystems which are registered as dynamic services, and are able to collaborate and cooperate in performing a wide range of monitoring tasks. These agents can analyze and process the information, in a distributed way, to provide optimization decisions in large scale distributed applications.   An agent-based architecture provides the ability to invest the system with increasing degrees of intelligence; to reduce complexity and make global systems manageable in real time

May 2005 Iosif Legrand 3 The MonALISA Architecture Provides:  Distributed Registration and Discovery for Services and Applications.  Monitoring all aspects of complex systems :  System information for computer nodes and clusters  Network information : WAN and LAN  Monitoring the performance of Applications, Jobs or services  The End User Systems, its performance  Can interact with any other services to provide in near real-time customized information based on monitoring data  Secure, remote administration for services and applications  Agents to supervise applications, to restart or reconfigure them, and to notify other services when certain conditions are detected.  The MonALISA framework can be used to develop higher level decision services, implemented as a distributed network of communicating agents, to perform global optimization tasks.  Graphical User Interfaces to visualize complex information

May 2005 Iosif Legrand 4 The MonALISA Discovery System & Services Network of JINI-LUSs Secure & Public MonALISA services Proxies Clients, HL services repositories Distributed Dynamic Discovery- based on a lease Mechanism and REN Distributed System for gathering and Analyzing Information. Dynamic load balancing Scalability & Replication Security AAA for Clients Global Services or Clients Fully Distributed System with no Single Point of Failure AGENTS

May 2005 Iosif Legrand 5 Lookup Service MonALISA service & Data Handling Data Cache Service & DB Configuration Control (SSL) Configuration Control (SSL) Lookup Service Data Stores WEB Service WSDL SOAP Client (other service) Java Discovery Registration Client (other service) Web client data Postgres MySQL Applications User defined loadable Modules to write /sent data Predicates & Agents Communications via the ML Proxy MonALSIA Service

May 2005 Iosif Legrand 6 Lookup Service Registration / Discovery Admin Access and AAA for Clients MonALISA Service Lookup Service Client (other service) Discovery Registration (signed certificate) MonALISA Service MonALISA Service Services Proxy Multiplexer Services Proxy Multiplexer Client (other service) Admin SSL connection Trust keystore AAA services Client authentication Data Data Filters & Agents Filters & Agents Trust keystore

May 2005 Iosif Legrand 7 Security in the MonALSIA System Secure LUSs Secure Registration SSL/TLS, PKIX, GSS-API 1) Community-based trust relationships. Multiple MonaLisa services may be operated by a community. The community memberships is maintained in specialized Authorization Services 2) Flexible communication protection 3) Secure registration in LUSs based on an X.509 host or site certificate 4) Auditing PROXY SERVICE NETWORK Authorization Enforcement Point

May 2005 Iosif Legrand 8   Grid3 Grid3 ~40 sites in US and 1 Korea   CMS-US sites CMS-US sites   CMS CMS   CDF   D0 SAR D0 SAR   ABILENE backbone ABILENE   GLORIAD GLORIAD   STAR STAR   ALICE ALICE   VRVS System VRVS System   RoEduNET backbone RoEduNET backbone   INTERNET2 PIPES INTERNET2 PIPES   OSG   LHCb Communities using MonALISA ABILENE VRVS - - GRID3 ALICE CMS-DC04 It has been used for Demonstrations at:   SC2003   Telecom 2003   WSIS 2003   SC 2004   I2 2005

May 2005 Iosif Legrand 9 Monitoring I2 Network Traffic, Grid03 Farms and Jobs

May 2005 Iosif Legrand 10 Monitoring Network Topology Latency, Routers NETWORKS AS ROUTERS

May 2005 Iosif Legrand 11 Monitoring the Execution of Jobs and the Time Evolution SPLIT JOBS LIFELINES for JOBS Job Job1 Job2 Job3 Job 31 Job 32 Summit a Job DAG

May 2005 Iosif Legrand 12 Monitoring ABILENE backbone Network u Test for a Land Speed Record u ~ 7 Gb/s in a single TCP stream from Geneva to Caltech

May 2005 Iosif Legrand 13 Monitoring VRVS Reflectors and Communication Topology

May 2005 Iosif Legrand 14 ApMon – Application Monitoring MonALISA Service MonALISA Service ApMon APPLICATION Monitoring Data UDP/XDR Mbps_out: 0.52 Status: reading App. Monitoring MB_inout: ApMon Config parameter1: value parameter2: value App. Monitoring... Time;IP;procID Monitoring Data UDP/XDR Monitoring Data UDP/XDR load1: 0.24 processes: 97 System Monitoring pages_in: 83 MonALISA hosts Config Servlet Library of APIs (C, C++, Java, Perl. Python) that can be used to send any information to MonALISA services   Flexibility, dynamic configuration, high communication performance dynamic reloading ApMon configuration generated automatically by a servlet / CGI script   Automated system monitoring   Accounting information No Lost Packages

May 2005 Iosif Legrand 15 LISA- Localhost Information Service Agent End To End Monitoring Tool u u It is very easy to deploy and install by simply using any browser. u u It detects the system architecture, the operating system and selects dynamically the binary parts necessary on each system. u u It can be easily deployed on any system. It is now used on all versions of Windows, Linux, Mac. u u It provides complete system monitoring of the host computer: u u CPU, memory, IO, disk, … u u Hardware detection u u Main components, Audio, Video equipment, u u Drivers installed in the system u u Provides embedded clients for IPERF (or other network monitoring tools, like Web 100 ) u u A user friendly GUI to present all the monitoring information. A lightweight Java Web Start application that provides complete monitoring of the end user systems, the network connectivity and can use the MonALISA framework to optimize client applications

May 2005 Iosif Legrand 16 LISA- Provides an Efficient Integration for Distributed Systems and Applications LISA Lookup Service Lookup Service Discovery Registration Best Service MonALISA Application Service MonALISA Application Service MonALISA Application Service MonALISA Application Service u u It is using external services to identify the real IP of the end system, its network ID and AS u u Discovers MonALISA services and can select, based on service attributes, different applications and their parameters (location, AS, functionality, load … ) è è Based on information such as AS number or location, it determines a list with the best possible services. è è Registers as a listener for other service attributes (eg. number of connected clients). è è Continuously monitors the network connection with several selected services and provides the best one to be used from the client’s perspective. è è Measures network quality, detects faults and informs upper layer services to take appropriate decisions

May 2005 Iosif Legrand 17 Communication in the Distributed Collaborative System Communication in the Distributed Collaborative System vrvs us vrvs eu pub funet star- light sinica kek cor- nell triumf cal- tech usf usp inet 2 vrvs 5 Reflectors are hosts that interconnect users by permanent IP tunnels. The active IP tunnels must be selected so that there is no cycle formed. Tree The selection is made according to the real-time measurements of the network performance. minimum-spanning tree(MST) minimum-spanning tree (MST)

May 2005 Iosif Legrand 18 Creating a Dynamic, Global, Minimum Spanning Tree to optimize the connectivity A weighted connected graph G = (V,E) with n vertices and m edges. The quality of connectivity between any two reflectors is measured every 2s. Building in near real time a minimum- spanning tree T

May 2005 Iosif Legrand 19 EVO: LISA Detects the Best Reflector for each Client and MonALISA Agents keep the reflectors connected in a MST  Dynamic Discovery of Reflectors  Creates and maintains, in real-time, the optimal connectivity between reflectors (MST) based on periodic network measurements.  Detects and monitor the User configuration, its hardware, the connectivity and its performance.  Dynamically connects the client to the best reflector  Provides secure administration.  It is using alarm triggers to notify unexpected events

May 2005 Iosif Legrand 20 Optical Switch Runs a ML Demon > ml_path IP1 IP4 “copy file IP4” ML proxy services used in Agent Communication ML Demon Control and Monitor the switch Optical Switch MonALISA ML Agent MonALISA ML Agent MonALISA ML Agent Discovery & Secure Connection 4 MonALISA agents to create on demand on an optical path or tree Time to create a path on demand <1s independent of the location and the number of connections Time to create a path on demand <1s independent of the location and the number of connections

May 2005 Iosif Legrand 21 Monitoring Optical Switches Agents to Create on Demand an Optical Path

May 2005 Iosif Legrand 22 Test Setup for Controlling Optical Switches 3 Simulated Links as L2 VLAN CALIENT (LA) Glimmerglass (GE) 3 partitions on each switch They are controlled by a MonALISA service 10G links 1G links u u Monitor and control switches using TL1 u u Interoperability between the two systems

May 2005 Iosif Legrand 23 MonALISA is a framework to correlate information from different layers Interface with GMPLS where available where available GMPLS Job Job1 Job2 Job3 Job 31 Job 32 User Farms & Data Serv. Networking Applications HELP to create Vertical Integration

May 2005 Iosif Legrand 24 SUMMARY MonaLISA is a fully distributed service system with no single point of failure. It provides reliable registration and discovery. MonaLISA is a fully distributed service system with no single point of failure. It provides reliable registration and discovery. u MonALISA is interfaced with many monitoring tools and is capable to collect any information from different applications u It allows to analyze and process information in real time, locally, using Filters or Agents that are dynamically deployed. u Can be used to control and monitor any other applications. Agents can be used to supervise applications, to restart or reconfigure them, and to notify other services when certain conditions are detected. u Provides a secure administration interface which allows to remotely control (start / stop/ reconfigure / upgrade) distributed services or applications. u The Agent system in the MonALISA framework can be used to develop higher level services, implemented as a distributed network of communicating agents, to perform global optimization tasks. It proved to be a stable and reliable distributed service system ~200 Sites running MonALISA ~200 Sites running MonALISA