Presentation is loading. Please wait.

Presentation is loading. Please wait.

Łukasz Skitał 2, Renata Słota 1, Maciej Janusz 1 and Jacek Kitowski 1,2 1 Institute of Computer Science AGH University of Science and Technology, Mickiewicza.

Similar presentations


Presentation on theme: "Łukasz Skitał 2, Renata Słota 1, Maciej Janusz 1 and Jacek Kitowski 1,2 1 Institute of Computer Science AGH University of Science and Technology, Mickiewicza."— Presentation transcript:

1 Łukasz Skitał 2, Renata Słota 1, Maciej Janusz 1 and Jacek Kitowski 1,2 1 Institute of Computer Science AGH University of Science and Technology, Mickiewicza 30, 30-059 Krakow, Poland 2 Academic Computer Centre CYFRONET-AGH, Nawojki 11, 30-950 Krakow, Poland Int.eu.grid project Grid environments evolve from being 'better batch systems' into full-featured systems supporting variety of application types. Int.eu.grid project focuses on creating grid environment for interactive or real-time applications. Due to it's strict requirements, real-time applications belongs to the most demanding group, which is very difficult to support in distributed infrastructure. To support one of this difficult applications we have created a specialized virtual organization (HEP VO) with web-based Management System. Management System for HEP VO (MS-HEPVO) provides control over HEP VO specific aspects without overlapping with existing Virtual Organization Management System (VOMS). MS-HEPVO covers following areas of VO management: site management (certification, SLA, monitoring and statistics) and application control (grid job submission, job pool support, diagnostics). The Application and VO HEP Application is based on ATLAS filtering framework for LHC experiment at CERN. Considerable effort has been put into migration of this application to the grid environment. Application uses pilot jobs idea and requires real-time communication between LHC experiment and grid worker nodes. HEP VO provides stable, dynamic environment using resources available from many grid sites. Management of this kind of VO is not trivial. To be up to this task, VO is based on contracts strengthened by Service Level Agreements (SLA) with sites. Management of the environment and monitoring of contracts fulfillment require some additional effort, which can be minimized with adequate tools, like Management System for HEP VO. User interface The system offers portal-based interface, which can be used by VO Manager, site administrators and application supervisors. This interface provides, among others, quick overview of application's run-time environment condition basing on SLAs fulfillment. The system is also integrated with Migrating Desktop (MD), offering interface (plug-in) for application supervisor. MD plug- in allows to adjust grid job submission to current experiment needs. Management System for HEP VO Management System for HEP VO supports VO management process. The system consists of two web- services: Grid job submission service is responsible for application run-time environment. It keeps number of grid jobs on level desired by the application. The service takes advantage of dynamic resource allocation offered by the grid. Site management and SLA Monitoring service is used to keep track of site operations, including site certification, monitoring and failure recovery. The service gathers SLA metrics values from different information sources (like JIMS, O-CMG, SAM or BDII) and checks, if SLAs are fulfilled. It can react on any SLA infringement by changing rules for resource allocation (eg. exclude failing resources) and sending warnings or alarms. Use cases Management System for HEP VO is used by three type of users: VO Manager supervise VO operation. He performs certification of sites, manages site state according to its operation, prepares job submission profiles (application parameters sets). VO Manager checks also site operation according to signed SLA and in case of any problem can change site status. Experiment operator supervise the application. He control job submission basing on current experiment needs and can retrieve job output for problem diagnostic. Site owner register site by filling a form. He provides all relevant information which are also used for preparation of site’s specific SLA References [1] The int.eu.grid project http://www.interactive-grid.euhttp://www.interactive-grid.eu [2] Service Level Agreement metrics for real-time application on the Grid Skital L., Janusz M., Slota R., Kitowski J.,PPAM 2007, September 9-12, 2007, Poland Acknowledgement The work has been supported by EU IST 031857 int.eu.grid project. Certification procedure Each site before going into production has to pass certification procedure, which validates site configuration and quality. MS-HEPVO supports this procedure. It provides a form for site registration, what starts certification process, allows to change site status during the procedure, visualize monitoring data and support SLA which is signed after successful certification. SLA Enforcement MS-HEPVO monitors SLA fulfillment. It gather data for each metric for each site and creates a report for VO Manager. In case of problems a warning to VO Manager and an administrator is sent. If failing site disrupt application stability, site’s operation in HEP VO can be automatically suspended.


Download ppt "Łukasz Skitał 2, Renata Słota 1, Maciej Janusz 1 and Jacek Kitowski 1,2 1 Institute of Computer Science AGH University of Science and Technology, Mickiewicza."

Similar presentations


Ads by Google