Yingping Huang and Gregory Madey University of Notre Dame A W S utonomic eb-based imulation Presented by Tariq M. King Published by the IEEE Computer Society.

Slides:



Advertisements
Similar presentations
Current methods for negotiating firewalls for the Condor ® system Bruce Beckles (University of Cambridge Computing Service) Se-Chang Son (University of.
Advertisements

CloudStack Scalability Testing, Development, Results, and Futures Anthony Xu Apache CloudStack contributor.
KAIS T The Vision of Autonomic Computing Jeffrey O. Kephart, David M Chess IBM Watson research Center IEEE Computer, Jan 발표자 : 이승학.
Welcome to DEAS 2005 Design and Evolution of Autonomic Application Software David Garlan, CMU Marin Litoiu, IBM CAS Hausi A. Müller, UVic John Mylopoulos,
Presented by: Ehsan Kouroshfar. Outline Introduction Self-management Architectural Considerations Engineering challenges Scientific challenges Discussion.
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Futures – Alpha Cloud Deployment and Application Management.
Autonomic Systems Justin Moles, Winter 2006 Security in an Autonomic Computing Environment Paper by: D. M. Chess, C. C. Palmer S. R. White Presentation.
DARPA ITS PI Meeting – Honolulu – July 17-21, 2000Slide 1 Aegis Research Corporation Intrusion Tolerance Using Masking, Redundancy and Dispersion DARPA.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 20 Slide 1 Critical systems development.
Fabián E. Bustamante, Winter 2006 Autonomic Computing The vision of autonomic computing, J. Kephart and D. Chess, IEEE Computer, Jan Also - A.G.
Copyright 2007, Information Builders. Slide 1 Workload Distribution for the Enterprise Mark Nesson, Vashti Ragoonath June, 2008.
UNIT – 1 INTRODUCTION AND OVERVIEW OF OPERATING SYSTEMS ECEA.
1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.
Autonomic Computing Shafay Shamail Malik Jahan Khan.
Figure 1.1 Interaction between applications and the operating system.
Introduction to the new mainframe: Large-Scale Commercial Computing © Copyright IBM Corp., All rights reserved. Chapter 8: Autonomic computing.
Lesson 1: Configuring Network Load Balancing
Achieving self-healing in service delivery software systems by means of case- based reasoning Stefania Montani Cosimo Anglano Presented by Tony Schneider.
Overview SAP Basis Functions. SAP Technical Overview Learning Objectives What the Basis system is How does SAP handle a transaction request Differentiating.
H-1 Network Management Network management is the process of controlling a complex data network to maximize its efficiency and productivity The overall.
Trends & Challenges Systems Supporting M-Commerce Nour El Kadri University of Ottawa.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
Server Load Balancing. Introduction Why is load balancing of servers needed? If there is only one web server responding to all the incoming HTTP requests.
MOBILE CLOUD COMPUTING
1 Autonomic Computing An Introduction Guenter Kickinger.
An Approach to Test Autonomic Containers Ronald Stevens (IEEE Computer Society & ACM Student Member) August 1, 2006 REU Sponsored by NSF.
CGW 2003 Institute of Computer Science AGH Proposal of Adaptation of Legacy C/C++ Software to Grid Services Bartosz Baliś, Marian Bubak, Michał Węgiel,
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
WELCOME. AUTONOMIC COMPUTING PRESENTED BY: NIKHIL P S7 IT ROLL NO: 33.
Computing on the Cloud Jason Detchevery March 4 th 2009.
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
SUMA: A Scientific Metacomputer Cardinale, Yudith Figueira, Carlos Hernández, Emilio Baquero, Eduardo Berbín, Luis Bouza, Roberto Gamess, Eric García,
A Self-Manageable Infrastructure for Supporting Web-based Simulations Yingping Huang Xiaorong Xiang Gregory Madey Computer Science & Engineering University.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
August 3-4, 2004 San Jose, CA Developing a Complete VoIP System Asif Naseem Senior Vice President & CTO GoAhead Software.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
"1"1 Introduction to Managing Data " Describe problems associated with managing large numbers of disks " List requirements for easily managing large amounts.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
THE VISION OF AUTONOMIC COMPUTING. WHAT IS AUTONOMIC COMPUTING ? “ Autonomic Computing refers to computing infrastructure that adapts (automatically)
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
Process Architecture Process Architecture - A portion of a program that can run independently of and concurrently with other portions of the program. Some.
A Systematic Survey of Self-Protecting Software Systems
A Self-Configuring Test Harness for Web Applications Jairo Pava School of Computing and Information Sciences Florida International University Courtney.
VMware vSphere Configuration and Management v6
Security Vulnerabilities in A Virtual Environment
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
June 13-15, 2007Policy 2007 Infrastructure-aware Autonomic Manager for Change Management H. Abdel SalamK. Maly R. MukkamalaM. Zubair Department of Computer.
The Vision of Autonomic Computing Self-Management Unit 7-2 Managing the Digital Enterprise Kephart, and Chess.
Shane O’Neill Synergy Geekovation XenDesktop VM’s need to be registered in order for the user to be able to access them. Registration with the XenDesktop.
Enterprise Computing with Jini Technology Mark Stang and Stephen Whinston Jan / Feb 2001, IT Pro presented by Alex Kotchnev.
(re)-Architecting cloud applications on the windows Azure platform CLAEYS Kurt Technology Solution Professional Microsoft EMEA.
AUTONOMIC COMPUTING B.Akhila Priya 06211A0504. Present-day IT environments are complex, heterogeneous in terms of software and hardware from multiple.
CT101: Computing Systems Introduction to Operating Systems.
Sun Tech Talk 3 Solaris 10 and OpenSolaris Pierre de Filippis Sun Campus Evangelist
High Availability Linux (HA Linux)
Chapter 19: Architecture, Implementation, and Testing
Unit 27: Network Operating Systems
Software Defined Networking (SDN)
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
ASPECT ORIENTATED PROGRAMMING RESEARCH
Fault Tolerance Distributed Web-based Systems
The Vision of Autonomic Computing
Analysis models and design models
System Testing.
Allocating IP Addressing by Using Dynamic Host Configuration Protocol
Self-Managed Systems: an Architectural Challenge
Network Monitoring System
Presentation transcript:

Yingping Huang and Gregory Madey University of Notre Dame A W S utonomic eb-based imulation Presented by Tariq M. King Published by the IEEE Computer Society in the 38 th Annual Simulation Symposium (April 2005)

2 Autonomic Web-based Simulation Web-based Simulation + Autonomic Computing Motivations: Scientific simulations are large programs that will probably contain errors when deployed to web Increased complexity in large-scale web-based simulations due to integration of different pieces of services Goal: Self-manageable Web-based Simulations A W S

3 Brain controls higher order conscious activities ( thought, reasoning, abstraction) Brain also controls lower level involuntary activities called autonomic functions ANS monitors and regulates in such a way that there is no conscious human involvement ANS was the basis for IBM’s Autonomic Computing initiative for system self-management Human Nervous System = CNS + PNS

4 Autonomic Computing Vision I B M Adapt to dynamically changing environments Monitor and tune resources automatically Discover, diagnose & react to disruptions Anticipate, detect, identify and protect against attacks 4

5 AWS Requirements 1. Simulation checkpointing and restarting 2. Simulation self-awareness and proactive failure detection 3. Self-manageable computing infrastructure to host simulations A W S

6 Checkpointing (Self-healing/Optimizing) R Q 1 Checkpointing is often used in simulations, databases, systems and operations research Determining optimal checkpoint is not trivial Excessive => performance degradation Deficient => expensive redo Both yields a longer execution time An optimization problem is formed

7 Expected Execution Time 7

8 Modeling Simulation Execution 8

9 Proactive Failure Detection Major cause of simulation crashes is low memory API’s in J2SE 5.0 can be used for: External monitoring using external monitoring software Internal monitoring by adding logic inside the simulation E.g. MXBeans Low Mem Notification => checkpoint and terminate gracefully R Q 2

10 Autonomic Infrastructure for AWS R Q 3 Autonomic Agent on each server Autonomic Manager on DB server Firewall/Router with Standby DB with Standby DW 10 Autonomic IP forwarding switch

11 Self-Configuring under AWS Autonomic discovery of new servers Autonomic resize of server pool Autonomic configuration of firewall/router, application servers and simulation servers Autonomic configuration of the database server and the data warehouse A W S

12 Self-Healing under AWS Some degree of redundancy is required to achieve self-healing in AWS Hot standby data warehouse and hot standby database Database and data warehouse are designed on two physical hosts Server pool ensures that when an application server is down, other servers can pick up its tasks A W S

13 Self-Healing under AWS (contd) Application Servers autonomic agent monitors execution status untimely response => failed app server New server started and IP forwarding is changed by the autonomic agent on the firewall Simulation Servers Autonomic agents upload operating system metrics (load avg, free memory) This also serves as the “heart-beat”, if the autonomic manager doesn’t receive the heart- beat => failed simulation server A W S

14 Self-Healing under AWS (contd) Database Servers The autonomic manager resides on the DBS. Vital to keep server running 24/7 Whenever primary database is down, database connections can be failed over to the standby database. Simulations Checkpointing Dispatcher redistributes crashed simulations to appropriate simulation servers. A W S

15 Self-Optimizing under AWS Load balancing the server pool Achieved by the Dispatcher and the Autonomic Agents New simulation is assigned to the simulation server with the lowest OS load Agents check Dispatcher table periodically to start any unassigned simulations At each checkpoint, Agents check with the Autonomic Manager to see if migration is necessary Simulations on heavily loaded servers are checkpointed and restarted on light servers. A W S

16 Self-Protecting under AWS Careful configuration of the firewall Security configuration on the grid Users of the grid must register and be verified by the system administrator System administrator must assign appropriate user roles Use of data model tables USERS, USER_ROLES, VERIFIER Is this self-protecting/autonomic? A W S

17 Conclusions and Future Work Paper presents a prototype of autonomic web-based simulation Implementation of an autonomic infrastructure to support AWS is discussed Future work focuses on implementing more autonomic features into AWS A W S

18 Agnostic Question #1 The authors describe one possible implementation of autonomic web-based simulations. One example for a project that uses such an implementation is the NOM project. Do you know of any other projects that have been proposed or developed? How do they compare to each other in terms of efficiency, technique and architecture used? A W S

19 Agnostic Question #2 The paper states that web-based simulations need to be deployed through computing systems (i.e. storage devices, database, web servers and simulation servers). Can you think of any component(s) involved that would increase the level of complexity more than the other? A W S

20 Agnostic Question #3 One method the authors provide for handling faults after they have occurred is through the use of checkpointing and restarting. Which approach is better: Using static checkpointing (fixed time intervals) Using dynamic checkpointing (context-specific, amount of computation, etc) A W S

21 Agnostic Question #4 The authors suggest that for a system to achieve autonomic features, that system must become even more complex by embedding the complexity into the system infrastructure itself. Is there any approach that involves less complexity in achieving autonomic features? If yes, give examples. A W S

22 Agnostic Question #5 One method given by the paper for handling simulation servers that have not uploaded the OS metrics in a timely fashion would be to mark the simulations on that server as crashed and restart the simulations from the last checkpoint on another server. What action would be taken if the former server starts responding optimally. A W S

23 Agnostic Question #6 The authors stated two major requirements (proactive failure detection and, checkpointing and restarting) for AWS. Can you think of any other requirement that would be necessary for AWS? A W S

24 Agnostic Question #7 The paper suggests some techniques that could be used to implement the autonomic infrastructure for AWS such as autonomic discovery of new servers, autonomic failure detection etc. Can you think of any other techniques that could be considered useful? A W S

25 Agnostic Question #8 What challenges would be faced when trying to validate and test an autonomic web-based simulation? How important is test to autonomic web- based simulation? A W S

26 Agnostic Question #9 Compare and contrast the difference between autonomic grid computing and autonomic web-based simulations? How would the challenges in validating and testing an autonomic web-based simulation application differ from what is required to validate and test an autonomic grid computing application? A W S