FT-ERF Fault-Tolerance in an Event Rule Framework for Distributed Systems Hillary Caituiro-Monge, Graduate Student. Advisor: Javier Arroyo-Figueroa, Ph.D.

Slides:



Advertisements
Similar presentations
The Challenges of CORBA Security It is important to understand that [CORBAsecurity] is only a (powerful) security toolbox and not the solution to all security.
Advertisements

Impossibility of Distributed Consensus with One Faulty Process
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Software Connectors Software Architecture. Importance of Connectors Complex, distributed, multilingual, modern software system functionality and managing.
Consensus Hao Li.
Reliability on Web Services Presented by Pat Chan 17/10/2005.
Common Object Request Broker Architecture (CORBA) By: Sunil Gopinath David Watkins.
Broker Pattern Pattern-Oriented Software Architecture (POSA 1)
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
A Dependable Auction System: Architecture and an Implementation Framework
A brief look at CORBA. What is CORBA Common Object Request Broker Architecture developed by OMG Combine benefits of OO and distributed computing Distributed.
CS 582 / CMPE 481 Distributed Systems Replication.
Distributed Service Architectures Yitao Duan 03/19/2002.
2/23/2009CS50901 Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial Fred B. Schneider Presenter: Aly Farahat.
Transactional Services Ricardo Jiménez-Peris Marta Patiño-Martínez Technical University of Madrid 1 st Adapt Workshop 23 rd -24 th September 2002 Madrid,
Investigating Lightweight Fault Tolerance Strategies for Enterprise Distributed Real-time Embedded Systems Tech-X Corporation Boulder, Colorado Vanderbilt.
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Architectural Design Principles. Outline  Architectural level of design The design of the system in terms of components and connectors and their arrangements.
Object Based Operating Systems1 Learning Objectives Object Orientation and its benefits Controversy over object based operating systems Object based operating.
1 A Framework for Highly Available Services Based on Group Communication Alan Fekete Idit Keidar University of Sidney MIT.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
Team Members Lora zalmover Roni Brodsky Academic Advisor Professional Advisors Dr. Natalya Vanetik Prof. Shlomi Dolev Dr. Guy Tel-Zur.
Quality Assurance for Component- Based Software Development Cai Xia (Mphil Term1) Supervisor: Prof. Michael R. Lyu 5 May, 2000.
Scalability and Fault- Tolerance in an Event Rule Framework for Distributed Systems Hillary Caituiro-Monge Research Assistant, Graduate Student. Center.
26 Sep 2003 Transparent Adaptive Resource Management for Distributed Systems Department of Electrical Engineering and Computer Science Vanderbilt University,
Fault Tolerance via the State Machine Replication Approach Favian Contreras.
The Starfish System: Intrusion Detection and Intrusion Tolerance for Middleware Systems Kim Potter Kihlstrom Westmont College Santa Barbara, CA, USA Priya.
TRƯỜNG ĐẠI HỌC CÔNG NGHỆ Bộ môn Mạng và Truyền Thông Máy Tính.
Instituto de Informática and Dipartimento di Automatica e Informatica Universidade Federal do Rio Grande do Sul and Politecnico di Torino Porto Alegre,
Lecture 3: Sun: 16/4/1435 Distributed Computing Technologies and Middleware Lecturer/ Kawther Abas CS- 492 : Distributed system.
Wireless Access and Terminal Mobility in CORBA Dimple Kaul, Arundhati Kogekar, Stoyan Paunov.
1 of of 25 3 of 25 ORBs (Object Request Broker) – A distributed software bus for communication among middleware services and applications – To.
High Throughput Computing on P2P Networks Carlos Pérez Miguel
Dependable Systems (CSE 890), Thursday, 27 th 2003 IRL Interoperable Replication Logic: A three-tier approach to FT-CORBA Infrastructures Authors: R. Baldoni,
ARMADA Middleware and Communication Services T. ABDELZAHER, M. BJORKLUND, S. DAWSON, W.-C. FENG, F. JAHANIAN, S. JOHNSON, P. MARRON, A. MEHRA, T. MITTON,
Security Patterns in Wireless Sensor Networks By Y. Serge Joseph October 8 th, 2009 Part I.
Introduction to CORBA University of Mazandran Science & Tecnology By : Esmaill Khanlarpour January
Distributed Computing CSC 345 – Operating Systems By - Fure Unukpo 1 Saturday, April 26, 2014.
Secure Systems Research Group - FAU 1 Active Replication Pattern Ingrid Buckley Dept. of Computer Science and Engineering Florida Atlantic University Boca.
Architectural pattern: Interceptor Source: POSA II pp 109 – 140POSA II Environment: developing frameworks that can be extended transparently Recurring.
Distributed Objects and Middleware. Sockets and Ports Source: G. Coulouris et al., Distributed Systems: Concepts and Design.
Transparent Fault-Tolerant Java Virtual Machine Roy Friedman & Alon Kama Computer Science — Technion.
Toward Fault-tolerant P2P Systems: Constructing a Stable Virtual Peer from Multiple Unstable Peers Kota Abe, Tatsuya Ueda (Presenter), Masanori Shikano,
Fault Tolerance in CORBA and Wireless CORBA Chen Xinyu 18/9/2002.
Implementing Simple Replication Protocols using CORBA Portable Interceptors and Java Serialization T. Bennani, L. Blain, L. Courtes, J.-C. Fabre, M.-O.
Hwajung Lee.  Interprocess Communication (IPC) is at the heart of distributed computing.  Processes and Threads  Process is the execution of a program.
CORBA1 Distributed Software Systems Any software system can be physically distributed By distributed coupling we get the following:  Improved performance.
GLOBE DISTRIBUTED SHARED OBJECT. INTRODUCTION  Globe stands for GLobal Object Based Environment.  Globe is different from CORBA and DCOM that it supports.
SOFTWARE DESIGN AND ARCHITECTURE LECTURE 13. Review Shared Data Software Architectures – Black board Style architecture.
CRG talk on Tuesday, August 19, 2003 SeyedMasoud Sadjadi Software Engineering and Networking Systems Laboratory Department of.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Fault Tolerant Services
Software Connectors Acknowledgement: slides mostly from Software Architecture: Foundations, Theory, and Practice; Richard N. Taylor, Nenad Medvidovic,
Copyright © Richard N. Taylor, Nenad Medvidovic, and Eric M. Dashofy. All rights reserved. Software Connectors in Practice Software Architecture.
Middleware for Fault Tolerant Applications Lihua Xu and Sheng Liu Jun, 05, 2003.
E-COMMERCE & MOBILE COMPUTING. On Technicals… Considerations for evaluating platform Ecommerce Applications Development Process Integration Options Middlewares.
FLARe: a Fault-tolerant Lightweight Adaptive Real-time Middleware for Distributed Real-time and Embedded Systems Dr. Aniruddha S. Gokhale
Intrusion Tolerant Distributed Object Systems Joint IA&S PI Meeting Honolulu, HI July 17-21, 2000 Gregg Tally
The Role of Reflection in Next Generation Middleware
University of Maryland College Park
CORBA Alegria Baquero.
Component-Based Software Engineering: Technologies, Development Frameworks, and Quality Assurance Schemes X. Cai, M. R. Lyu, K.F. Wong, R. Ko.
CORBA Alegria Baquero.
Inventory of Distributed Computing Concepts
Fault Tolerance CSC 8320 : AOS Class Presentation Shiraj Pokharel
Fault-Tolerant CORBA By, Srinivas Seshu.
Quality Assurance for Component-Based Software Development
Group Service in CORBA Xing Gang Supervisor: Prof. Michael R. Lyu
Fault-Tolerant CORBA By, Srinivas Seshu.
Presentation transcript:

FT-ERF Fault-Tolerance in an Event Rule Framework for Distributed Systems Hillary Caituiro-Monge, Graduate Student. Advisor: Javier Arroyo-Figueroa, Ph.D. Presentation 3

Presentation Objectives Understand the Architecture of the Scalable and Fault-Tolerant ERF Architecture Relate Challenges on Active Replication Analyze Core Lacks among RUBIES replicas, with the purpose of Achieve Fault-Tolerance:  Lack of Timing Synchronization of Rule Evaluation Cycles (REC)  Lack of Consistency of Event Sets (ES)  Distributed Agreement Protocol

Presentation Objectives Introduce Research New Objective

SCALABLE AND FAULT TOLERANT ERF ARCHITECTURE DISTRIBUTION DIMENSION REPLICATION DIMENSION

REPLICATION CLASS DIAGRAM

DISTRIBUTION CLASS DIAGRAM

Challenges on Active Replication Strong replica consistency  All replicas must have the same state after method invocations Duplicated invocation detection and suppression

Lack of Timing Synchronization of Rule Evaluation Cycles (REC) among RUBIES replicas It is a source of non-deterministic behavior among RUBIES replicas It is not triggered in response to direct or indirect client’s method invocation It is always running Thereby the replicas consistency is not reachable by means of interface based consistency mechanisms

Lack of Timing Synchronization of Rule Evaluation Cycles (REC) among RUBIES replicas Each replica from a group has its independent REC, where the  Starting time differs  Duration time differs Making a scenario where each group member or replica runs each REC including different events.

Lack of Consistency of Event Sets (ES) among RUBIES replicas It is a source of non-deterministic behavior among RUBIES replicas The ES’ content changes different for each replica The ES’ content changes for two reasons:  Incoming events  Died events

Lack of Consistency of Event Sets (ES) among RUBIES replicas The ES’ content changes different for each replica, it is as consequence of delivery communication delay of events to each replica.

What is the problem? Each replica, belong to same group, includes dissimilar events for each consecutive equivalent REC execution.  As result each RUBIES replica posts different events in different times and with different state. Such behavior is a problem for load distribution and/or replication.

What is the issue? Strong replica consistency  Synchronize rule evaluation cycles among RUBIES replicas  Turn consistent event sets among RUBIES replicas

How to do it? Distributed Agreement or Consensus Protocol (Currently working in this)  RUBIES replicas must start each REC after an agreement. RECs must have an unique ID RECs of same ID must run simultaneously

How to do it? Distributed Agreement or Consensus Protocol (Currently working in this)  RUBIES replicas must include same events for RECs of same ID Agreement must include which events will consider Sliding window

Research New Objective The proposed research will focus on the fault-tolerance problem in ERF.  The main purpose is to design and implement a strong replica consistency mechanism to achieve fault-tolerance.

Procedure Select an Active Replication Software  Must be CORBA Fault-Tolerant Compatible  Must be portable  Must not be intrusive  No commercial Make an Distributed Agreement Protocol  Related Above

OGS (Object Group Service) Non-intrusive Service approach. Requiring no change to the underlying ORB Compliant with the CORBA specification Not proprietary. Designed and implemented as a set of CORBA objects. This makes it interoperable between different ORBs. Plans to extend OGS and make it compliant with FT- CORBA specification. White box.

Eternal Systems FTORB Non-intrusive Interception approach. CORBA objects above the ORB support the interfaces of the OMG Fault-Tolerant standard specifications Replication mechanisms below the ORB that provide strong replica consistency Interceptors to reach independence of the ORB and applications.

Others GMS (Group Communication Service) IRL Isis+Orbix Electra AQua

Comparison among Fault-Tolerant CORBA systems Carlo Marchetti et. al. “Architectural Issues on Fault Tolerance in CORBA”, in Proceedings of the SSGRR 2000 Computer & Business Conference, L'Aquila, Italy, 2000

Conclusion For Fault-Tolerance in ERF is necessary the design and implementation of an agreement protocol with the purpose of achieve strong replica consistency. Strong replica consistency will enable ERF for distributed scenarios, such as replication, load distribution, load balancing, and so on.

Thanks