ARMADA Middleware and Communication Services T. ABDELZAHER, M. BJORKLUND, S. DAWSON, W.-C. FENG, F. JAHANIAN, S. JOHNSON, P. MARRON, A. MEHRA, T. MITTON,

Slides:



Advertisements
Similar presentations
Reliable Communication in the Presence of Failures Kenneth Birman, Thomas Joseph Cornell University, 1987 Julia Campbell 19 November 2003.
Advertisements

Quality of Service Requirements
T.Sharon-A.Frank 1 Multimedia Quality of Service (QoS)
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts Amherst Operating Systems CMPSCI 377 Lecture.
Ranveer Chandra , Kenneth P. Birman Department of Computer Science
Reliability on Web Services Presented by Pat Chan 17/10/2005.
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
CS 582 / CMPE 481 Distributed Systems Fault Tolerance.
Distributed systems Module 2 -Distributed algorithms Teaching unit 1 – Basic techniques Ernesto Damiani University of Bozen Lesson 3 – Distributed Systems.
Group Communications Group communication: one source process sending a message to a group of processes: Destination is a group rather than a single process.
CS 582 / CMPE 481 Distributed Systems Communications (cont.)
Group Communication Phuong Hoai Ha & Yi Zhang Introduction to Lab. assignments March 24 th, 2004.
OSI Model.
In-Band Flow Establishment for End-to-End QoS in RDRN Saravanan Radhakrishnan.
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
EEC 688/788 Secure and Dependable Computing Lecture 13 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Multicast Communication
Department of Electronic Engineering City University of Hong Kong EE3900 Computer Networks Transport Protocols Slide 1 Transport Protocols.
CSc 461/561 CSc 461/561 Multimedia Systems Part C: 3. QoS.
Multicast Transport Protocols: A Survey and Taxonomy Author: Katia Obraczka University of Southern California Presenter: Venkatesh Prabhakar.
Switching Techniques Student: Blidaru Catalina Elena.
Distributed Real-Time systems 1 By: Mahdi Sadeghizadeh Website: Sadeghizadeh.ir Advanced Computer Networks.
1 A Modular Approach to Fault-Tolerant Broadcasts and Related Problems Author: Vassos Hadzilacos and Sam Toueg Distributed Systems: 526 U1580 Professor:
Integrated Services (RFC 1633) r Architecture for providing QoS guarantees to individual application sessions r Call setup: a session requiring QoS guarantees.
CSE679: QoS Infrastructure to Support Multimedia Communications r Principles r Policing r Scheduling r RSVP r Integrated and Differentiated Services.
1 System Models. 2 Outline Introduction Architectural models Fundamental models Guideline.
Computing in the RAIN: A Reliable Array of Independent Nodes Group A3 Ka Hou Wong Jahanzeb Faizan Jonathan Sippel.
Introduction to Networks CS587x Lecture 1 Department of Computer Science Iowa State University.
Chapter 1. Introduction. By Sanghyun Ahn, Deot. Of Computer Science and Statistics, University of Seoul A Brief Networking History §Internet – started.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 17 This presentation © 2004, MacAvon Media Productions Multimedia and Networks.
TOTEM: A FAULT-TOLERANT MULTICAST GROUP COMMUNICATION SYSTEM L. E. Moser, P. M. Melliar Smith, D. A. Agarwal, B. K. Budhia C. A. Lingley-Papadopoulos University.
© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.
SPREAD TOOLKIT High performance messaging middleware Presented by Sayantam Dey Vipin Mehta.
Reliable Communication in the Presence of Failures Based on the paper by: Kenneth Birman and Thomas A. Joseph Cesar Talledo COEN 317 Fall 05.
CS 447 Network & Data Communication QoS (Quality of Service) & DiffServ Introduction Department of Computer Science Southern Illinois University Edwardsville.
Group Communication Group oriented activities are steadily increasing. There are many types of groups:  Open and Closed groups  Peer-to-peer and hierarchical.
Farnaz Moradi Based on slides by Andreas Larsson 2013.
Fault Tolerance in CORBA and Wireless CORBA Chen Xinyu 18/9/2002.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 18 – Multimedia Transport (Part 1) Klara Nahrstedt Spring 2014.
Multimedia and Networks. Protocols (rules) Rules governing the exchange of data over networks Conceptually organized into stacked layers – Application-oriented.
Hwajung Lee. A group is a collection of users sharing some common interest.Group-based activities are steadily increasing. There are many types of groups:
Copyright © George Coulouris, Jean Dollimore, Tim Kindberg This material is made available for private study and for direct.
Chapter 24 Transport Control Protocol (TCP) Layer 4 protocol Responsible for reliable end-to-end transmission Provides illusion of reliable network to.
Totally Ordered Broadcast in the face of Network Partitions [Keidar and Dolev,2000] INF5360 Student Presentation 4/3-08 Miran Damjanovic
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Fault Tolerant Services
The Totem Single-Ring Ordering and Membership Protocol Y. Amir, L. E. Moser, P. M Melliar-Smith, D. A. Agarwal, P. Ciarfella.
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 17 This presentation © 2004, MacAvon Media Productions Multimedia and Networks.
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Distributed Real-Time Systems.
Building Dependable Distributed Systems, Copyright Wenbing Zhao
Lecture 10: Coordination and Agreement (Chap 12) Haibin Zhu, PhD. Assistant Professor Department of Computer Science Nipissing University © 2002.
NDDS: The Real-Time Publish Subscribe Middleware Network Data Delivery Service An Efficient Real-Time Application Communications Platform Presented By:
Failure detection The design of fault-tolerant systems will be easier if failures can be detected. Depends on the 1. System model, and 2. The type of failures.
Chapter 6 outline r 6.1 Multimedia Networking Applications r 6.2 Streaming stored audio and video m RTSP r 6.3 Real-time, Interactive Multimedia: Internet.
Relying on Safe Distance to Achieve Strong Partitionable Group Membership in Ad Hoc Networks Authors: Q. Huang, C. Julien, G. Roman Presented By: Jeff.
CIS679: RSVP r Review of Last Lecture r RSVP. Review of Last Lecture r Scheduling: m Decide the order of packet transmission r Resource configuration.
Multimedia Communication Systems Techniques, Standards, and Networks Chapter 4 Distributed Multimedia Systems.
Group Communication A group is a collection of users sharing some common interest.Group-based activities are steadily increasing. There are many types.
Distributed Systems Lecture 7 Multicast 1. Previous lecture Global states – Cuts – Collecting state – Algorithms 2.
Chapter 8 Fault Tolerance. Outline Introductions –Concepts –Failure models –Redundancy Process resilience –Groups and failure masking –Distributed agreement.
Replication Chapter Katherine Dawicki. Motivations Performance enhancement Increased availability Fault Tolerance.
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Reliable multicast Tolerates process crashes. The additional requirements are: Only correct processes will receive multicasts from all correct processes.
Klara Nahrstedt Spring 2009
EEC 688/788 Secure and Dependable Computing
Seminar Mobilkommunikation Reliable Multicast in Wireless Networks
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
M. Mock and E. Nett and S. Schemmer
Presentation transcript:

ARMADA Middleware and Communication Services T. ABDELZAHER, M. BJORKLUND, S. DAWSON, W.-C. FENG, F. JAHANIAN, S. JOHNSON, P. MARRON, A. MEHRA, T. MITTON, A. SHAIKH, K. SHIN, Z. WANG, H. ZOU Real-Time Computing Laboratory University of Michigan Presented by Guoliang Xing

Agenda Introduction RTCAST Group Comm. Service Real-Time Channel Architecture Platforms RTPB Replication Service Evaluation Tools

Target Applications Embedded fault-tolerant applications Industrial and manufacturing systems Distributed multimedia Air traffic control

Key Challenges Timely delivery of services with end-to-end real-time constraints Dependability of services in the presence of h/s failures Scalability of computation and communication resources Exploitation of open systems and emerging standards in operating systems and communication services

ARMADA Architecture Applications Middleware Services Evaluation Tools API Real-Time Channels Microkernel

RTCAST Multicast comm. and group management in timely fashion, with faults

Group Communication Reliable message delivery Agreement on group membership Failure detection and handling Consistency Atomicity: either everybody gets the message or nobody gets it Global order

Real-time Group Comm. Late message means failure Atomic, ordered message delivery in timely fashion Immediate message delivery without compromising the above

Achieve reliability, atomicity, RT Reliability: e ach member either receives a multicast message m or crashes before receiving m Atomicity: correct members receive all message and in the same order Time-bounded multicast: each member either receives each multicast m in total order within T time units or crashes during T before receiving m

RTCAST - Architecture Real-time Process Groups API Clock SynchronizationVirtual Network Interface Unicast Datagram Communication Admission Control and Schedulability Analysis Group Membership Service Timed Atomic Multicast

System Model Assumptions: each processor has its own unique identifier a path exists between any two processors communication delay is bounded (in the absence of failures) synchronized clocks Failures processors may suffer performance or crash failures messages may suffer performance or omission failures

Agreement on membership All members have the same membership view at group initialization time For each membership update U which changes membership view from V to V’, U is delivered atomically (in order) to all members in V U V’ within T time units

Steady-state operation

Token Ring: ensure order A processor sends messages only after holds token Upon receiving the token sends multicast messages within maximum token hold time sends a heartbeat which is a token to successor Upon receiving a multicast message deliver to application in sequence if message omission detected, crash

Steady-state operation– contd.

Handle faults

Membership Changes Processor crashes Each processor checks the heartbeats from members when its turn comes Send membership update multicast Joins Sends a join request to some processor which multicasts membership change message Joining processor checks the consistence of membership views sent in ACKs

Token Rotation Period P token – Token rotation time T i – maximum token hold time at any processor n – number of processes d max – comm. delay

Admission Control Goal: Only admit affordable messages Assumptions: Each sender can transmit messages for up to Tj units of time within P Time elapsed between the send and delivery is bounded by Δ

Admission Control – Contd. Real-time message: Maximum transmission time C i, period P i, deadline d i Sufficient Schedulability Condition:

Implementation

Agenda Introduction RTCAST Group Comm. Service Real-Time Comm. Architecture Platforms RTPB Replication Service Evaluation Tools Conclusion

RT Channel Architecture

RT Comm. Architecture – Contd. Real-time channel: unicast virtual connection between two hosts with bounded end-to-end delay guarantee RTC API: Clip: endpoint with QoS parameters RTCOP: Signaling and resource reservation QoS model & Admission control:

RTC API

RTCOP-Contd. Real-Time Connection Ordination Protocol: Distributed end-to-end signaling Request and reply handler: manage signaling state and interface to admission control Comm. module: reliably forward signaling message Signaling connection is non-real-time but reliable

RTCOP

Resource scheduling

Resource scheduling- Contd. QoS-sensitive CPU scheduling: Each message must be sent within deadline Comm. Handler scheduled with EDF policy Resource reservation: Associate each Comm. Handler with budget Policing: Link bandwidth allocation: Dynamic priority based link scheduler

Resource Scheduling – contd.

Traffic isolation in RTC

Agenda Introduction RTCAST Group Comm. Service Real-Time Comm. Architecture Platforms RTPB Replication Service Evaluation Tools Conclusion

Platforms Microkernel x-kernel: Co- located server UDP/IP

RTPB Architecture Many RT applications can tolerate minor inconsistencies in replicated state Backup maintains a less current copy of primary Distance between the primary and backup data is bounded within a time window

Evaluation Tools - ORCHESTRA A distributed protocol is viewed as an abstraction layer through which participants communicate by exchanging messages A probe/fault injection (PFI) layer is inserted between any two consecutive layers in a protocol stack. PFI layer can delay, drop, reorder, duplicate, modify, introducing spontaneous messages

Conclusions Middleware Services for fault-tolerant group communication Real-time communication services validation tools

Questions?