Presentation is loading. Please wait.

Presentation is loading. Please wait.

Fault and Intrusion Tolerant (FIT) Event Broker & BFT-SMaRt A. Casimiro, D. Kreutz, A. Bessani, J. Sousa, I. Antunes, P. Veríssimo University of Lisboa,

Similar presentations


Presentation on theme: "Fault and Intrusion Tolerant (FIT) Event Broker & BFT-SMaRt A. Casimiro, D. Kreutz, A. Bessani, J. Sousa, I. Antunes, P. Veríssimo University of Lisboa,"— Presentation transcript:

1 Fault and Intrusion Tolerant (FIT) Event Broker & BFT-SMaRt A. Casimiro, D. Kreutz, A. Bessani, J. Sousa, I. Antunes, P. Veríssimo University of Lisboa, Portugal Meeting PT, November 27, 2012

2 Cloud Infrastructures Monitoring Tools and Control Engines Processing farm Storage farm Switching and Routing Control Events Control Events Control Events Alert! Cloud infrastructures are one of the new hot targets of attacks! Meeting PT / November 27, 20122

3 Example scenario: Portugal Telecom Cloud Computing Infrastructure SmartCloud product First and main problem: Centralized monitoring approach Diversity of monitoring tools ArchSight, Pulse, SCOM Meeting PT / November 27, 20123 Agentless Agent-Based Agent with ArchSight ArcSight (engine) Monitoring Probe Events ArcSight or other tool Problems: (a)faults and attacks; (b)diversity is hard to achieve in practice.

4 The TRONE approach Fault and Intrusion Tolerant (FIT) Event Broker Automated Failure Diagnosis Multi-homing for fast reconfiguration Meeting PT / November 27, 20124 1 2 3

5 FIT Event Broker Goals and challenges Overarching goals: To provide support for trustworthy and resilient monitoring of cloud/datacenter infrastructures To achieve improved Quality of Protection without neglecting Quality of Service (performance) needs Some specific challenges: Deal with large flows of information (events) Support different kinds of events (e.g. different criticality) Low intrusiveness and easy integration 5Meeting PT / November 27, 2012

6 FIT Event Broker Assumptions System entities: Probes, event collectors/brokers, consoles Some event processing may be done by collectors Fully connected network E.g., all the entities lie in the same monitoring VLAN Partially synchronous system Clocks may be used to timestamp events Faults Some FIT brokers may crash or fail in a Byzantine way We do not require/enforce clients (probes/consoles) to be correct If this is a problem for monitoring, then it must also be solved 6Meeting PT / November 27, 2012

7 FIT Event Broker Baseline design options Topic-based Publish-Subscribe paradigm Good fit to considered scenarios State Machine Replication Active replication is better for Byzantine fault tolerance f out of n replicas of a FIT Broker may fail in a Byzantine way Public-key cryptography Client authentication, avoid attacks from malicious probes Event channels with support for QoP and QoS Differentiated fault-tolerance support (e.g. crash only or BFT) 7Meeting PT / November 27, 2012

8 FIT Event Broker High level architectural view Meeting PT / November 27, 20128

9 FIT Event Broker Interface 9Meeting PT / November 27, 2012 Create event channel In: TAG and CLASS Destroy event channel In: TAG Register to channel In: TAG Publish event In: EVENT Subscribe to channel In: TAG Receive event Out: EVENT

10 FIT Event Broker Internal state From the SMR perspective, it is important to identify the relevant state that needs to be maintained consistent across replicas Data related to the broker configuration Existing channels and their CLASS Registered publishers and subscribers Data related to events Events that are ready to be delivered 10 All client input that affects the state of the FIT broker state (e.g. channel and subscription data, some events) must be handled as a state machine command Meeting PT / November 27, 2012

11 BFT-SMaRt Overview Java-based platform for BFT SMR, available at http://code.google.com/p/bft-smart/ Actively being developed and improved in our group BFT SMR “common” features State machine programming model n ≥ 3f+1 replicas required A small step away from being a commercial product Advanced features Replica recovery (state transfer) Reconfigurations Extensible API: e.g. custom voter Meeting PT / November 27, 201211

12 BFT-SMaRt Service invocation Meeting PT / November 27, 201212 PROBE FIT Broker state Agreement on order performed by SMaRt

13 BFT-SMaRt Execution and response Meeting PT / November 27, 201213 Commands are delivered to the FIT broker, which updates the state/queues and replies Voting on client side

14 The FIT Broker is currently being implemented… …and integrated with BFT-SMaRt Evaluation: Throughput Aim is to deal with 40K events/sec Resilience Measure performance under attack Verify recovery and reconfiguration capabilities A simple demo is available Meeting PT / November 27, 201214 BFT-SMaRt Implementation & Evaluation

15 Preliminary results available [DAIS 2012] Meeting PT / November 27, 201215 Throughput for up to 100 channels

16 Summary FIT Event Broker – Event dissemination support For easier deployment of multiple monitoring tools Manage which events are propagated, to which consoles, with which QoS BFT-SMaRT – Byzantine fault tolerant replication First usable implementation of BFT replication Leading edge worldwide Resilience against malicious attacks with small overhead Portugal Telecom’s cloud infrastructure is being used as real use case for application and evaluation of the work Meeting PT / November 27, 201216


Download ppt "Fault and Intrusion Tolerant (FIT) Event Broker & BFT-SMaRt A. Casimiro, D. Kreutz, A. Bessani, J. Sousa, I. Antunes, P. Veríssimo University of Lisboa,"

Similar presentations


Ads by Google