Flexible Provisioning of Service Workflows Sebastian Stein Supervisors: Nick Jennings Terry Payne KEG Seminar, Aston University 4 th March 2008.

Slides:

Advertisements

Similar presentations

L3S Research Center University of Hanover Germany

Advertisements

CprE 458/558: Real-Time Systems

QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,

Multi-level SLA Management for Service-Oriented Infrastructures Wolfgang Theilmann, Ramin Yahyapour, Joe Butler, Patrik Spiess consortium / SAP.

Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.

Hadi Goudarzi and Massoud Pedram

Dynamic Thread Assignment on Heterogeneous Multiprocessor Architectures Pree Thiengburanathum Advanced computer architecture Oct 24,

ISE480 Sequencing and Scheduling Izmir University of Economics ISE Fall Semestre.

SE503 Advanced Project Management Dr. Ahmed Sameh, Ph.D. Professor, CS & IS Project Uncertainty Management.

Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.

CS 795 – Spring  “Software Systems are increasingly Situated in dynamic, mission critical settings ◦ Operational profile is dynamic, and depends.

All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.

Gizem ALAGÖZ. Simulation optimization has received considerable attention from both simulation researchers and practitioners. Both continuous and discrete.

Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.

Planning under Uncertainty

A Service Selection Model to Improve Composition Reliability Natallia Kokash.

Zach Ramaekers Computer Science University of Nebraska at Omaha Advisor: Dr. Raj Dasgupta 1.

GridFlow: Workflow Management for Grid Computing Kavita Shinde.

A Heuristic Bidding Strategy for Multiple Heterogeneous Auctions Patricia Anthony & Nicholas R. Jennings Dept. of Electronics and Computer Science University.

Three heuristics for transmission scheduling in sensor networks with multiple mobile sinks Damla Turgut and Lotzi Bölöni University of Central Florida.

BUSINESS PROCESS DESIGN: TOWARDS SERVICE-BASED GREEN INFORMATION SYSTEMS Barbara Pernici, Danilo Ardagna, Cinzia Cappiello Politecnico di Milano

Job Release-Time Design in Stochastic Manufacturing Systems Using Perturbation Analysis By: Dongping Song Supervisors: Dr. C.Hicks & Dr. C.F.Earl Department.

Improving Robustness in Distributed Systems Jeremy Russell Software Engineering Honours Project.

Company Enterprise Risk Management & Stress Testing Case Study.

Models for Measuring and Hedging Risks in a Network Plan

Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,

Planning operation start times for the manufacture of capital products with uncertain processing times and resource constraints D.P. Song, Dr. C.Hicks.

1 Optimizing Utility in Cloud Computing through Autonomic Workload Execution Reporter : Lin Kelly Date : 2010/11/24.

New Challenges in Cloud Datacenter Monitoring and Management

Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)

VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.

1 A User-Guided Cognitive Agent for Wireless Service Selection in Pervasive Computing George Lee May 5, 2004 G. Lee, P. Faratin, S. Bauer, and J. Wroclawski.

1 Risk Based Negotiation of Service Agent Coalitions Bastian Blankenburg, Matthias KluschDFKI Minghua He, Nick JenningsUniversity of Southampton.

SOFTWARE DESIGN AND ARCHITECTURE LECTURE 09. Review Introduction to architectural styles Distributed architectures – Client Server Architecture – Multi-tier.

An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.

Xiao Liu CS3 -- Centre for Complex Software Systems and Services Swinburne University of Technology, Australia Key Research Issues in.

Autonomous Replication for High Availability in Unstructured P2P Systems Francisco Matias Cuenca-Acuna, Richard P. Martin, Thu D. Nguyen

PERVASIVE COMPUTING MIDDLEWARE BY SCHIELE, HANDTE, AND BECKER A Presentation by Nancy Shah.

Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.

Stochastic DAG Scheduling using Monte Carlo Approach Heterogeneous Computing Workshop (at IPDPS) 2012 Extended version: Elsevier JPDC (accepted July 2013,

Efficient Provisioning of Service Level Agreements for Service Oriented Applications Valeria Cardellini, Emiliano Casalicchio, Vincenzo Grassi, Francesco.

MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.

10 th December, 2013 Lab Meeting Papers Reviewed:.

Haley: A Hierarchical Framework for Logical Composition of Web Services Haibo Zhao, Prashant Doshi LSDIS Lab, Dept. of Computer Science, University of.

© 2012 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.

Optimal Resource Allocation for Protecting System Availability against Random Cyber Attack International Conference Computer Research and Development(ICCRD),

Learning to Navigate Through Crowded Environments Peter Henry 1, Christian Vollmer 2, Brian Ferris 1, Dieter Fox 1 Tuesday, May 4, University of.

CUHK Learning-Based Power Management for Multi-Core Processors YE Rong Nov 15, 2011.

Accommodating Bursts in Distributed Stream Processing Systems Yannis Drougas, ESRI Vana Kalogeraki, AUEB

WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction Jieming Zhu, Yu Kang, Zibin Zheng and Michael R. Lyu The.

Xiao Liu 1, Yun Yang 1, Jinjun Chen 1, Qing Wang 2, and Mingshu Li 2 1 Centre for Complex Software Systems and Services Swinburne University of Technology.

OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.

DO LOCAL MODIFICATION RULES ALLOW EFFICIENT LEARNING ABOUT DISTRIBUTED REPRESENTATIONS ? A. R. Gardner-Medwin THE PRINCIPLE OF LOCAL COMPUTABILITY Neural.

Static Process Scheduling

Slide 1 Service-centric Software Engineering. Slide 2 Objectives To explain the notion of a reusable service, based on web service standards, that provides.

Service Reliability Engineering The Chinese University of Hong Kong

A stochastic scheduling algorithm for precedence constrained tasks on Grid Future Generation Computer Systems (2011) Xiaoyong Tang, Kenli Li, Guiping Liao,

Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,

Presented by: Omar Alqahtani Spring Authors: Publication:  ICDE 2015 Type:  Research Paper 2.

Dynamic Power Management Using Online Learning Gaurav Dhiman, Tajana Simunic Rosing (CSE-UCSD) Existing DPM policies do not adapt optimally with changing.

Multiple-goal Search Algorithms and their Application to Web Crawling Dmitry Davidov and Shaul Markovitch Computer Science Department Technion, Haifa 32000,

1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.

Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.

Erik Ela, Eamonn Lannoye, Bob Entriken, Aidan Tuohy

Analytics and OR DP- summary.

ISP and Egress Path Selection for Multihomed Networks

2016 International Conference on Grey Systems and Uncertainty Analysis

An Adaptive Middleware for Supporting Time-Critical Event Response

Market-based Dynamic Task Allocation in Mobile Surveillance Systems

Self-Managed Systems: an Architectural Challenge

Presentation transcript:

Flexible Provisioning of Service Workflows Sebastian Stein Supervisors: Nick Jennings Terry Payne KEG Seminar, Aston University 4 th March 2008

Flexible Provisioning of Service Workflows Agenda  Background & Motivation  Flexible Service Provisioning  On-Demand Invocation  Advance Agreements  Conclusions 2

Background & Motivation

Flexible Provisioning of Service Workflows 4 Background  Computer systems are increasingly distributed:  E-commerce Source: National Statistics Website,

Flexible Provisioning of Service Workflows 5 Background  Computer systems are increasingly distributed:  E-commerce  High performance computing

Flexible Provisioning of Service Workflows 6 Service-Oriented Computing  Distributed agents offer their capabilities as computer services, which are high-level behaviours that can be procured by consumers in order to achieve their objectives. These include:  Traditional business services (e.g., ordering components, making logistic arrangements, booking a flight ticket),  Computational services (e.g., data analysis, transformation and communication),  Information services (e.g., yellow pages, weather forecast, financial data).

Flexible Provisioning of Service Workflows 7 Workflows  Services are rarely used in isolation.  Usually, they form the building blocks for more complex applications.  Definition: A workflow is a collection of tasks and their dependencies.

Flexible Provisioning of Service Workflows 8 Taverna Workflow (myGrid) Source: Exploring Williams-Beuren Syndrome Using myGrid, Hannah Tipney,

Flexible Provisioning of Service Workflows 9 Pegasus Workflow Source: Pegasus Teragrid Talk SC2005 Seattle Washington,

Flexible Provisioning of Service Workflows Service Provisioning 10  Services are dynamically provisioned (selected) by consumers at run-time.  Services are provided by autonomous agents.  These may be unreliable (may fail or take longer than expected)…  …and heterogeneous. $ h -$20 -$10 -$5-$25 Failure! Value Deadline

Flexible Provisioning of Service Workflows 11 Problem Statement  How to design a service consuming agent able to deal effectively and efficiently with unreliable and heterogeneous service providers when executing complex workflows.

Flexible Provisioning of Service Workflows Related Work  Many current approaches concentrate on functional aspects of services and assume their behaviour to be deterministic.  Some work explicitly considers service failures:  Exception handling (e.g., fault handlers in WS-BPEL),  Fixed redundancy (e.g., replicated Web services),  Retry and timeout policies (Zeng 2005, Erradi 2006),  Non-functional service constraints (McIlraith and Son 2002).  These require significant manual input! 12

Flexible Provisioning of Service Workflows Related Work (Quality-of-Service Optimisation) 13  Local task QoS optimisation (Zeng 2004):  For each task, provision the provider that optimises some property for that task (e.g., cost, reliability, duration).  Global workflow QoS optimisation (Zeng 2003, Yu/Lin 2005):  Provision one provider for each task, so that a weighted sum of global performance characteristics is optimised:  Adaptive variants re-provision upon failure (Canfora 2005).  But: These do not reason explicitly about failures, rely on manually specified weights and constraints, and select single provider for each task.

Flexible Provisioning On-Demand Invocation

Flexible Provisioning of Service Workflows Central Idea  How to address uncertainty during provisioning? 15 Existing work mostly relies on single service for each workflow task. We can do better by exploiting parallel and serial redundancy. … and by taking into consideration service heterogeneity.

Flexible Provisioning of Service Workflows Service Model 16  We devised an abstract model to describe a service- oriented system.  Assumptions:  Assume silent “crash” failures.  Providers paid on invocation.  Failures and durations are independent.  Free disposal of redundant services (but cost still incurred!)  Utility function: Cost: c(s 1 ) = £100 Failure Prob.: f(s 1 ) = 0.01 Duration:

Flexible Provisioning of Service Workflows 17 Flexible Strategy  We want to find a provisioning allocation for each task, e.g.:  This is an optimisation problem: Expected reward Expected cost

Flexible Provisioning of Service Workflows Why is this difficult? 18  Intuitively,  Combinatorial problem:  Difficult objective function (probabilistic durations).  Based on this, we can show that provisioning is inherently hard...

Flexible Provisioning of Service Workflows Provisioning Provisioning Problem 19 Knapsack (NP-complete) PERT CDF (#P-complete)  Provisioning is NP-hard  Provisioning is #P-hard  Big problem as we wanted efficient methods for realistic workflows!

Flexible Provisioning of Service Workflows 20 Flexible Strategy  Approximate the expected utility of an allocation using a heuristic utility function:  Optimise this with local search. Estimated utility Success probability Estimated workflow duration pdf Estimated cost Reward function

Flexible Provisioning of Service Workflows Local Task Calculations  We start by calculating a number of performance parameters for each task in the workflow: 21 Success Probability: 95.00% Expected Cost: £30.00 Expected Duration: min Variance: min 2 Success Probability: 96.83% Expected Cost: £7.23 Expected Duration: min Variance: min 2 Cost:£1£30 Success:25%95% Duration:Exp (80) Gamma (10,6) 2 Service populations: Success Probability: 99.99% Expected Cost: £26.15 Expected Duration: min Variance: min 2

Flexible Provisioning of Service Workflows Global Workflow Calculations  These task parameters are then combined to estimate the overall expected profit: 22 Global Parameters: Success Probability: 68% Estimated Cost: £98.40 Estimated Duration: 132 min Variance: 912 min 2 100% 95% 80% 99% 100% 90% £24 £10 £3£42 £5 £ =

Flexible Provisioning of Service Workflows Empirical Evaluation  To test the strategy, we compare it to a number of benchmarks:  Naïve: Provisions a single provider for each task.  Models current approaches that do not consider service unreliability.  Global QoS: Optimises weighted QoS measures over entire workflow (set all w i =1/3, use maximum utility and zero reward time as budget/time constraints).  Adaptive Global QoS: As above, but also uses timeouts and re- provisions dynamically.  Local QoS: Optimises weighted QoS measure for each task. 23

Flexible Provisioning of Service Workflows Empirical Evaluation 24

Flexible Provisioning of Service Workflows Empirical Evaluation 25

Flexible Provisioning of Service Workflows Empirical Evaluation 26

Flexible Provisioning of Service Workflows Empirical Evaluation 27

Flexible Provisioning of Service Workflows Empirical Evaluation 28

Flexible Provisioning of Service Workflows Further Results  We can compare our performance to an optimal strategy for very small workflows (3 tasks!).  Achieves around 98% of optimal utility.  Results indicate that our strategy is robust to inaccurate information (with errors up to 10-15%). Beyond that, generally degrades gracefully, but problems when expected utility very low.  Trends hold on larger workflows (tested up to 1000 tasks). 29

Flexible Provisioning of Service Workflows So far…  We have proposed a flexible provisioning strategy that deals with uncertain service providers:  By provisioning multiple providers redundantly for critical tasks.  By re-provisioning services that seem to have failed.  By exploiting the heterogeneity of providers.  Our strategy outperforms the state of the art in flexible service provisioning.  But so far, our strategy:  Assumes that service populations are static throughout execution.  Assumes that services are always invoked on demand.  Does not adapt to new information during execution. 30

Flexible Provisioning Advance Agreements

Flexible Provisioning of Service Workflows Advance Provisioning  Increasingly, services will be offered in the context of pre- negotiated agreements (this is already emerging in computational Grids).  The agreements form a contract about when and how a service will be provided in the future. 32 I need service X in 2 hours. Reservation Cost:£20 Invocation Cost:£10 Start time: 2:00 Completion time: 2:30

Flexible Provisioning of Service Workflows Advance Provisioning  Performance characteristics might vary depending on time of provisioning (e.g., airline pricing policies): 33 Contract TermOn-Demand1h Advance12h Advance Cost£10£5£15 Duration45min20min10min Failure Probability10%2%0.1%

Flexible Provisioning of Service Workflows Modified System Model 34 Providers  Model a dynamic market:  Each time step:  Providers post offers, according to some stochastic process.  Consumer provisions offers.  Offers disappear (acquired by other consumers or withdrawn). Consumer Service Type: T 1 Start Time: 200 End Time: 220 Reservation Cost:£1 Execution Cost: £5 Penalty: £20 Failure Probability: 10% Defection Probability: 50%

Flexible Provisioning of Service Workflows Challenges  Future availability of offers uncertain.  Fixing advance agreements may mean that reservations costs are lost if preceding services fail.  Need to balance benefits of advance provisioning with risk! 35

Flexible Provisioning of Service Workflows Our Approach  Gradual Provisioning:  First make high-level provisioning decisions (how and when to provision tasks).  Follow these at run-time.  Adapt strategies when failures occur. 36 High-level decision Provisioned Completed Failure

Flexible Provisioning of Service Workflows High-Level Decisions  Assume we have a set of atomic provisioning strategies for each service type:  Performance statistics of strategies are learnt offline by observing the market. 37 Service Types Strategies … … Strategy w: Advance time Number of offers Selection strategy Expected performance: Reservation cost Execution cost Failure probability Duration (if successful) Duration (if failed) Variances of above

Flexible Provisioning of Service Workflows Contingency Planning  Atomic strategies represent single attempt at completing a task.  We can build simple plans from several such strategies to deal with failures: 38 Expected task performance: Success probability Reservation cost Execution cost Duration Variance

Flexible Provisioning of Service Workflows Overlapping Provisioning  Finally, associate a late probability p l with each task plan.  This indicates when services should be provisioned.  Higher p l results in less delays when provisioning in advance, but also increases probability that provisioned offers are lost when preceding tasks overrun.  Use heuristic based on critical path to estimate delays and to determine during which task to provision. 39 t x-2 t x-1 txtx txtx p l = 0.0 provision after t x-1 t x-2 t x-1 txtx txtx p l = 0.05 provision during t x-1 t x-2 t x-1 txtx txtx p l = 0.1 provision after t x-2

Flexible Provisioning of Service Workflows Strategy Summary  Given a high-level plan and late probability for each task, estimate utility in a similar manner as for on demand invocation, but include delays and reservation costs.  Optimise this using simulated annealing.  At run-time, follow task strategies, then incorporate information about provisioned offers and adapt strategy accordingly. 40

Flexible Provisioning of Service Workflows Empirical Evaluation  Small 8-task workflow with 5 service types.  Offer characteristics drawn from uniform distributions.  Comparison with three benchmark strategies:  Global QoS  Adaptive Global QoS  Local QoS  Also assume services always provide refunds for failures. 41

Flexible Provisioning of Service Workflows Empirical Evaluation 42

Flexible Provisioning of Service Workflows Empirical Evaluation 43

Flexible Provisioning of Service Workflows Empirical Evaluation 44

Flexible Provisioning of Service Workflows Empirical Evaluation 45

Flexible Provisioning of Service Workflows Conclusions  We proposed a novel algorithm that uses redundancy and dynamic re-provisioning to deal with uncertain service providers.  It does this in a flexible way by reasoning about service behaviours in the context of a decision-theoretic framework.  We first showed how it applies to scenarios where services are invoked on-demand, then extended it to environments with advance agreements.  In most scenarios considered, our strategy outperforms the state of the art in service provisioning. 46

Flexible Provisioning of Service Workflows Future Work  Improved prediction of workflow durations.  More expressive workflow models with branches and loops.  Consider more dynamic environments.  Incorporate meta-reasoning about time spent on optimisation. 47

Flexible Provisioning of Service Workflows Bibliography  Presented work from:  Stein, Jennings, Payne (2007). Provisioning Heterogeneous and Unreliable Providers for Service Workflows. In: AAAI-07. pp  Stein, Jennings, Payne (2008). Flexible Service Provisioning with Advance Agreements. In: AAMAS-08. (in press).  Related work on homogeneous providers:  Stein, Payne, Jennings (2008). Flexible Provisioning of Web Service Workflows. In: ACM Toit 8(4). (in press).  Other related work on QoS-based optimisation:  Zeng et al (2003). Quality driven web services composition. In: WWW-03. pp  Zeng et al (2004). QoS-Aware Middleware for Web Services Composition. In IEEE Soft. Eng. pp  Yu and Lin (2005). Service Selection Algorithms for Composing Complex Services with Multiple QoS Constraints. In: ICSOC-05.  Canfora et al (2005). QoS-Aware Replanning of Composite Web Services. In: ICWS-05. pp

Thank you! Any Questions? This work was sponsored by::