Service Reliability Engineering The Chinese University of Hong Kong

Slides:



Advertisements
Similar presentations
Branch prediction Titov Alexander MDSP November, 2009.
Advertisements

Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.
Henry C. H. Chen and Patrick P. C. Lee
Exploring Latent Features for Memory- Based QoS Prediction in Cloud Computing Yilei Zhang, Zibin Zheng, and Michael R. Lyu
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
7-1 INTRODUCTION: SoA Introduced SoA in Chapter 6 Service-oriented architecture (SoA) - perspective that focuses on the development, use, and reuse of.
A Server-less Architecture for Building Scalable, Reliable, and Cost-Effective Video-on-demand Systems Jack Lee Yiu-bun, Raymond Leung Wai Tak Department.
Learning to Recommend Hao Ma Supervisors: Prof. Irwin King and Prof. Michael R. Lyu Dept. of Computer Science & Engineering The Chinese University of Hong.
Making Services Fault Tolerant
1 Building Reliable Web Services: Methodology, Composition, Modeling and Experiment Pat. P. W. Chan Department of Computer Science and Engineering The.
Task Scheduling and Distribution System Saeed Mahameed, Hani Ayoub Electrical Engineering Department, Technion – Israel Institute of Technology
Scaling Distributed Machine Learning with the BASED ON THE PAPER AND PRESENTATION: SCALING DISTRIBUTED MACHINE LEARNING WITH THE PARAMETER SERVER – GOOGLE,
Improving Robustness in Distributed Systems Jeremy Russell Software Engineering Honours Project.
Ensuring Non-Functional Properties. What Is an NFP?  A software system’s non-functional property (NFP) is a constraint on the manner in which the system.
1 Making Services Fault Tolerant Pat Chan, Michael R. Lyu Department of Computer Science and Engineering The Chinese University of Hong Kong Miroslaw Malek.
Zibin Zheng Jieming Zhu *Rung-Tsong Michael Lyu Service-generated Big Data and Big Data-as-a- Service: An Overview IEEE 2nd International.
H-1 Network Management Network management is the process of controlling a complex data network to maximize its efficiency and productivity The overall.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
Load Balancing Dan Priece. What is Load Balancing? Distributed computing with multiple resources Need some way to distribute workload Discreet from the.
Modeling and Exploiting QoS Prediction in Cloud and Service Computing
Failure Spread in Redundant UMTS Core Network n Author: Tuomas Erke, Helsinki University of Technology n Supervisor: Timo Korhonen, Professor of Telecommunication.
FMEA-technique of Web Services Analysis and Dependability Ensuring Anatoliy Gorbenko Vyacheslav Kharchenko Olga Tarasyuk National Aerospace University.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Quality Attributes of Web Software Applications – Jeff Offutt By Julia Erdman SE 510 October 8, 2003.
Reliability Andy Jensen Sandy Cabadas.  Understanding Reliability and its issues can help one solve them in relatable areas of computing Thesis.
Ketan Patel, Igor Markov, John Hayes {knpatel, imarkov, University of Michigan Abstract Circuit reliability is an increasingly important.
OASIS WSQM TC Meeting Dugki Min. 컴퓨터공학부 건국대학교 Agenda 1. Roll Call 2. Review and approval of the agenda 3. Review and approval of the previous.
Cluster Reliability Project ISIS Vanderbilt University.
Wancai Zhang, Hailong Sun, Xudong Liu, Xiaohui Guo.
Distributed QoS Evaluation for Real- World Web Services Zibin Zheng, Yilei Zhang, and Michael R. Lyu July 07, 2010 Department of Computer.
BFTCloud: A Byzantine Fault Tolerance Framework for Voluntary-Resource Cloud Computing Yilei Zhang, Zibin Zheng, and Michael R. Lyu
1 Discovering Authorities in Question Answer Communities by Using Link Analysis Pawel Jurczyk, Eugene Agichtein (CIKM 2007)
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
--He Xiangnan PhD student Importance Estimation of User-generated Data.
Scalable Computing on Open Distributed Systems Jon Weissman University of Minnesota National E-Science Center CLADE 2008.
Building Reliable SOA from the Unreliable Web Services Ben, Zibin ZHENG Department of Computer Science & Engineering The Chinese University of Hong Kong.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
14.1/21 Part 5: protection and security Protection mechanisms control access to a system by limiting the types of file access permitted to users. In addition,
An OBSM method for Real Time Embedded Systems Veronica Eyo Sharvari Joshi.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu Yu Kang, Yangfan Zhou, Zibin Zheng, and Michael R. Lyu {ykang,yfzhou,
WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction Jieming Zhu, Yu Kang, Zibin Zheng and Michael R. Lyu The.
WS-DREAM: A Distributed Reliability Assessment Mechanism for Web Services Zibin Zheng, Michael R. Lyu Department of Computer Science & Engineering The.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
ICDCS 2014 Madrid, Spain 30 June-3 July 2014
Qusay H. Mahmoud CIS* CIS* Service-Oriented Computing Qusay H. Mahmoud, Ph.D.
Ranking of Database Query Results Nitesh Maan, Arujn Saraswat, Nishant Kapoor.
A Clustering-based QoS Prediction Approach for Web Service Recommendation Shenzhen, China April 12, 2012 Jieming Zhu, Yu Kang, Zibin Zheng and Michael.
ICONIP 2010, Sydney, Australia 1 An Enhanced Semi-supervised Recommendation Model Based on Green’s Function Dingyan Wang and Irwin King Dept. of Computer.
Banaras Hindu University. A Course on Software Reuse by Design Patterns and Frameworks.
1 Developing Aerospace Applications with a Reliable Web Services Paradigm Pat. P. W. Chan and Michael R. Lyu Department of Computer Science and Engineering.
Investigating QoS of Web Services by Distributed Evaluation Zibin Zheng Feb. 8, 2010 Department of Computer Science & Engineering.
“The Role of Experience in Software Testing Practice” A Review of the Article by Armin Beer and Rudolf Ramler By Jason Gero COMP 587 Prof. Lingard Spring.
Reputation-aware QoS Value Prediction of Web Services Weiwei Qiu, Zhejiang University Zibin Zheng, The Chinese University of HongKong Xinyu Wang, Zhejiang.
Item-Based Collaborative Filtering Recommendation Algorithms
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Experience Report: System Log Analysis for Anomaly Detection
A Collaborative Quality Ranking Framework for Cloud Components
WSRec: A Collaborative Filtering Based Web Service Recommender System
CARP: Context-Aware Reliability Prediction of Black-Box Web Services
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.
Cross-library API Recommendation Using Web Search Engines
Pinjia He, Jieming Zhu, Jianlong Xu, and
RECOMMENDER SYSTEMS WITH SOCIAL REGULARIZATION
Web Service and Fault Tolerance Stratregy Evaluation and Selection
Exploring Latent Features for Memory-Based QoS Prediction in Cloud Computing Yilei Zhang 17/05/2011.
Huifeng Sun 1, Zibin Zheng 2, Junliang Chen 1, Michael R. Lyu 2
Topic: Semantic Text Mining
Presentation transcript:

Service Reliability Engineering The Chinese University of Hong Kong Zibin Zheng The Chinese University of Hong Kong 1

Modern Web Applications Web applications permeate modern software systems! Composed by distributed Web services Service reliability engineering posts new research challenges

Building Reliable Service-Oriented Systems It is difficult to build reliable service-oriented systems Reliability of the system is highly dependent on the invoked Web services Web services are provided by other organizations The Internet environment is unpredictable Service reliability engineering becomes a major challenge Approaches for building reliable service-oriented systems Fault avoidance Fault removal Fault tolerance Fault prediction

Reliability Prediction of Web Services Target: determine the optimal Web service from a set of functionally equivalent candidates. Method 1: evaluate all the candidates Weak points of Method 1: Expensive: Requiring a lot of Web service invocations Time-consuming: A large number of candidates to evaluate Inaccurate: Users are not experts on WS evaluation

Reliability Prediction of Web Services Method 2: Predict reliability/QoS of Web services The prediction should be personalized for a specify user A user may invoked some or none of the service candidates Advantages: Low cost: no additional WS invocations for evaluation purpose Efficient: no need to wait for the evaluation results Research problem: How to make personalized Web service reliability/QoS prediction?

Approach 1: Neighborhood-Based [ICSE’10, ACM SIGSOFT Distinguished Paper Award] Reliability is extended to Quality-of-Service (QoS) Key idea: Using past usage experiences of similar users. Issue: How to calculate user similarity?

Approach 1: Neighborhood-Based Similarity Computation User-item matrix: M×N, each entry is the failure probability of a Web service             0.5 ? Pearson Correlation Coefficient (PCC)

Approach 1: Neighborhood-Based Drawbacks of Neighborhood-based Approach Computational complexity Matrix sparsity problem Not easy to find similar users (or similar items)

Approach 2: Model-Based Approach 2: Model-based Approach [IEEE TSC’13a] A small number of factors influencing the QoS performance A user’s Web service QoS values correspond to a linear combination of the factors Each row of UT is a set of feature factors, and each column of V is a set of linear predictors  Matrix Factorization (MF) The error between the actual Value and the prediction s1 s2 s3 s4 s5 s6 Regularization terms UT V

Fault-Tolerant Web Services Nature of service-oriented systems by Web services Web services are hosted by other organizations May contain faults May become unavailable suddenly Source codes of the Web services are unavailable The Internet environment is unpredictable Resources are abundant How to employ the redundant Web services and their QoS values for building fault-tolerant service-oriented systems?

Adaptive Fault Tolerance [ESE’10] Internet environment is highly dynamic Network condition changes frequently and abruptly Continuous software/hardware updates of the Web services Server workload changes without notice Traditional fault tolerance strategies are too static Fixed at design time Cannot adapt to the dynamic environment

Adaptive Fault Tolerance Idea: determine optimal fault tolerance strategy dynamically at runtime based on the Web service QoS values.

Fault Tolerance for Significant Components [IEEE TSC’12] Ranking Build a component invocation graph Identify the most significant components Select optimal fault tolerance strategies for the significant components. Idea: A component that is invoked frequently by other important components is considered as a significant component (PageRank).

Fault-Tolerant Web Services Framework Fault-Tolerant Framework [IEEE TC’13] Target: Optimal fault tolerance strategy selection for each task under local and global constraints Local constraint: Response time of t1 < 1000 ms Global constraint: Success-rate of the whole service plan > 99% Candidates: Basic fault tolerance strategies and their combinations

Dataset Publication Dataset Publication [ICWS’10, Best Student Paper Award] The evaluation results are released at: http://www.wsdream.net Downloaded about 2000 times by more than 150 universities (or research institutes) from more than 30 counties. The datasets can be used in research topics of: Web service selection and composition Web service recommendation Web service QoS prediction Fault-tolerant Web services ………………… The largest-scale real-world Web service reliability evaluation Related papers have been cited more than 1000 times