ADVANCED ANOMALY DETECTION IN CANARY TESTING

Slides:



Advertisements
Similar presentations
1 VLDB 2006, Seoul Mapping a Moving Landscape by Mining Mountains of Logs Automated Generation of a Dependency Model for HUG’s Clinical System Mirko Steinle,
Advertisements

Unsupervised Learning
Sensor-Based Abnormal Human-Activity Detection Authors: Jie Yin, Qiang Yang, and Jeffrey Junfeng Pan Presenter: Raghu Rangan.
AP Statistics Section 11.2 B. A 95% confidence interval captures the true value of in 95% of all samples. If we are 95% confident that the true lies in.
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
SKELETON BASED PERFORMANCE PREDICTION ON SHARED NETWORKS Sukhdeep Sodhi Microsoft Corp Jaspal Subhlok University of Houston.
Anomaly Detection in the WIPER System using A Markov Modulated Poisson Distribution Ping Yan Tim Schoenharl Alec Pawling Greg Madey.
Descriptive statistics Experiment  Data  Sample Statistics Sample mean Sample variance Normalize sample variance by N-1 Standard deviation goes as square-root.
On Community Outliers and their Efficient Detection in Information Networks Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei Han 1.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Unsupervised Training and Clustering Alexandros Potamianos Dept of ECE, Tech. Univ. of Crete Fall
Bagging LING 572 Fei Xia 1/24/06. Ensemble methods So far, we have covered several learning methods: FSA, HMM, DT, DL, TBL. Question: how to improve results?
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
Business Statistics - QBM117 Revising interval estimation.
Leon-Guerrero and Frankfort-Nachmias,
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
EVALUATION David Kauchak CS 451 – Fall Admin Assignment 3 - change constructor to take zero parameters - instead, in the train method, call getFeatureIndices()
Anomaly detection Problem motivation Machine Learning.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
Sampling: Theory and Methods
Confidence Intervals for the Mean (σ known) (Large Samples)
Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Using Neural Networks to Predict Claim Duration in the Presence of Right Censoring and Covariates David Speights Senior Research Statistician HNC Insurance.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Correlation and Prediction Error The amount of prediction error is associated with the strength of the correlation between X and Y.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Map of the Great Divide Basin, Wyoming, created using a neural network and used to find likely fossil beds See:
The ANTARES neutrino telescope is located on the bottom of the Mediterranean Sea, 40 km off the French coast. The detector is installed at a depth of 2.5.
CONFIDENTIAL1 Hidden Decision Trees to Design Predictive Scores – Application to Fraud Detection Vincent Granville, Ph.D. AnalyticBridge October 27, 2009.
Agresti/Franklin Statistics, 1 of 88  Section 11.4 What Do We Learn from How the Data Vary Around the Regression Line?
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 5-5 Poisson Probability Distributions.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 Inferences about a Mean Vector Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
STATISTICAL METHODS AND DATA MANAGEMENT TOOLS FOR OUTLIER DETECTION IN TRI DATA Dr. Nagaraj K. Neerchal and Justin Newcomer Department of Mathematics and.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Consensus Extraction from Heterogeneous Detectors to Improve Performance over Network Traffic Anomaly Detection Jing Gao 1, Wei Fan 2, Deepak Turaga 2,
Chapter 8. Process and Measurement System Capability Analysis
Ch 8 Estimating with Confidence 8.1: Confidence Intervals.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
Chapter Confidence Intervals 1 of 31 6  2012 Pearson Education, Inc. All rights reserved.
Section 6.1 Confidence Intervals for the Mean (Large Samples) © 2012 Pearson Education, Inc. All rights reserved. 1 of 83.
A Method to Approximate the Bayesian Posterior Distribution in Singular Learning Machines Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Inference: Conclusion with Confidence
Advanced Quantitative Techniques
Copyright © Cengage Learning. All rights reserved.
Data Mining: Concepts and Techniques
QianZhu, Liang Chen and Gagan Agrawal
Machine Learning for Signal Quality Monitoring
Recent Advances in Iterative Parameter Estimation
Lecture 2d1: Quality of Measurements
Map of the Great Divide Basin, Wyoming, created using a neural network and used to find likely fossil beds See:
Roland Kwitt & Tobias Strohmeier
Intro to Machine Learning
Statistical Inference for the Mean Confidence Interval
Image Captions With Deep Learning Yulia Kogan & Ron Shiff
Why does sampling work?.
- Viterbi training - Baum-Welch algorithm - Maximum Likelihood
Intro to Machine Learning
Emna Krichene 1, Youssef Masmoudi 1, Adel M
Confidence Intervals for the Mean (Large Samples)
Jia-Bin Huang Virginia Tech
Statistical Inference for the Mean: t-test
Using Clustering to Make Prediction Intervals For Neural Networks
Evaluation David Kauchak CS 158 – Fall 2019.
Presentation transcript:

ADVANCED ANOMALY DETECTION IN CANARY TESTING Presented by Tamas Cser – Functionize, Inc. © All rights reserved

Advanced Deployment Methods Advanced deployment methods - Multiple versions in product with split traffic © All rights reserved

Modern Code Deployment Current App Version 95% New App Version Users 5% Users become the QA for your app! © All rights reserved

Canary Testing Approach Release To All Users Current App Version Build User ML Model Measure Behavior Yes Release To User Subset New App Version Measure Behavior Anomaly Detection Good For Release? No Revise App © All rights reserved

User Journeys © All rights reserved

Experimental Method - Clustering Collected 10000 session and 150000 actions Recorded user sessions on sample application Clusters session data by url/element interaction Identified 5 unique clusters in the data Each session is then analyzed to find how far it is from its cluster mean Cluster standard deviation data is transformed to be normally distributed and confidence interval is created. Sessions falling far enough away from their cluster mean to be outside the confidence interval are deemed anomalous --- Clustering of User Sessions Calculate Confidence Intervals for Clusters Algorithms used: Expectation Maximization Iterative method to find maximum likelihood estimate of parameters Akaike Information Criterion Given a collection of models, selects best one relative to the others © All rights reserved

LSTM Model Source: Analytics Vidhya Long Short-Term Memory Type of Recurrent Neural Network Analyzes data sequences/time series Predicts the probability of the next user action Source: Analytics Vidhya © All rights reserved

LSTM model can work together with the EM clustering Hybrid Approach LSTM model can work together with the EM clustering Build LSTM for each cluster of user sessions Since clusters represent similar user behavior, LSTM can more effectively predict the next user action Anomalous session are easier to identify © All rights reserved

Model trained on user data from live sites Process and Results Model trained on user data from live sites 10,000 user sessions were analyzed 80% of data used for training and 20% for testing Predicted the next user action 85% of the time Since there are hundreds of action/element combinations on a page, this is actually quite accurate © All rights reserved

Split Traffic and Canary Modeling © All rights reserved

2% of sessions were identified as outliers Conclusion Successfully segmented sessions into 5 clusters based on similar behaviors 2% of sessions were identified as outliers Anomalous user sessions can be easily identified LSTM model can accurately predict probabilities of actions in a user session. This analysis can be used for gating releases and canary testing Cluster 1: Single event sessions Cluster 2: Login interactions Cluster 3: Account interactions Cluster 4: Blog interactions Cluster 5: Subdomain interactions © All rights reserved