Aligning Event Logs and Process Models for Multi- perspective Conformance Checking: An Approach Based on ILP Massimiliano de Leoni Wil M. P. van der Aalst.

Slides:



Advertisements
Similar presentations
Jorge Muñoz-Gama Josep Carmona
Advertisements

Deductive and inductive methods for program synthesis Jelena Sanko, Jaan Penjam Institute of Cybernetics October 29, 2005.
Han-na Yang Trace Clustering in Process Mining M. Song, C.W. Gunther, and W.M.P. van der Aalst.
Michael Alves, Patrick Dugan, Robert Daniels, Carlos Vicuna
A Case-based Approach to Business Process Monitoring S. Montani 1, G. Leonardi 1 1 Dipartimento di Informatica, University of Piemonte Orientale, Alessandria,
CONFORMANCE CHECKING IN THE LARGE: PARTITIONING AND TOPOLOGY Jorge Munoz-Gama, Josep Carmona and Wil M.P. van der Aalst.
Decision Tree Approach in Data Mining
Aligning Event Logs And Declare Models for Conformance Checking Massimiliano de Leoni, Fabrizio Maggi Wil van der Aalst.
Data Conformance Checking using Optimal Alignments Felix Mannhardt, Massimiliano de Leoni, Hajo A. Reijers.
Models vs. Reality dr.ir. B.F. van Dongen Assistant Professor Eindhoven University of Technology
Beyond Process Mining: Discovering Business Rules From Event Logs Marlon Dumas University of Tartu, Estonia With contributions from Luciano García-Bañuelos,
Mining Declarative Models using Intervals Jan Martijn van der Werf Ronny Mans Wil van der Aalst.
Block-Structured Process Discovery: Filtering Infrequent Behaviour Sander Leemans Dirk Fahland Wil van der Aalst Eindhoven University of Technology.
A Survey of Process Mining in ProM By Jantima Polpinij Decision Systems Lab (DSL) Seminar School of Computer Science and Software Engineering Faculty of.
Boudewijn van Dongen /t Multi-phase process mining Building instance graphs.
Process Mining in CSCW Systems All truths are easy to understand once they are discovered; the point is to discover them. Galileo Galilei ( )
CES 514 – Data Mining Lecture 8 classification (contd…)
Business Alignment Using Process Mining as a Tool for Delta Analysis Prof.dr.ir. Wil van der Aalst Eindhoven University of Technology Department of Information.
/faculteit technologie management Process Mining and Security: Detecting Anomalous Process Executions and Checking Process Conformance Wil van der Aalst.
Boudewijn van Dongen April 27, 2005 The ProM-framework A framework for integrating process mining tools.
Process Mining: Discovering processes from event logs All truths are easy to understand once they are discovered; the point is to discover them. Galileo.
Process Mining: An iterative algorithm using the Theory of Regions Kristian Bisgaard Lassen Boudewijn van Dongen Wil van.
/faculteit technologie management 1 Process Mining: Extension Mining Algorithms Ana Karla Alves de Medeiros Ana Karla Alves de Medeiros Eindhoven University.
1 Software Testing and Quality Assurance Lecture 5 - Software Testing Techniques.
A university for the world real R © 2009, Chapter 17 Process Mining and Simulation Moe Wynn Anne Rozinat Wil van der Aalst Arthur.
A university for the world real R © 2009, Chapter 23 Epilogue Wil van der Aalst Michael Adams Arthur ter Hofstede Nick Russell.
Insuring Sensitive Processes through Process Mining Jorge Munoz-Gama Isao Echizen Jorge Munoz-Gama and Isao Echizen.
Scientific Workflows Within the Process Mining Domain Martina Caccavale 17 April 2014.
Jorge Munoz-Gama Advisor: Josep Carmona December 2014 CONFORMANCE CHECKING AND DIAGNOSIS IN PROCESS MINING.
HIERARCHICAL CONFORMANCE CHECKING OF PROCESS MODELS BASED ON EVENT LOGS Jorge Munoz-Gama, Josep Carmona and Wil M.P. van der Aalst.
Jorge Muñoz-Gama Universitat Politècnica de Catalunya (Barcelona, Spain) Algorithms for Process Conformance and Process Refinement.
Process Mining Control flow process discovery
EVENT-BASED REAL-TIME DECOMPOSED CONFORMANCE ANALYSIS Seppe vanden Broucke, Jorge Munoz-Gama, Josep Carmona, Bart Baesens, and Jan Vanthienen CoopIS 2014.
Types of IP Models All-integer linear programs Mixed integer linear programs (MILP) Binary integer linear programs, mixed or all integer: some or all of.
New Modeling Techniques for the Global Routing Problem Anthony Vannelli Department of Electrical and Computer Engineering University of Waterloo Waterloo,
Jianmin Wang 1, Shaoxu Song 1, Xiaochen Zhu 1, Xuemin Lin 2 1 Tsinghua University, China 2 University of New South Wales, Australia 1/23 VLDB 2013.
Chapter 1. Formulations 1. Integer Programming  Mixed Integer Optimization Problem (or (Linear) Mixed Integer Program, MIP) min c’x + d’y Ax +
Efficiency of Alignment-based algorithms B.F. van Dongen Laziness! (Gu)estimation! Implementation effort?
Compliance Management Elham Ramezani 1. Enterprises are governed by regulations Compliance Report ✓ ✗ ✗ ✓ ✗ ✗ Laws & Regulations Diagnostic Information.
A State Perspective Mentoring Conference New Orleans, LA 2/28/2005 RCRAInfo Network Exchange.
Decision Mining in Prom A. Rozinat and W.M.P. van der Aalst Joosung, Ko.
Alignment-based Precision Checking A. Adriansyah 1, J. Munoz Gamma 2, J. Carmona 2, B.F. van Dongen 1, W.M.P. van der Aalst 1 Tallinn, 3 September 2012.
Operations Research The OR Process. What is OR? It is a Process It assists Decision Makers It has a set of Tools It is applicable in many Situations.
Decomposing Data-aware Conformance Checking Massimiliano de Leoni, Jorge Munoz-Gama, Josep Carmona, Wil van der Aalst PAGE 0.
"Decomposing Alignment- based Conformance Checking of Data-aware Process Models" Massimiliano de Leoni, Jorge Muñoz-Gama, Josep Carmona, Wil van der Aalst.
LINEAR PROGRAMMING.
Decomposing Replay Problems: A Case Study Eric Verbeek and Wil van der Aalst.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Controlling Break-the- glass Through Alignment A. Adriansyah B.F. van Dongen N. Zannone Washington D.C., 11.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 2 - Chapter 7 Optimization.
Maikel Leemans Wil M.P. van der Aalst. Process Mining in Software Systems 2 System under Study (SUS) Functional perspective Focus: User requests Functional.
Understanding Deviations AIS Nov.2014 Elham Ramezani Dirk Fahland Wil M.P. van der Aalst.
Process Mining – Concepts and Algorithms Review of literature on process mining techniques for event log data.
Discovering Models for State-based Processes M.L. van Eck, N. Sidorova, W.M.P. van der Aalst.
The Automated Discovery of Hybrid Processes Fabrizio M. Maggi University of Tartu Tijs Slaats* IT University of Copenhagen Exformatics Hajo A. Reijers.
Multi-phase Process Mining: Building Instance Graphs
Profiling based unstructured process logs
1 Queensland University of Technology, Brisbane, Australia
Exploring processes and deviations
Efficient implementations of Alignment-based algorithms
David Redlich, Thomas Molka, Wasif Gilani, Awais Rashid, Gordon Blair
A General Framework for Correlating Business Process Characteristics
Wil van der Aalst Eindhoven University of Technology
Decomposed Process Mining: The ILP Case
Chapter 8 Goal Programming.
VLSI CAD Flow: Logic Synthesis, Placement and Routing Lecture 5
Business Alignment Using Process Mining as a Tool for Delta Analysis
Neo4j for Process Mining
5 juli 2019 Process Mining and Security: Detecting Anomalous Process Executions and Checking Process Conformance Wil van der Aalst Ana Karla A. de Medeiros.
Chapter 1. Formulations.
Presentation transcript:

Aligning Event Logs and Process Models for Multi- perspective Conformance Checking: An Approach Based on ILP Massimiliano de Leoni Wil M. P. van der Aalst

Conformance Checking PAGE 1 model conformance /  A B event log event stream DB extract solved for control- flow unsolved for other perspectives

(a; {A = 3000;R = Michael; E = Pete}); (b; {V = OK;E = Sue}); (c; {I = 530;D = OK;E = Sue}); (f; {E = Pete}); Example: A Credit Institute PAGE 2 For such a credit amount, should be interest <450 «Sue» not authorized to perform b: is not Assistant Activity h hasn’t been executed: D cannot be OK (a; {A = 3000;R = Michael; E = Pete}); (b; {V = OK;E = Pete}); (c; {I = 530;D = OK;E = Sue}); (d, {I = 599; D = NOK; E = Sue}); (f; {E = Pete}); (a; {A = 3000;R = Michael; E = Pete}); (b; {V = OK;E = Pete}); (c; {I = 530;D = OK;E = Sue}); (d, {I = 599; D = NOK; E = Sue}); (f; {E = Pete}); (a; {A = 5001;R = Michael; E = Pete}); (b; {V = OK;E = Pete}); (c; {I = 530;D = NOK;E = Sue}); (f; {E = Pete}); (a; {A = 5001;R = Michael; E = Pete}); (b; {V = OK;E = Pete}); (c; {I = 530;D = NOK;E = Sue}); (f; {E = Pete}); Activity d should have occurred, since amount<5000

C C B B F F A A S S x = y x >= 10 and y’ >= y y >= x+z Petri Nets with Data E E y <= x+z X Y Z

A A B B C C S S F F x = y x >= 10 and y’ >= y y >= x+z A trace without problems S {z=1,y=0} – A{x=10} – C{y=11} – E – A{x=10} – B{y=11} - F E E y <= x+z X Y Z

A A B B C C S S F F x =y x >= 10 and y’ >= y y >= x+z A trace with problems S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} E E y <= x+z X Y Z >> F A{x=10} C{y=20}B{y=20}A{x=10}

A A B B C C S S F F x =y x >= 10 and y’ >= y y >= x+z Alignments between Log and Process Traces Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - E E y <= x+z X Y Z Process: S {z=10,y=0} – A{x=10} – C{y=20} – E – A{x=10} – B{y=20} - F Move in both with incorrect write operations Move in process Move in both with correct write operations

Determining where misconformances occur also depends on which are considered «more severe» Many Alignment Exists! Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - A A B B C C S S F F x =y x >= 10 and y’ >= y y >= x+z E E y <= x+z X Y Z Process: S {z=10,y=0} – A{x=10} – C{y=20} – E – A{x=10} – B{y=20} - F Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S {z=1, y=0} – A{x=10} – C{y=11} – E – A{x=3} – B{y=13} - F Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S {z=1, y=0} – A{x=1} – B{y=11} – E – A{x=3} – B{y=13} - F The problem of checking conformance can be formulated as finding an «optimal» alignment

Cost of alignments Each move is associated with a cost Cost of alignment is the sum of the costs of its moves PAGE 8 A A B B C C S S F F E E X Y Z : Cost of “move on model” : Cost of “move on log” : Cost of writing a wrong value : Cost of non- writing a variable

Cost of alignments: some examples PAGE 9 Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S {z=10,y=0} – A{x=10} – C{y=20} – E – A{x=10} – B{y=20} - F Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S {z=1, y=0} – A{x=10} – C{y=11} – E – A{x=3} – B{y=13} - F Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S {z=1, y=0} – A{x=1} – B{y=11} – E – A{x=3} – B{y=13} - F A A B B C C S S F F E E X Y Z An optimal alignment: an alignment with the lowest cost

Process: S – A – C – E – A – B - F Process: S {z=1, y=0} – A{x=10} – C{y=11} – E – A{x=3} – B{y=13} - F Finding an optimal alignment 1.Computing the control-flow alignment using existing techniques [1] PAGE 10 Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - 2.Enriching the alignment with the data operations. The alignment is enriched, thus minimizing the cost of the alignment Naturally formulated as an Mixed Integer Linear Program [1] A. Adriansyah, A., B. F. van Dongen, B.F., W.M.P. van der Aalst,W.M.P. : Conformance Checking Using Cost-Based Fitness Analysis. IEEE International Enterprise Distributed Object Computing Conference (EDOC 2011)

Construction of the ILP problem : converting guards to ILP constraints Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S – A – C – E – A – B - F Variables: v i = the i-th write operation for variable v Process: S {z= z 1,y=y 1 } – A{x=x 1 } – C{y= y 2 } – E – A{x= x 2 } – B{y= y 3 } - F A A B B C C S S F F E E X Y Z x =y x >= 10 and y’ >= y y <= x+z... y >= x+z

Construction of the ILP problem : the objective function Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S {z= z 1,y=y 1 } – A{x=x 1 } – C{y= y 2 } – E – A{x= x 2 } – B{y= y 3 } - F A A B B C C S S F F E E X Y Z x =y x >= 10 and y’ >= y y <= x+z y >= x+z

Construction of the ILP problem : converting guards to constraints Optimal Solution: Log: S {z=10,y=0} – A{x=1} – C{y=11} – E – A{x=3} – B{y=13} - Process: S {z= z 1,y=y 1 } – A{x=x 1 } – C{y= y 2 } – E – A{x= x 2 } – B{y= y 3 } - F Alignment cost= Value Objective Function at minimum + Cost of Log/Process Move=2+2

Additional features Support for the OR operator in the transition guards Support for any time of primitive types (through a bidirectional conversion to numerical value): Boolean Timestamp String PAGE 14

Implementation and Experiments Implemented as ProM plugins Tested with a real-life event log The event log recorded the execution of process cases with a Dutch insurance institute The process model was generated using the new ProM Decision Miner [2] PAGE 15 [2] Massimiliano de Leoni, Wil M. P. van der Aalst: Data-aware process mining: discovering decisions in processes using alignments. 28th Annual ACM Symposium on Applied Computing (SAC '13)

Visualization of the optimal alignments in ProM Each row is the optimal alignment of a different log trace Alignments are visualized a sequence of triangles Each triangle corresponds to a move The type of move is identified by the colour −Green: move in both with correct write operations −White: move in both with incorrect write operations −Purple: move in model; Yellow: move in log PAGE 16

A Helicopter view on the optimal alignments : projection onto models When there are so many alignments (12319), difficult to gain a general insight into the most common problems We also provide a visualization where alignments are projected onto the model Darker colour for activities/variables more involved into deviations PAGE 17

A Helicopter view on the optimal alignments : root causes of deviations Transitions are associated with decision trees: Features: Process Variables + #execution of transitions in the prefix Classification Attribute: The move types (in log, model, etc.) PAGE 18

Execution-Time Analysis PAGE 19