SCMG’12 Igor Trubin, PhD IBM, Sr. IT Architect SEDS-Lite: Using Open Source Tools (R, BIRT.

Slides:



Advertisements
Similar presentations
© 2013 IBM Corporation Reducing Cost with R in IBM Storage Products Manufacturing Elaine Jones Integrated Supply Chain Engineering.
Advertisements

Copyright © SoftTree Technologies, Inc. DB Tuning Expert.
Chapter 9. Performance Management Enterprise wide endeavor Research and ascertain all performance problems – not just DBMS Five factors influence DB performance.
FAST Radar System Engineering Overview. FAST Radar Overview –What’s Required? IIS 6.0  With Microsoft.NET Framework 1.1 and SMTP for MS SQL Server.
SCMG’11 Igor Trubin, PhD SEDS-Lite: Using Open Source Tools (R, BIRT and MySQL) to Report and Analyze Performance Data.
Did Something Change? Using Statistical Techniques to Interpret Service and Resource Metrics Frank Bereznay.
Technical BI Project Lifecycle
To Hear Narration Click “Slide Show” and then Click “From Beginning” “Sample Output”
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part IV Capacity Planning Methodology.
OLAP Cubes and Pivot Tables Leveraging the Power of a Microsoft EPM Solution EPM Customization Series Part 1 February 21 st, 2007 Brendan Giles, PMP, MCP.
1 Part IV Capacity Planning Methodology © 1998 Menascé & Almeida. All Rights Reserved.
Cognos 8.4 Upgrade Business Intelligence. Why Cognos 8.4 Increased Performance on Database due to optimized SQL and more filters passed in native SQL.
CS2032 DATA WAREHOUSING AND DATA MINING
Jet Enterprise.
How Business Intelligence Software Works and a Brief Overview of Leading Products Jai Windsor MIS 5973 December 8, 2005.
Automatic Daily Monitoring of Continuous Processes Theory and Practice MP Welch – Merrill Consultants Frank Bereznay - IBM.
Building Ad-Hoc Reports using the SQL Server 2005 Reporting Services (SSRS) Report Builder (SQL307) Adrian Rupp Business Intelligence Solutions Specialist.
Module 8: Monitoring SQL Server for Performance. Overview Why to Monitor SQL Server Performance Monitoring and Tuning Tools for Monitoring SQL Server.
What is Business Intelligence Business Intelligence (BI) encompasses the processes, tools, and technologies required to transform enterprise data into.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Reporting and Build Statistics Using Business Intelligence By Naga Sowjanya Karumuri Build Team, VMware, Cambridge Summer Internship 2008.
IBM Start Now Business Intelligence Solutions. Agenda Overview of BI Who will buy and why Start Now BI solution Benefit to customer.
>> Building a PPT from the ActiveInterface web pages Chris Harrington Active Interface, Inc.
June 2004Page 1 Disk Subsystem Capacity Management, Based on Business Drivers, I/O Performance Metrics and MASF Igor Trubin, Ph.D. and Linwood Merritt.
Introducing Reporting Services for SQL Server 2005.
© 2008 Ocean Data Systems Ltd - Do not reproduce without permission - exakom.com creation Dream Report O CEAN D ATA S YSTEMS O CEAN D ATA S YSTEMS The.
Informix IDS Administration with the New Server Studio 4.0 By Lester Knutsen My experience with the beta of Server Studio and the new Informix database.
Data Presentation & Graphing Introduction to Mechanical Engineering The University of Texas-Pan American College of Science and Engineering.
OLAP Cubes and Pivot Tables Leveraging the Power of a Microsoft EPM Solution EPM Customization Series Part 1 February 21 st, 2007 Brendan Giles, PMP, MCP.
Enterprise Reporting Solution
DEPICT: DiscovEring Patterns and InteraCTions in databases A tool for testing data-intensive systems.
Performance Dash A free tool from Microsoft that provides some quick real time information about the status of your SQL Servers.
1 Data Warehouses BUAD/American University Data Warehouses.
September 2004Page 1 Mainframe Global and Workload Levels Statistical Exception Detection System, Based on MASF Igor Trubin, Ph.D. and Linwood Merritt.
CERN - IT Department CH-1211 Genève 23 Switzerland t DB Development Tools Benthic SQL Developer Application Express WLCG Service Reliability.
© 2008 Quest Software, Inc. ALL RIGHTS RESERVED. Perfmon and Profiler 101.
What is Sure Stats? Sure Stats is an add-on for SAP that provides Organizations with detailed Statistical Information about how their SAP system is being.
Touchstone Automation’s DART ™ (Data Analysis and Reporting Tool)
CS 157B: Database Management Systems II April 3 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Modeling Virtualized Environments in Simalytic ® Models by Computing Missing Service Demand Parameters CMG2009 Paper 9103, December 11, 2009 Dr. Tim R.
7 Strategies for Extracting, Transforming, and Loading.
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
Correlator GUI Sonja Vrcic Socorro, April 3, 2006.
Drill-Through Features Cognos 8 BI. Objectives  In this module we will examine:  Cognos 8 Drill Through Overview  Model / Package Drill Through  Cross.
Summary Cognos 8 BI. Objectives  In this module we will examine:  major innovations in Cognos 8  review of new functionality in Cognos 8  customer.
Intro to Power BI Azhagappan Arunachalam.  Senior Database Architect   PowerBICentral.com  (blog on getting started.
LabSpeed ™ Data Management software LabSpeed ™ Data Management Software.
TECHVERZE Oracle BI Publisher Online Training. Introduction to Oracle BI Publisher Oracle BI Publisher is the reporting solution to deliver, author, and.
Andy Roberts Data Architect
Business Intelligence Environment Integration with Dynamics NAV Rogers Family Company Matthew McGinley Devraj Ghosh Dominic Miller.
Developing SQL/Server database in Visual Studio Introducing SQL /Server Data Tools Peter Lu.Net Practices Director Principle Architect Nexient March 19.
1 Creating Situational Awareness with Data Trending and Monitoring Zhenping Li, J.P. Douglas, and Ken. Mitchell Arctic Slope Technical Services.
May 2003 Statistical Exception Detection System, Based on MASF Technique Igor Trubin, Ph.D., Kevin McLaughlin Capital One Services, Inc.
MSU Cognos Future Data Services September Cognos Improvements  Architecture  64- bit vs 32- bit  More server power, faster servers  Ghost.
Comprehensive Flexible Global Storage and Search Responsive Available Secure Manageable Federation Coordination Consolidation Transformation Synchronization.
QAD Financial Report Writer
Smarter Technology for Better Business
Capital One Services, Inc.
Build a Dashboard in one Hour
OVirt Data Warehouse 02/11/11 Yaniv Dary BI Software Engineer, Red Hat.
Data Virtualization Tutorial: Introduction to SQL Script
Overview – SOE PatchTT November 2015.
SQL Server Monitoring Overview
Presentation of the eTendersNI service Business Intelligence Module
Exploring Azure Event Grid
Presentation & Demo August 7, 2018 Bill Shelden.
SSIS Data Integration Data Warehouse Acceleration
SSIS Data Integration Data Warehouse Acceleration
Integrated Statistical Production System WITH GSBPM
Presentation transcript:

SCMG’12 Igor Trubin, PhD IBM, Sr. IT Architect SEDS-Lite: Using Open Source Tools (R, BIRT and MySQL) to Report and Analyze Performance Data

SCMG’12 2 A genda oSEDS-Lite project: Open source based implementation of Statistical Exception Detection System - SEDS-Lite Introduction SEDS-Lite Introduction oOpen Source and “free” tools for Capacity Management – review oControl Charts as a good test of using different tools for Capacity management - Power of Control Charts and IT-Chart Concept (Part 1) Power of Control Charts and IT-Chart Concept (Part 1) oR-system scriping to built Control Charts ( System Management by Exception: Power of Control Charts Mar 25, 2009) System Management by Exception: Power of Control Charts oHow To Build IT-Control Chart - Use the Excel Pivot Table!How To Build IT-Control Chart - Use the Excel Pivot Table! oBIRT based Control Chart BIRT based Control Chart oBIRT Data Cubes Usage for Performance Data Analysis BIRT Data Cubes Usage for Performance Data Analysis oBuilding IT-Control Chart by BIRT against Data from the MySQL Database Building IT-Control Chart by BIRT against Data from the MySQL Database oR-Script to Aggregate (ETL to MySQL) Actual data with Base-line data for IT-Control ChartsR-Script to Aggregate (ETL to MySQL) Actual data with Base-line data for IT-Control Charts oEV-Control Chart (to build using BIRT)EV-Control Chart oSimplified Exception Detection using BIRT: UCL=LCL : How many standard deviations do we use for Control Charting? Use ZERO! UCL=LCL : How many standard deviations do we use for Control Charting? Use ZERO! oNon-open source tool usage example - COGNOSNon-open source tool usage example - COGNOS oBenefits and Values of implementing SEDS

SCMG’12 SEDS-Lite Open source based implementation of Statistical Exception Detection System - SEDS-Lite IntroductionSEDS-Lite Introduction

SCMG’12 4 Open Source and “free” tools for Capacity Management Previous reviews: oMeasureIT - Issue Open Source Capacity & Performance Management Tools for Windows & Unix Systems ( Brian Johnson )MeasureIT - Issue Open Source Capacity & Performance oCapacity Planning and Performance Monitoring with Free Tools (for mspcmg 2011! - Adrian Cockcroft)Capacity Planning and Performance Monitoring with Free Tools Adrian Cockcroft The missed point is BI and Generic DB open source tools usage for Performance Analysis and Reporting oBIRT (Business Intelligence and Reporting Tools)BIRT oMySQLMySQL oNoSQLNoSQL oPostgreSQLPostgreSQL

SCMG’12 5 Control Charts as a good test of using different tools for Capacity management What is the Control Chart? o Power of Control Charts and IT-Chart Concept (CMG’09 -’10) Power of Control Charts and IT-Chart Concept

SCMG’12 6 R-system to built Control Charts The input data is Unix File Space Utilization: R-script R-script (published on my blog):published on my blog (FYI: qcc: An R package for quality control charting :qcc: An R package for quality control charting project.org/web/packages/qcc/index.html project.org/web/packages/qcc/index.html ) oEXAMPLE3: Monthly Profile R download: CMG’09 workshop “ Power of Control Charts “, SCMG - Mar 25, 2009 Power of Control Charts

SCMG’12 7 R-system to built Control Charts CMG’09 workshop “ Power of Control Charts “, SCMG - Mar 25, 2009 Power of Control Charts

SCMG’12 BIRT based Control Chart What is BIRT – see BIRT Report Designer Tutorial DemoBIRT Report Designer Tutorial Demo There is a tool that generates control charts against performance data using BIRT:a tool The IT-Control Chart would give much more value for analyzing time stamped historical data. Is that possible to build using BIRT?IT-Control Chart 8

SCMG’12 9 The case where the input data source is CSV file: BIRT based IT-Control Chart: Live demo But how to calculate that UCL and LCL numbers!?

SCMG’12 10 Weekhour calculation: How to calculate that UCL and LCL numbers!? Use EXCEL Pivot Table Pivot Table creation (IT-Chart Builder)IT-Chart Builder Pivot Chart built

SCMG’12 11 Data behind IT Control chart is a Cube: How to calculate that UCL and LCL numbers!? BIRT Data Cubes Usage for Performance Data Analysis Input data is raw stamped measures The result (in the form of BIRT report designer preview) is on the picture at left

SCMG’12 12 (1) The Reference set or base-line is based on the input raw data with some filtering and computed columns (weekday and weekhour) How to calculate that UCL and LCL numbers!? BIRT Data Cubes Usage for Performance Data Analysis (2) the Actual data set is the same but having the different filter: (raw[“date”] Greater “ ”)

SCMG’12 13 (3) To combine both data sets for comparing base-line vs. actual, the “Data Set1” is built as a “Joint Data Set” by the following BIRT Query builder: How to calculate that UCL and LCL numbers!? BIRT Data Cubes Usage for Performance Data Analysis (4) Then the Data Cube was built in the BIRT Data Cube Builder with the structure shown on the following screen: (5) The next step is building report starting with Cross Table (which is picked as an object from BIRT Report designer “Pallete”): The picture above shows also what fields are chosen from Cube to Cross table.

SCMG’12 14 (6) The final step is dropping “Chart” object from “Palette” and adding UCL calculation using Expression Builder for additional Value (Y) Series: How to calculate that UCL and LCL numbers!? BIRT Data Cubes Usage for Performance Data Analysis

SCMG’12 15 To see the result one needs just to run the report or to use a "preview' tab on the report designer window: BIRT Data Cubes Usage for Performance Data Analysis

SCMG’12 16 How to calculate that UCL and LCL numbers!? SQL script for MySQL Database (upload) (1) That raw data (the same) needs to be uploaded to some table (CPUutil) in the MySQL schema (ServerMetric) by using the following script (sqlScriptToUploadCSVforSEDS.sql):

SCMG’12 17 (2) Then the output (result) data (ActualVsHistoric table) is built using the following script (sqlScriptToControlChartforSEDS.sql): How to calculate that UCL and LCL numbers!? SQL script for MySQL Database (Calculate)

SCMG’12 How to calculate that UCL and LCL numbers!? R script against MySQL Database 18 The same SQL script transformed to the RODBC based R-Script:SQL script The result of the script run is the same "ActualVsHistorical" table in the servermentrics database on MySQL The result data is identical with the data used for plotting IT- Control Chart. The data itself can be seen by just typing the data frame name in the R-Console window:

SCMG’12 19 (3) In the BIRT, the connection to MySQL database is established (to MySQLti with schema ServerMetrics to table ActualVsHistorical): Building IT-Control Chart by BIRT against Data from the MySQL Database

SCMG’12 20 (4) the chart is developed the same way like that was done in BIRT based Control Chart post (slide 14):BIRT based Control Chart Building IT-Control Chart by BIRT against Data from the MySQL Database

SCMG’12 21 (5) In BIRT you can specify report parameters, that could be then a part of any constants including for filtering (to change a baseline or to provide server or metric names). Building IT-Control Chart by BIRT against Data from the MySQL Database (6) Finally the report should be run to get the following result, which is almost identical with the one built for BIRT based Control Chart post (slide 15)BIRT based Control Chart Lastly: R-system can be used to run the same SQL script against MySQL by using Package 'RMySQL‘!!!! Package 'RMySQL‘

SCMG’12 22 EV meta-metric introduced in 2001 as a measure of anomaly severity. ( The Exception Value Concept to Measure Magnitude of Systems Behavior Anomalies) The Exception Value Concept to Measure Magnitude of Systems Behavior Anomalies EV-Control Chart (to build using BIRT) It is the difference (integral) between actual data and control limits. In CMG’08 paper “Exception Based Modeling and Forecasting” That metric was plotted using Excel to explain how it could be used for a new trend starting point recognition: “Exception Based Modeling and Forecasting

SCMG’12 23 Plotting that meta-metric and/or two their components (EV+ and EV-) over time gives a valuable picture of system behavior. EV-Control Chart (to build using BIRT) The BIRT example of the EV-chart against the same sample data: EV calculation

SCMG’12 24 EV-Control Chart (to build using BIRT) The EXCEL example of the EV-chart against the same sample data:

SCMG’12 25 It is based on real script developed and successfully tested against real data for BIRT tool to get exceptional servers list: BIRT tool SEDS-Lite: Exception Detection (EV-based) using BIRT Below is example of DB2- like SQL script to implement that. Lite version: (UCL=LCL)

SCMG’12 SEDS elements to build by non-open source tools oTivoli Common Reporting portal to run report and get result: COGNOS example: oReport Studio to write reports: 26 o Framework Manager to build meta data model with some preliminary calculation like: WeekDay=(DAYOFWEEK([ODBC].[cpuutil].[date])-1)*24 +([ODBC].[cpuutil].[hour]) oSimple Exception Detector (EV top list)

SCMG’12 Benefits and Values of implementing SEDS and/or SETDS (Statistical Exception and Trend Detection) FeatureBenefit Automate the IT- and EV- Control Charts generation SEDS-liteHelps to reduce system defects and severe incidents; achieves a faster time to value for analysis; reduce manual work Implement Anomaly detection (as a part of SETDS) SEDS-liteReduce system severe incidents related to Performance and Capacity. Provides smart alerting based on system/apps behavior learning and will be used for truly proactive Capacity Management. Implement Automatic trend recognition (Statistical pattern recognition based) SETDSIncreases time for upgrade planning. Reduce manual work to build trend-forecast. A repeatable process allows the trend recognition to be applied to more metrics, increasing the field of view for potential issues and prediction accuracy. Automate Demand forecasts and trend- forecast reporting with Out-Layers free base- lines SETDSIncreases time for upgrade planning. Eliminate manual work to do correlation analysis and increases time for analysis and decision making. VALUE: Provide deeper insights into the use of technology platform resources and more time to react based on the insights, by correlating types of forecast data to resulting, sometimes subtle changes in technology platform utilization and capacity, and displaying them in a consistent, accurate, repeatable manner. 27

SCMG’12 28 References Global and Application Levels Exception Detection System, Based on MASF Technique Global and Application Levels Exception Detection System, Based on MASF Technique Igor Trubin: “ Global and Application Levels Exception Detection System, Based on MASF Technique ”, Proceedings of the Computer Measurement Group, ( Global and Application Levels Exception Detection System, Based on MASF Technique ( Disk Subsystem Capacity Management Based on Business Drivers I/O Performance Metrics and MASF Disk Subsystem Capacity Management Based on Business Drivers I/O Performance Metrics and MASF Linwood Merritt, Igor Trubin: “ Disk Subsystem Capacity Management Based on Business Drivers I/O Performance Metrics and MASF ”, Proceedings of the Computer Measurement Group, ( Disk Subsystem Capacity Management Based on Business Drivers I/O Performance Metrics and MASF ( Mainframe Global and Workload Level Statistical Exception Detection System, Based on MASF Mainframe Global and Workload Level Statistical Exception Detection System, Based on MASF Linwood Merritt, Igor Trubin: : “ Mainframe Global and Workload Level Statistical Exception Detection System, Based on MASF ”, Proceedings of the Computer Measurement Group, ( Mainframe Global and Workload Level Statistical Exception Detection System, Based on MASFhttp:// Capturing Workload Pathology by Statistical Exception Detection System Capturing Workload Pathology by Statistical Exception Detection System Igor Trubin: “ Capturing Workload Pathology by Statistical Exception Detection System ”, Proceedings of the Computer Measurement Group, ( Capturing Workload Pathology by Statistical Exception Detection System ( System Management by Exception, Part 6 System Management by Exception, Part 6 Igor Trubin: “ System Management by Exception, Part 6 ”, Proceedings of the Computer Measurement Group, ( System Management by Exception, Part 6 ( System Management by Exception, Part Final System Management by Exception, Part Final Igor Trubin: “ System Management by Exception, Part Final ”, Proceedings of the Computer Measurement Group, 2007.( ) System Management by Exception, Part Final Exception Based Modeling and Forecasting Exception Based Modeling and Forecasting Igor Trubin: “ Exception Based Modeling and Forecasting ”, Proceedings of the Computer Measurement Group, ( ) Exception Based Modeling and Forecasting IT-Control Chart IT-Control Chart Igor Trubin: “ IT-Control Chart ”, Proceedings of the Computer Measurement Group, ( ) IT-Control Charthttp://itrubin.blogspot.com/2010/07/my-new-cmg10-paper-it-control-charts.html

SCMG’12 29 SCMG’12 Questions? Everything is ready to build SEDS-Lite: Using Open Source Tools (R, BIRT and MySQL) Igor Trubin, PhD BLOG: