How (and Why) to Build a Data Warehouse 101

Slides:



Advertisements
Similar presentations
Conducting your own Data Life Cycle Audit
Advertisements

Writing Good Use Cases - Instructor Notes
Railcar 101 The Basics of Railcars: Car Types, Components and Rules
Bringing the Benefits of Technology to Your Organisation Dr Simon Davey, Omega Alpha Limited.
Managing Hardware and Software Assets
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Chapter 3 Launching a Business on the Internet. Awad –Electronic Commerce 1/e © 2002 Prentice Hall 2 OBJECTIVES Introduction of E-Business Life Cycle.
Cost Management ACCOUNTING AND CONTROL
Science Subject Leader Training
ASYCUDA Overview … a summary of the objectives of ASYCUDA implementation projects and features of the software for the Customs computer system.
Readers Build Good Habits
Readers Build Good Habits
Accounting and Financial Reporting
Modern Systems Analyst and as a Project Manager
QA & QI And Accreditation.  A continuous process to review, critique, and implement measurable positive change in public health policies, programs or.
1 Implementing Internet Web Sites in Counseling and Career Development James P. Sampson, Jr. Florida State University Copyright 2003 by James P. Sampson,
The SeETL Business Presentation 1/1/2012
Chapter 5 – Enterprise Analysis
Effective Test Planning: Scope, Estimates, and Schedule Presented By: Shaun Bradshaw
© 2003 By Default! A Free sample background from Slide 1 A First Course in Database Management Jeanne Baugh Department of.
Students will be able to
Fifth Edition 1 M a n a g e m e n t I n f o r m a t i o n S y s t e m s M a n a g I n g I n f o r m a t i o n T e c h n o l o g y i n t h e E – B u s i.
© 2010 TIBCO Software Inc. All Rights Reserved. Confidential and Proprietary. TIBCO Spotfire Application Data Services TIBCO Spotfire European User Conference.
1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
Chapter 1 Business Driven Technology
Record Keeping F OR A S MALL B USINESS. RECORD KEEPING 2 Welcome 1. Agenda 2. Ground Rules 3. Introductions.
Global Entrepreneurship and Small Business Management
Node Lessons Learned James Hudson Wisconsin Department of Natural Resources.
Helping Families Promote Children’s Social Emotional Competence Based on materials from Center for Social Emotional Foundation of Early Learning (CSEFEL)
© Paradigm Publishing Inc Chapter 10 Information Systems.
Missouri Enterprise Helping Missouri Manufacturers Make More, Sell More, Earn More Missouri Manufacturer Survey: The Top Ten Things You Told Us.
© Prentice Hall CHAPTER 15 Managing the IS Function.
Mike Fisher Royal Liverpool and Broadgreen University Hospitals NHS Trust 1.
Chapter 15 Living a Balanced Life Chapter 15 Living a Balanced Life Lesson 15.1 Work Isn’t Everything! Lesson 15.1 Work Isn’t Everything!
Key Concepts and Skills
Do not put content on the brand signature area For agent use only. Not for public distribution. ©2014 Voya Services Company. All rights reserved. CN
How to Grow Your Business Larry N. Harrison & Co. CPAs & Consultants Larry N. Harrison & Co. CPAs & Consultants.
The Internet of Riedwaan Bassadien Platform Strategy Manager Microsoft Everything Your things.
Enterprise Business Processes and Reporting (IS 6214) MBS MIMAS 19 th Jan 2011 Fergal Carton Business Information Systems.
1 Meeting the Reporting Challenges at International Paper.
Accelerated Access to BW Al Weedman Idea Integration.
Data Warehouse Toolkit Introduction. Data Warehouse Bill Inmon's paradigm: Data warehouse is one part of the overall business intelligence system. An.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
VENDORS, CONSULTANTS AND USERS
SharePoint 2010 Business Intelligence Module 2: Business Intelligence.
Database Systems – Data Warehousing
+ Facts + Figures: Your Introduction to Meetings Data Presented By: Name Title Organization.
GBA IT Project Management Final Project – “ FoodMart Corp - Making use of Business Intelligence” July 12, 2004 N.Khuda.
Getting synergies from rapid access to data
Database Systems – CRM DEFINITIONS CRM - Customer Relationship Management CRM usually refers to a strategic solution that helps businesses identify the.
Business Intelligence Zamaneh Jahed. What is Business Intelligence? Business Intelligence (BI) is a broad category of applications and technologies for.
© 2007 by Prentice Hall 1 Introduction to databases.
BMI Consulting Business Intelligence Roadmap Business Analysis Requirements Subject Modeling.
Dan Grady The search for the killer productivity application is over… Copyright 2009, Information Builders. Slide 1.
LS Retail BI Information/requirements/deployment steps.
Certification for Enterprise Account Services Class Two Presenting the Boundless Story.
Darryl Thompson, DOM360 Results versus Promises: Is Your Vendor a Good Bet.
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
Chapter 7 Enterprise Resource Planning (ERP). Objectives After studying the chapter, students should be able to.. Explain definition of Enterprise Resource.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Rajesh Bhat Director, PLM Analytics Applications
Intended for Knowledge Sharing only Optimizing Product Decisions with Insights THE PRODUCT MANAGEMENT & INNOVATION EVENT 2016 Jan 2016.
Foundations of Information Systems in Business
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Unifying Talent Management. Harnessing the Power of Workforce Intelligence in Talent Planning to Drive Business Performance.
ERP vendor perspective
Business Intelligence & Data Warehousing
of our Partners and Customers
C.U.SHAH COLLEGE OF ENG. & TECH.
Data Warehousing Concepts
Presentation transcript:

How (and Why) to Build a Data Warehouse 101 From Questions to Eight Steps to DW Heaven! @aupward & #meshU

Declaration of Data Independence When in the Course of an increasingly competitive global economy it becomes necessary for one data set to dissolve its connections to a constraining environment, the separate but inherently unequal station to which the Laws of Whose budget is larger prevails…. <snip> We hold these truths to be self-evident, that all data is created equal, that they are endowed by their Creator with metadata that holds important context and critical unalienable rights, that among these are compliance, security and the right to drive business value…

And finally for the database query writers… http://xkcd.com/327/

Who is Antony Upward Funny after lunch? Management Consultant – Business Systems – 20+ years Business Analyst, Project Manager / Program Director, Business Architect MIS @ Apple – GUI to Data Warehouse over WAN (DECNet) in 1990! SAP @ Bell family of companies and elsewhere – including full SAP Data Warehouse Implementation Academic… Teaching / Learning Ryerson University School of Management Business Technology Management Program Business Process Analysis and Design IT Governance and the Role of the CIO Producing the people you need / want to hire Understand Business AND Technology New…Edward James Consulting… Sustainability Business Architect Aligning People, Process and Technology to achieve Sustainable Results… Sustainable for People, Planet and Profits Returning to University… Masters in Environmental Studies with Graduate Diploma in Business and the Environment @ York / Schulich

The Data Warehouse – a key tool Why? Your world is… Complex Dynamic Data… coming at you like a fire hose How do you: Make sense of your world? Make informed decisions to: Lower costs, increase revenues, increase productivity? Make plans… and know if you’re achieving them? The Data Warehouse – a key tool

Customer – Supplier – Industry You’ve Launched… Your Communities Customer – Supplier – Industry Your Market Your Suppliers Your Customers Your Company Your Talent Pool Potential Customers Employees Your Regulator Your Bank Your Investors Your Advisors (Accountant, Lawyer, etc.) ..and You’re Starting to see Your World is Complex…

…which you’ve started to interconnect… …and you have systems… Internet / Cloud Customer Care Email Order Entry Billing Assets Projects Retail Web Site / Portal Finance A/R GL A/P Inventory/ Warehouse Web Site Content Management Purchasing Community Wholesale & Supplier Web Sites Supplier Bank …which you’ve started to interconnect…

…and you have transactions… Customer Sales Orders Financial Journal Entries Payments to Suppliers (Cheques) Proposals From Suppliers Marketing Campaigns (Qualified Prospects) Community (blog postings, tweets/buzz, wall postings…) Payroll Proposals & Quotes for Customer Customer Invoices Purchase Orders Customer Quotes Supplier Invoices Website Activity (browsing, choosing, sharing, using…) Customer Payments Customer Inquiries Praise & Complaints …more and more all the time…

… and you have started to realize your data has structure … Transactional Data Data related to specific business events Master data Data related to the objects involved in your business, which change over time – but remain constant over many transactions Meta data Data about your transactional and master data Transactional Data Data related to specific business events Customer placing an order (Sales Orders), Buying something from a vendor (Purchase Orders), etc. Master data Data related to the objects involved in your business, which change over time – but remain constant over many transactions Customers, Products, Employees, Suppliers, G/L Account Numbers, Warehouse Layout, Payment Methods, etc. Meta data Data about your transactional and master data Usually not written down… but is known… e.g. how do people know what each field of a customer order means? … perhaps realizing that all is not right…

…but most of all you have questions How many customers do I have? When did the last buy from me? What did they buy? Where are my customers? What do my customers have in common with each other? Which combinations of products do they buy…so I can make recommendations? How did they behave on the website when deciding what to buy? How are they using my application, product or service? Who is paying me on time / late? Who has upgraded and what led them to decide to upgrade…so I can encourage other customers to do the same? Which suppliers ship to me reliably (on-time, right quantities, no DOAs)? Which suppliers invoice me accurately? What do my customers tell me they like? What is being discussed about my company in the community? What are the trends in the complaints I receive … how can I improve? Am I easy to do business with? Am I meeting my delivery promises to my customers? (no back-orders, on-time delivery, etc.) Am I spending the right amount on my inventory? (too much, not enough) How long does it take for me from getting an order to getting paid? Is every order profitable? Did I make money in the last month, week, day, hour? How much did I spend on X? …and despite all this data… …you don’t have ready answers!

… an aside… wondering why… if we have all the data why we don’t have information? The way we build systems* to help run our transactional business processes … rarely has anything to do with the information we need to manage, plan, change those same processes† Plus… Data is very very slippery‡… it requires huge business discipline to keep it all consistent over time… and soon as data is inconsistent it is hard to turn it into information * Historically this was because we had no choice – it simply wasn’t possible to cost effectively buy the hardware or build the systems with the complexity to meet both needs simultaneously † New technologies like in-memory databases will change this… but it will take 5-10 years before it is normal for transactional systems to also be able to provide management information ‡ Despite the claims of the semantic web people we are no closer to solving this problem! Don’t believe the hype

Operational Process (OLTP) The Big Picture…Where the data Warehouse Fits in Running Your Business… Transaction (e.g. Take Order) (e.g. pack & ship) (e.g. invoice) Act Operational Process (OLTP) Delivery of Organizations Value Proposition to Stakeholders Transactional Systems 1...n OLTP = on line transaction processing (aka ERP, CRM, etc.) Management Process (OLAP) OLAP = on line analytical processing (aka data warehouse) Analyze Measure Gain Understanding…Answer Your Questions OK we’ve just finished a deep dive on measurement – down at the detailed level. Before we wrap up our discussion of measurement let’s look at measurement from the big picture / macro perspective – more at the top-level process level. Key: OLTP (on-line transaction processing) systems are usually used to support the transactional or operational variants (aka transactional sub-processes) of the Core (Customer facing) and other types of top-level processes in an organization. OLAP (on-line analytical processing) systems are usually used to support the management variants (aka the management sub-processes) of the Core and other types of top-level processes in an organization. Review Slide The management sub-processes measure, analyse and report on the transactional sub-processes. Then the management sub-processes planning acitivities will make decisions to alter the plans based on the analysis which drive the operational sub-processes. For example in an Order to Cash top level process there will be (transactional) sub-processes which are focused on taking the customers order, ensuring it is manufactured, shipped, billing and the cash received. In addition there will (management) sub-processes which are focused on understanding how many orders are flowing through the process per minutes, hour, day, week, month, quarter and trying to forecast what will happen and identify problems / opportunities for improvement. The result of these analyese will result in new plans / targets being set and associated changes being made (increasing or reducing staff, increase or decreasing inventory levels, etc.) in order to optimize the delivery of the customers value proposition. See more notes on next (Hidden) Slide THE data warehouse Plan Make Decisions… Decide How to Action Them

So…You Need a Data Warehouse (DW)… How Do You Get One So…You Need a Data Warehouse (DW)… How Do You Get One? OR Eight Steps to DW Heaven * Like data warehouse technology sales people!

First…Recognize That Building a DW is Both Like and Unlike Building Other Systems… Same…you need: People… who will be using the DW to be involved in designing, building and using it IF you want an ROI Process… a great project manager who has the right plan – one customized to your needs Technology… and great people who know the technology Don’t let anyone* tell you they know the answer and can “magically” give it to you tomorrow Different… The users of a DW are YOU… Your leaders / managers! Are you ready to get involved? Are you ready to change – make decisions using the information from the DW? You really don’t know what you don’t know about your data… get comfortable learning by iterating You’ll spend less and get more * e.g. DW technology vendor or consulting vendor sales people! There is a lot of hype out there. Be cautious.

Second… Do You Have the Skills You Need? Enterprise Data Architect* (Data Modeller, Entity Relationship Diagrams) * Technological owner of the definition of all data in the transactional and DW systems – not to be confused with the business owner DW Tool Experts ..In which ever tool you decide to use† Business Intelligence (BI) Business Analyst Understands the tech… But can learn, understand and talk to you YOUR business, YOUR questions Your Management Team Whose questions is the reason the DW is being Built! DataBase Knowledge (Oracle, SQL Server, etc.) DBA (Administrator) Query Writer (SQL etc.) DW Infrastructure Specialists (Hardware, Operating System, etc.) Plan to get the skills you need at the right time… 1a. BI Business Analyst, 1b. Enterprise Data Architect 2. Tool Expert † You’ll need more of this resource early on than later… so perhaps a good place to use contractors/consultants … at least initially and only hire later when you know the level of on-going need for this skill set

...Third… Decide on What To Do First… Make a list of the most pressing questions you can’t answer today Get your management team involved Brainstorm what the answers might be… and what you will change if that answer turns out to be correct Get everyone used to the idea that the DW is a tool which will become a normal part of the process by which you will: Make decisions , Make and prioritize plans for improving your business Drive change, Measure management team member success Determine the potential benefits if you were to decide to implement the changes… Your Business Case Which answers would drive the biggest benefits? Pick the n questions whose answers would enable the biggest benefits 2 < N < 10 Drop the rest for now… you need to focus… DON’T get side tracked…focus! Make a list of the most pressing questions you can’t answer today At all Frequently enough Accurately enough Get your management team involved Brainstorm what the answers might be… and what you will change if that answer turns out to be correct Get your management team involved NOW… if they don’t buy in you are wasting your DW investment Get everyone used to the idea that the DW is a tool which will become a normal part of the process by which you will: Make decisions Make and prioritize plans for improving y our business Drive change Measure management team member success Determine the potential benefits if you were to decide to implement the changes This is your business case What are the benefits of each change? Which answers would drive the biggest benefits? Pick the n questions whose answers would enable the biggest benefits 2 < N < 10 Drop the rest for now… you need to focus… DON’T get side tracked…focus! Yes some people will be unhappy that “their” questions won’t be tackled in first wave… get over it

Fourth…Which Data Do You Need ? Mock-up the output from the DW which will answer each question Show it to everyone… listen to the feedback Remember why you are doing this… it is the ability to decide to change as a result of answering the question that matters not what the screen / report looks like! How do you “calculate” each part of the answer? Get everyone involved in figuring this out… Decide which senior manager “owns” each calculation / measure What data is required? Identify both the master data (customer master, product master, etc.) and the transactional data (sales orders, payments, etc.) You do have an enterprise data architect and model don’t you? If not time to: Hire that person Build that model…and have your transactional systems people keep it up to date Mock-up the output from the DW which will answer each question Show it to everyone… listen to the feedback Confirm that if the output of the DW looks like the mock-up people will be able to answer the questions and make the decisions they want to be able to make Remember why you are doing this… it is the ability to decide to change as a result of answering the question that matters not what the screen / report looks like! How do you “calculate” each part of the answer? Get everyone involved in figuring this out… otherwise you’ll end up in arguments over whether the answer is valid rather than using the answer to make decisions and drive change Decide which senior manager “owns” each calculation / measure Who will defend the answer? Who will decide when the calculation needs to be changed / improved? What data is required? Identify both the master data (customer master, product master, etc.) and the transactional data (sales orders, payments, etc.) You do have an enterprise data architect and model don’t you? If not time to: Hire that person Build that model…and have your transactional systems people keep it up to date

Fourth (part Deux)… Where Is Your Data? Where is the data? Does it exist… ask the enterprise data architect… if not can you substitute… Start the feedback loop to future versions of your transactional systems Is that data clean (consistent), what anomalies exist in that data? Normal to find data isn’t clean… expect work arounds Ensure everyone knows how these imperfections will impact the accuracy of the answer Who in the business is responsible for cleansing the data and keeping clean

…Fourth (part Trois)… Get the Tech Ready Set a budget for the tech Use the benefits of answering the first group of questions to decide how much to spend Decide on the tools Ensure they can grow with you Get help deciding Hire the DW tools expert(s) The people who know the tool you’ve chosen Consider contractors or consultants Set the technology standards and design principles Will help ensure reliability, flexibility, agility Response time, refresh frequency, Data Integrity, Data Security, Disaster Recovery, Business Continuance Change Control (Development, Test, Production, Training)

The (Tech) Parts of a Data Warehouse The Data Warehouse DW Control & Operations Data Includes: scheduling, reporting schedules, extract, transformation, refresh, DR rules etc. These need to be easy to use… Management will the users! Manual Data Maintenance Meta Data† † Data about the data in the ODS and Cubes – to allow users to understand, and “self document” their queries Dimension Tables (Shared) Your Transaction Systems (OLTP) Transactional Databases Operational Data Store (ODS) aka Persistent Staging Area (PSA) Clean Master Data Read the cubes End User Query & Reporting Tools Clean Trans-actional Data Cube 1* Cube 2 Cube n * Each cube (aka Data Mart) has the” fact” tables containg the transactional data transformed and the applicable dimension tables to answer groups of related questions Extract, Transform and Load (ETL)‡ ‡ Includes: matching, cleansing, versioning. Can be “pull” or “push” depending on volumes Read from & Write results to cubes End User Analytic Tools “Close Loop” Analytic Tools Make your decisions “real” by pushing process changes directly into operational / transactional systems. Can make changes in near real time! Technical tools – to be used by DW Tool Experts and BI Biz Analysts

…Fifth… Build & Test a Prototype …Iteratively Document the detailed design (BI Analyst and DW Tool Techs) Take earlier mock-ups (step 4) and build a prototype focus on getting to 80% Expect the build to take 5-10x longer to answer first 2-3 questions than questions 3-10 There is a lot of one time set-up… Make sure your tech team know this is a prototype… Test with users Set expectations… people should be starting to get excited that its becoming real… not upset because its not perfect! Can your management now answer the questions they had and (more important) can they make (and execute) the decisions based on the answers Expect this step to take 2-12 weeks depending on complexity Time box it at 12 weeks… reduce the scope Document the detailed design (BI Analyst and DW Tool Techs) Take earlier mock-ups (step 4) and build a prototype focus on getting to 80% Expect the build to take 5-10x longer to answer first 2-3 questions than questions 3-10 There is a lot of one time set-up… Make sure your tech team know this is a prototype… they will have time to productionize later… much will need to change based on what the users learn Test with users Set expectations… people should be starting to get excited that its becoming real… not upset because its not perfect! Can your management now answer the questions they had and (more important) can they make (and execute) the decisions based on the answers Expect this step to take 2-12 weeks depending on complexity Time box it at 12 weeks… reduce the scope

…Sixth… Use it… Make Decisions (Mistakes)… Learn… Improve Get the prototype so it can deliver answers Expect regular manual effort Cleansing data, running ETL or reports, hand holding users if UI isn’t perfect Use the answers to make and execute decisions Have a party… you have the start of a successful data warehouse! Learn After 2-12 weeks (depends on decision frequency) have a formal lessons learned Are you getting the benefits you expected? Involve everyone – leaders, managers, BI Analyst, Tech team… you all need to understand the challenges and what’s been learned This is the difference between great high performing (BI/DW) teams and mediocre ones… their ability to learn together Plan the changes needed to have the DW reliably deliver the answers

…Seventh… Productionize… The real go live… Formal training of users Prepare documentation Make them confident in the use of the tools Empower them with great meta data Mechanising manual steps Scheduling of all the jobs make it run smoothly No wee hours of the morning handholding! Implement disaster recovery If your DW is enabling operational decisions your DW is mission critical! Party some more You’ve been working hard at this for 3-8 months at this stage and can now see real benefits happening! Every day your decisions are being informed by your data via the DW

… Eighth… Now… Which Questions Didn’t You Answer?… Start Wave #2 Build on your success… Go back to Third step and repeat…but go faster! Remember incremental cost of answering more questions and adding more data to the warehouse is much lower than the first wave due to: One time investments made The experience you have

You Are On Your Way to a Sustainable Business Having all management decisions and strategic planning informed by a single, consistent set of information which your management team all align around is the basis for the long term health of your business The DW is just a tool, but one which when used with significant management / leadership discipline can make the difference between long term success and failure The DW is not a one-time project… it is a tool to enable a better way of running your business… forever! Expect to want to evolve it as you and the DW tools become more and more sophisticated

Thank-you… download slides from http://www.EdwardJames.biz/documents