Corporate Data Vault Data Warehousing Workshop Sept. 23 2015 Data Warehousing Workshop Sept. 23 2015.

Slides:



Advertisements
Similar presentations
National Institute of Statistics, Geography and Informatics (INEGI) Implementation of SDMX in Mexico.
Advertisements

The creation of "Yaolan.com" A Site for Pre-natal and Parenting Education in Chinese by James Caldwell DAE Interactive Marketing a Web Connection Company.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
Information Technologies Page 1 Information Technologies Page 1 Information Technologies Page 1 Information Technologies Page 1Information Technologies.
State of Indiana Business One Stop (BOS) Program Roadmap Updated June 6, 2013 RFI ATTACHMENT D.
Staying In Technology’s Top Tier UCC Filings, Searches and Images.
<<Date>><<SDLC Phase>>
Ginnie Mae MISMO Adoption Update Tamara Togans and Nicole Jackson Ginnie Mae January 13, 2014 MISMO Winter 2014 Summit The MISMO Winter 2014 Summit Education.
Luxembourg, Ville Kajala Senior Officer on Transparency Directive Issues Pan-European Access to Financial Information Disclosed by Listed Entities.
Alternate Software Development Methodologies
Scottish Transport Appraisal Guidance - STAG Hugh Gillies Transport Scotland.
By Mary Anne Poatsy, Keith Mulbery, Eric Cameron, Jason Davidson, Rebecca Lawson, Linda Lau, Jerri Williams Chapter 9 Fine-Tuning the Database 1 Copyright.
Password?. Project CLASP: Common Login and Access rights across Services Plan
Humboldt University: A workflow model for digital theses and dissertations ETD A workflow model for digital theses and dissertations Developments.
Data format translation and migration Future possibilities Alasdair Crockett, Data Standards Manager UK Data Archive.
Fundamentals of Information Systems, Second Edition
1 Project Management & Project Management Software Yale Braunstein School of Information Management & Systems UC Berkeley.
Database System Development Lifecycle Transparencies
Chapter 1: The Database Environment
1 Components of A Successful Data Warehouse Chris Wheaton, Co-Founder, Client Advocate.
Michael Solomon Tugboat Software Managing the Software Development Process.
PROJECT OMNIGLEAN Team Members: Kenny Trytek Derek Woods Abby Birkett Joe Briggie Advisor: Simanta Mitra Client: Kingland Systems.
9 Feb 2004Mikko Mäkinen & Saija Ylönen Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Geneva, 9-11 February 2004, Topic (ii): Metadata.
Database Planning, Design, and Administration Transparencies
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Introducing... NPF Connect Press [Space Bar] to continue...
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
CO1301: Games Concepts Dr Nick Mitchell (Room CM 226) Material originally prepared by Laurent Noel.
Recordkeeping for Good Governance Toolkit Digital Recordkeeping Guidance Funafuti, Tuvalu – June 2013.
CS 360 Lecture 3.  The software process is a structured set of activities required to develop a software system.  Fundamental Assumption:  Good software.
CONCEPTUAL MODELLING OF ADMINISTRATIVE REGISTER INFORMATION AND XML - TAXATION METADATA AS AN EXAMPLE Ottawa, May 2005.
 A database is a collection of data that is organized so that its contents can easily be accessed, managed, and updated. What is Database?
Usability Issues Documentation J. Apostolakis for Geant4 16 January 2009.
University of Wisconsin System HRS Project Update to ITC November 19, 2010.
Software Project Documentation. Types of Project Documents  Project Charter  Requirements  Mockups and Prototypes  Test Cases  Architecture / Design.
© Copyright 2011 John Wiley & Sons, Inc.
Database System Development Lifecycle 1.  Main components of the Infn System  What is Database System Development Life Cycle (DSDLC)  Phases of the.
Visit our Focus Rooms Evaluation of Implementation Proposals by Dynamics AX R&D Solution Architecture & Industry Experts Gain further insights on Dynamics.
MIS 327 Database Management system 1 MIS 327: DBMS Dr. Monther Tarawneh Dr. Monther Tarawneh Week 2: Basic Concepts.
State of Wisconsin Department of Revenue Data Warehouse Presentation August 16, 2000.
Implementation Experiences METIS – April 2006 Russell Penlington & Lars Thygesen - OECD v 1.0.
1 XBRL Pilot Project at BOJ May 2004 Yoshiaki Wada Bank Examination and Surveillance Department Bank of Japan © 2004 Bank of Japan.
Fundamentals of Information Systems, Second Edition 1 Systems Development.
Copyright © The OWASP Foundation Permission is granted to copy, distribute and/or modify this document under the terms of the OWASP License. The OWASP.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
CSO ITSIP Project - implementation of new Data Management System (DMS) ITDG meeting, Luxembourg, October 2006 Presentation by Joe Treacy CSO, Ireland.
Equations for Ecademy Client: ISU Computation Center Faculty Advisor: Dr. Robert Anderson Technical Advisor: Dr. Pete Boysen Team Members:  Tim Arganbright,
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
1 PSI/PhUSE Single Day Event – SAS Applications – June 11, 2009 SAS Drug Development from the Inside Magnus Mengelbier Director.
2050AP Project WP5: “Conclusions” UPM Madrid 11 de Octubre 2013.
A S P. Outline  The introduction of ASP  Why we choose ASP  How ASP works  Basic syntax rule of ASP  ASP’S object model  Limitations of ASP  Summary.
Chapter 9 Database Planning, Design, and Administration Transparencies © Pearson Education Limited 1995, 2005.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Components of A Successful Data Warehouse
CENTRAL STATISTICS OFFICE IRELAND ITSIP PROJECT OVERVIEW
James Blankenship March , 2018


Chapter 1: The Database Environment
The Database Environment
Marine Environment and Water Industry Unit
BCS Template Presentation February 22, 2018
Proposal of a Geographic Metadata Profile for WISE
ITAS Cash Management Integration to an ERP
Proposed June 2020 UK Link Major Release Scope / Governance Timeline
The Database Environment
Presentation transcript:

Corporate Data Vault Data Warehousing Workshop Sept Data Warehousing Workshop Sept

Background to CDV Project Feb 2012 – Review of Corporate Data Model published Apr 2012 – Technical group set up Dec 2012 – Proposal for CDV sent to SMC

The Proposal Option 1 File Store Option 2 File Store with Direct Access Option 3 Database tables Option 4 Data warehouse  Data stored in the same format as lodged by the data custodian;  Data retrieved only through the front-end application and copied to local work space.  Data stored in the same format as lodged by the data custodian;  Data can be accessed directly by third party products (e.g. SAS).  Data converted and stored in database table with similar structure to source;  Database tables can be accessed directly by third party products (e.g. SAS).  Data converted and stored in standardised relational database tables;  Database tables can be accessed directly by third party products (e.g. SAS).

ProsCons Option 1: File Store  Simplest concept  Lowest development effort  No direct access with 3rd party products  Possible proliferation of copies of files in local work areas  Long term usability of data more difficult to manage Option 2: File Store Direct Access  Simple concept  Provides direct access to data  Security more difficult to manage than for database options  Long term usability of data more difficult to manage Option 3: Database tables  Provides direct access to data  Data stored in single platform  Easier to manage long term usability issues  Data transformed from original format – transformed data may need validation Option 4: Data warehouse  Provides direct access to data  Standardized data in relational databases  Enables easier linkages between data  Opportunities to build other applications on the warehouse  Data transformed from original format – transformed data may need validation  Difficult to design and build  Business effort high as data standardization required

Project Stage 1 Two Prototypes Early 2013, the SMC requested that working prototypes of both Option 2 and 3 be developed Prototypes were designed, built & tested between June and Oct 2013 A recommendation on the optimal solution was submitted to the SMC in Nov 2013.

Design, Build and Assessment In-scopeOut of Scope Focus of system developmentProduce a working systemFinal screen designs Functions of the system (1) Lodging data & metadata (2) Storing data & metadata (3) Viewing of catalogue (1) Security (2) Reports Testing of system Testing to focus primarily on the “happy path”. Only major bugs and issues to be addressed. Robust testing of the system File TypesSAS files only as (1) High risk (2) Benefit of variable metadata available within the file (3)Structured nature provided suitable test for both prototypes All other file types

Issues with Database Prototype IssueImpact on Database Prototype Unable to distinguish between a date and a date/time variable in a SAS dataset SAS dataset is rejected because the date/time column is created as a date and a date/time variable cannot be loaded into a date column. Maximum length of a character variable can be Character variables longer than will be truncated. Maximum number of columns currently allowed is 254 SAS dataset is rejected is the number of variables exceed 254 There are 995 different formats available in SAS Data integrity may be compromised or the dataset may be rejected if an unknown format is encountered. It would require each format to be coded for individually during conversion program.

Project Stage 2 CDV v1 Build & Design The second stage of this project involved the further design, build and testing of the file store solution. It also included information sessions to users and the initial “Go Live” of the CDV. This second project ran from Jan 2014 until Dec 2014.

Project Stage 3 CDV v1 Implementation The third stage of this project is ongoing since Jan 2015 Roll-out of the system across the office Requirements gathering and specifications for CDV v2.

About the CDV Independent of production processes Data stored in the same format as lodged Access data through a third party product CDV v1 accepts SAS datasets only

Technical Specs Three tier application Client tier: Java Business Logic tier: Weblogic Data Tier: Sybase database.

Functionality Lodge Data and Metadata Browse/Search the Catalogue Reports Security

Lodge Data and Metadata: Step 1

Lodge Data and Metadata: Step 2

Variable Details Screen Link Classification from CARS

Metadata Stored File Level Survey Name Periodicity Time Period Version No. Linked Themes Micro/Macro Data Reference Documentation Description Reason for Version Date Lodged Lodged By Variable Level Name Description Primary Key Unit Type Length Data Type Linked Classification Details

Lodgement Summary

Access To Data

The End Any Questions?