Presentation is loading. Please wait.

Presentation is loading. Please wait.

UK Data Warehouse Work 23 rd May 2012 Paul Tutton, Sarah Ravenhill.

Similar presentations


Presentation on theme: "UK Data Warehouse Work 23 rd May 2012 Paul Tutton, Sarah Ravenhill."— Presentation transcript:

1 UK Data Warehouse Work 23 rd May 2012 Paul Tutton, Sarah Ravenhill

2 Outline 1.Background 2.Approach 3.Warehouse Concepts 4.Prototyping & Modelling 5.Data Harmonisation 6.Recommendations and Next Steps

3 1. Background Other Services Data Sources Data Repository Staging Operational Data Store Data Consumers

4 2. Approach What do we want? How do we want to work? Does that work? Build it and see What can we put in there? How would we implement one? What are the costs and benefits?

5 3. What and How Define Interrogate Store Data And Metadata Validate Derive Aggregate Input & Update Extract Find Gaps

6 4. Build It… Integrate data from multiple sources Make extracts to support current and new statistics Define a method for describing extracts Identify gaps in extracts Automate choice between or combination of sources

7

8

9

10 Source Level Indicators

11 Variable Level Indicators Rate my data – what are we consistently suspicious of?

12 4. …and See Warehouses work Statistical processes must change Shared Information Models are important Think about the minimum acceptable amount of data

13 5. Assess Potential Conceptual Overlap Meaning of the Data Conceptual Overlap Meaning of the Data Dataset Shape Shape of the population Dataset Shape Shape of the population Statistical Activity Process surrounding the data Statistical Activity Process surrounding the data Harmonisation Analysis

14 5. Analysis Steps List your sources Describe variables Pool the list Find the concepts Classify variables Assess results

15 5. Overlap findings Exact Replication Conceptually Close Otherwise Derivable General Feasibility Combinations Small numbers found

16 Employee Count Employment Foreign Investment Hours/ Pay Pension Schemes Employee Count Employment Foreign Investment Hours/ Pay Pension Schemes 5. Example Concepts Acquisitions/ expenditure Business Operation Business Structure Comments/ Narrative Disposals/ Income Acquisitions/ expenditure Business Operation Business Structure Comments/ Narrative Disposals/ Income Profit/ Loss Statistical Units Stock Taxes/ National Insurance Turnover

17 5. Interview Findings Pooling data: May assist imputation Enables consist stories across outputs Allows congruence checking at unit level Is more useful if it exposes timelier sources to output managers Is of more benefit for some subjects than others (e.g. employment over finance)

18 6. Recommendations and Next Steps Continue development of CIM Analyse extent of process change due to movement away from survey silos Implement a warehouse in stages: Integrate storage first De-duplicate and harmonise once integration is complete Consider the addition of statistical processing facilities to reap further benefits


Download ppt "UK Data Warehouse Work 23 rd May 2012 Paul Tutton, Sarah Ravenhill."

Similar presentations


Ads by Google