Presentation is loading. Please wait.

Presentation is loading. Please wait.

…optimise your IT investments Data Discovery Understanding data relationships Philip Howard Research Director – Bloor Research.

Similar presentations


Presentation on theme: "…optimise your IT investments Data Discovery Understanding data relationships Philip Howard Research Director – Bloor Research."— Presentation transcript:

1 …optimise your IT investments Data Discovery Understanding data relationships Philip Howard Research Director – Bloor Research

2 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Agenda What are data relationships and why are they important? Different approaches to discovering data relationships Features you might look for in a data discovery tool

3 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 What is a data relationship?  A relationship between database tables, either within or across databases  A relationship within or across non- relational data sources  A relationship between a relational and non- relational source Note that relationships may be complex and/or involve more than 2 elements

4 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships important? 1.Data migration

5 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships important? 2. Data archival

6 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships important? 3. Master data management

7 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships important? 4. Data governance

8 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships important? 5. Data modelling

9 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships important? 6. Business intelligence

10 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships important? 7 & 8 & 9 & … Data integration Legacy migration Data warehousing …

11 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Why are data relationships difficult? No definition exists across multiple sources Within a source many relationships are not explicit Ownership of relationships is diverse Many relationships are defined within application software and not in the data source

12 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Data relationships in place Different issues arise when you consider relationships within systems versus across systems

13 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Data relationships within systems Typical functions: Identification of primary-foreign key pairs Dependency analysis Redundant columns Usually provided through data profiling, which also provides error statistics

14 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Data relationships across systems Requirement for relationship discovery No requirement for error statistics Requirement for rule violations where this represents a violation of a cross- source relationship

15 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Specific requirements For MDM – overlap & precedence analysis, transformation & business rules and exceptions, outlier analysis, matching keys For data migration & archival – business entities

16 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 General functions Automation of MDM and Profiling functions Visualisation of relationships Semantics the semantic type of the data e.g. zip code context-free discovery – e.g. recognising that cust# is equivalent to custID Data classification: recognising the relationship between a pre-defined, business-user-maintained domain of values and the actual content of a field in order to identify the content of a field as well as unexpected values. Business glossary

17 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Tools Landscape

18 …optimise your IT investments Confidential © Bloor Research 2009 telling the Information Management story Confidential © Bloor Research 2009 Conclusion Understanding data relationships across data sources is important in many data management disciplines There are relatively few tools that are good at discovering such relationships – moreover, data discovery is a broad discipline and no one tool is good at all aspects of relationship discovery.

19 …optimise your IT investments Confidential © Bloor Research 2009


Download ppt "…optimise your IT investments Data Discovery Understanding data relationships Philip Howard Research Director – Bloor Research."

Similar presentations


Ads by Google