Presentation is loading. Please wait.

Presentation is loading. Please wait.

Met a-data Resources in Europe: within NSIs and from Dosis Projects Wilfried Grossmann Department of Statistics and Decision Support Systems University.

Similar presentations


Presentation on theme: "Met a-data Resources in Europe: within NSIs and from Dosis Projects Wilfried Grossmann Department of Statistics and Decision Support Systems University."— Presentation transcript:

1 Met a-data Resources in Europe: within NSIs and from Dosis Projects Wilfried Grossmann Department of Statistics and Decision Support Systems University Vienna

2 29.3.2000Metadata Resources in Europe2 Contents  Introduction  Contents of Meta-data  IT- Structures for Meta-data  Processing Meta-data  Conclusions

3 29.3.2000Metadata Resources in Europe3 Introduction Continuing hot topics in the meta-data discussion  Content-orientation versus IT- orientation There is a lack of communication between these two groups

4 29.3.2000Metadata Resources in Europe4 Introduction  Meta-data providers versus meta- data users Who provides which type of information for whom?

5 29.3.2000Metadata Resources in Europe5 Contents of Meta-data What kind of objects should be documented?  Basic statistical structures  Variables  Values  Data sets ____________________  Statistical output  Statistical Systems  Statistical Processing

6 29.3.2000Metadata Resources in Europe6 Contents of Meta-data Approaches towards meta-data content  The template oriented approach  The data warehouse approach  The process oriented approach

7 29.3.2000Metadata Resources in Europe7 Contents of Meta-data The template oriented approach Templates defined by a number of working groups  For micro data and data sets DDI, Dublin Core  For (economic) macrodata OECD, IMF, ECE (Internet)

8 29.3.2000Metadata Resources in Europe8 Contents of Meta-data The template oriented approach The OECD Template: Concepts and sources Data Collection Data manipulation by national source Data quality Data Transmission International Standards Data Storage and Manipulation by OECD Output preparation and delivery by OECD

9 29.3.2000Metadata Resources in Europe9 Contents of Meta-data The template oriented approach The IMF Template: Coverage Periodicity Timeliness Quality of disseminated data Integrity of disseminated data Access by the public

10 29.3.2000Metadata Resources in Europe10 Contents of Meta-data The template oriented approach Although the OECD approach seems more reliable from statistical point of view, IMF is favoured at the moment by international organisations (EUROSTAT)

11 29.3.2000Metadata Resources in Europe11 Contents of Meta-data The warehouse approach  Integration of the data inside the NSIs in a data warehouse  Output and dissemination as first step  Meta-data are oriented towards the needs of the data warehouse

12 29.3.2000Metadata Resources in Europe12 Contents of Meta-data The warehouse approach Projects in this direction in many NSI Best documentation: Australian Office Definitional meta-data Procedural meta-data Operational meta-data Systems meta-data Datasets meta-data

13 29.3.2000Metadata Resources in Europe13 Contents of Meta-data The process oriented approach  Combines statistical and IT considerations  Statistical data are considered not as final products but as the result of a process chain  More detailed consideration of statistical terminology

14 29.3.2000Metadata Resources in Europe14 Contents of Meta-data The process oriented approach Starting point was the SCB-DOC model (Rosen and Sundgren, 1991) A sequence of templates accompanying the statistical production process Ongoing activities at Statistics Sweden A number of NSIs want to adopt the model

15 29.3.2000Metadata Resources in Europe15 Contents of Meta-data The process oriented approach The IDARESA model Object oriented representation based on SCB-DOC with emphasis on possible semi-automatic processing

16 29.3.2000Metadata Resources in Europe16 Contents of Meta-data The process oriented approach The US-Bureau of census model (Gillman, Appel et al. running project): Statistical system defined as an identifiable process.... to produce one or more deliverables

17 29.3.2000Metadata Resources in Europe17 Contents of Meta-data Summary Process oriented approach seems to be favourable for a number of reasons Two Examples: Classification servers Data Quality

18 29.3.2000Metadata Resources in Europe18 Contents of Meta-data Summary: Classification server A classification server should  Support unified use of terminology inside NSIs or international organisations  Support harmonisation between (international) standard classifications and locally defined (adapted) classifications

19 29.3.2000Metadata Resources in Europe19 Contents of Meta-data Summary: Classification server Requirements for a classification server A data base supporting easy and user friendly manipulation of hierarchy trees A mapping tool supporting the definition of correspondence tables between classifications A management strategy for implementation

20 29.3.2000Metadata Resources in Europe20 Contents of Meta-data Summary: Classification server Up to now only few successful implementations for partial solutions EUROSTAT (SIMONE-Server) New Zealand,

21 29.3.2000Metadata Resources in Europe21 Contents of Meta-data Summary: Data Quality Data Quality  Criteria for quality of statistics are well known (Relevance, accuracy, timeliness, accessibility, comparability, coherence, completeness)  The problem Achieve quality in the production process Document quality by appropriate meta-data

22 29.3.2000Metadata Resources in Europe22 Contents of Meta-data Summary: Data Quality Experience shows that documentation quality is rather poor as soon as it is separated from the production process Example for an integration project SIDI-approach by ISTAT

23 29.3.2000Metadata Resources in Europe23 IT Structures for Meta-data Internet and data warehouse offer new opportunities for  Meta-data and data repositories  Meta-data access and exchange Lead towards a more open policy in data dissemination

24 29.3.2000Metadata Resources in Europe24 IT Structures for Meta-data Meta-data repositories Approaches towards repositories  The thesaurus approach  The template oriented approach  The Data Warehouse oriented approach

25 29.3.2000Metadata Resources in Europe25 IT Structures for Meta-data Meta-data repositories Example for a thesaurus oriented approach EUROSTAT servers for concepts and definitions Advantage: available on the Internet Problem: Navigation not so easy

26 29.3.2000Metadata Resources in Europe26 IT Structures for Meta-data Meta-data repositories Contents –Descriptions (dictionaries) –Semantic (coverage, standard classifications coherence of information) –Administration (responsible persons) –Selection (keywords, search facilities)

27 29.3.2000Metadata Resources in Europe27 IT Structures for Meta-data Meta-data repositories Example for the template oriented approach StatBase: supporting access to meta-data as well as data and reports Meets quite well the requirements of OECD data template No direct connection between data and meta-data

28 29.3.2000Metadata Resources in Europe28 IT Structures for Meta-data Meta-data repositories Example for the warehouse oriented approach StatLine(CBS): Based on data access from multidimensional tables (cubes) Accompanying meta-information is only in Dutch Extraction of special meta-data items is not so easy as in StatBase

29 29.3.2000Metadata Resources in Europe29 IT Structures for Meta-data Meta-data access and exchange Ongoing work in access and exchange  New Standards for access and exchange  Accessing distributed sources  Combination of information

30 29.3.2000Metadata Resources in Europe30 IT Structures for Meta-data Meta-data access and exchange Actual trends in standardization Traditional standards for data and meta-data exchange like GESMES or CLASET will probably switch to XML-platform. New standards from the Object Management Group (OMG)

31 29.3.2000Metadata Resources in Europe31 IT Structures for Meta-data Meta-data access and exchange Example MOF (Meta Object Facility) –Extensible Framework for meta-data model definition –Programming interface for storage and access of meta-data –Integration facilities across domains But note: This is a general approach for warehouses not necessarily tied with statistics

32 29.3.2000Metadata Resources in Europe32 IT Structures for Meta-data Meta-data access and exchange Example for Accessing and processing distributed sources ADDSIA: Accessing and processing distributed sources for analysis purposes Minimum requirements for standardisation in advance Orientation towards statistical problems

33 29.3.2000Metadata Resources in Europe33 Processing Meta-data Goal  Data and meta-data are processed together 

34 29.3.2000Metadata Resources in Europe34 Processing Meta-data Advantages  Reduction of documentation effort  More consistency in meta-data Requirements  Software tools supporting this view  Operational models for meta-data

35 29.3.2000Metadata Resources in Europe35 Processing Meta-data Up to know only prototypes with emphasis on different aspects of processing  The planning approach  The throughput approach  The transformation approach

36 29.3.2000Metadata Resources in Europe36 Processing Meta-data The planning approach  Develop software tools (workbench) for setting up meta-data documentation BRIDGE/IMIM:  A desktop for planning surveys and statistical production  Meta-data generated in the planning phase are managed by the system  No data are processed

37 29.3.2000Metadata Resources in Europe37 Processing Meta-data The planning approach  Improvement and adaptation of meta-data models for new tasks like quality and use of administrative sources SIDI (Statistics Italy)  Integration of quality in the statistical production process  Standardization of the production process

38 29.3.2000Metadata Resources in Europe38 Processing Meta-data The throughput approach Use as much meta-data as possible from OldMeta-data to obtain NewMeta-data CBS (ongoing work):  Use BLAISE meta-data as input  Produce StatLine meta-data as output

39 29.3.2000Metadata Resources in Europe39 Processing Meta-data The transformation approach Define meta-data algorithms for all types of data algorithms  Throughput meta-data  Modified meta-data  New meta-data  Meta-data summarization

40 29.3.2000Metadata Resources in Europe40 Processing Meta-data The transformation approach IDARESA project Meta-data algorithms for elementary data base operations ISMIS Identification of added value in meta-data (new meta-data) Pursuit of the production process inside EUROSTAT

41 29.3.2000Metadata Resources in Europe41 Processing Meta-data The transformation approach

42 29.3.2000Metadata Resources in Europe42 Conclusions Is there progress in meta-data research and development? Yes, but rather slow because  There is a lack of co-ordination in research (Probably improved by a forthcoming meta-data working group)  There is an information gap between meta- data research groups and NSIs  NSIs seem to prefer their own solutions


Download ppt "Met a-data Resources in Europe: within NSIs and from Dosis Projects Wilfried Grossmann Department of Statistics and Decision Support Systems University."

Similar presentations


Ads by Google