An Introduction to the Data Documentation Initiative (DDI) ICPSR OR Meeting 2001 Wendy L. Thomas Data Access Core Director William C. Block Information.

Presentation transcript:

An Introduction to the Data Documentation Initiative (DDI) ICPSR OR Meeting 2001 Wendy L. Thomas Data Access Core Director William C. Block Information Technology Core Director Minnesota Population Center 26 October 2001

What is the DDI …
DDI = Data Documentation Initiative
XML = eXtensible Markup Language
DTD = Document Type Definition
Archive quality machine readable metadata designed to be human AND computer understandable and processable
… and so much more

…and why is it important to you?
Increases the depth of access to your collection
Allows sharing of discovery tools
Allows functional sharing of all metadata materials
Encourages cooperative metadata collection development
Encourages FULL documentation of data

Jakob Nielsen, Distinguished Engineer at Sun Microsystems "XML is one of the greatest advances in the Web in a long time. Whereas most other Web innovations since 1993 have focused on glitz and on making superficially glamorous but useless fancy layouts, XML attacks the usefulness of the Web by adding structure and meaning to its vast seas of information."

Stewart Brand, Founder of the Whole Earth Catalog "Perpetually obsolescing and thus losing all data and programs every 10 years (the current pattern) is no way to run an information economy or a civilization."

Brian Behlendorf, President, Apache Software Foundation "XML has become increasingly crucial throughout the software industry, as well as the Open Source community, as a non-proprietary method of storing and exchanging complex data."

James Clark, interview with Dr. Dobb's Journal "[What's the next step for XML?] That's a difficult question...it's like asking me, 'What's the next application for ASCII text?'"

The Session:
XML & where you might encounter DDI
The ‘Bill’ Experience: helping the hapless
Using and exploiting DDI compliant files
Managing large scale coding projects
Tools of the trade
Questions

XML basics
XML is to a document’s intellectual content what HTML is to the physical structure of that document
Elements
Attributes
Attribute types (imposing controls)
Hierarchies and nesting
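
The terms on this slide can be illustrated with a small sketch. The markup below is hypothetical (the tag and attribute names are invented for illustration and are not taken from the DDI DTD). It uses Python's standard xml.etree.ElementTree module to show elements, their attributes, and how nesting forms a hierarchy. Attribute types, which a DTD uses to impose controls, are not checked by this parser and are not shown.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup, invented for illustration; not DDI.
SAMPLE = """
<survey ID="S1">
  <title>Example Survey</title>
  <question number="1" type="numeric">
    <text>How many persons live in this household?</text>
  </question>
</survey>
"""

root = ET.fromstring(SAMPLE)


def outline(element, depth=0):
    """Return one line per element: indentation shows nesting,
    the trailing dict shows that element's attributes."""
    lines = ["  " * depth + element.tag + " " + str(dict(element.attrib))]
    for child in element:
        lines.extend(outline(child, depth + 1))
    return lines


print("\n".join(outline(root)))
```

The tags name what each piece of content is (a title, a question) rather than how it should look on screen, which is the contrast with HTML drawn above.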

World Population Table Example of Final Proposed Aggregate Tagging Model Wendy L. Thomas 13. June 2001

Age

Population by Gender, Continent, and Year Persons

Is XML DDI?
The term ‘DDI’ is often used to refer to the specific XML document type definition file(s) created to describe social science data files
Understanding the basics of XML will help you understand the ‘DDI’

Where you might encounter DDI
DDI compliant documents distributed with data
Creating DDI codebooks for your own collection
Assisting researchers with creating DDI codebooks for their own research projects

The ‘Bill’ Experience: helping the hapless :-)
What I was doing
Why I documented using DDI
Issues raised in this experience:
–Broad to specific or specific to broad?
–The glories of the ID attribute
–OR’s support role

Specific to Broad Learning:
Learning every element at once is NOT recommended
This goes on for 6 pages in 10 point type

Broad to Specific Learning:
Learn one section at a time
Document Description: Items describing the marked-up document itself as well as its source documents
Study Description: Items describing the overall data collection (title, citation, methodology, study scope, data access, etc.)
Data Files Description: Items relating to the format, size, and structure of the data files (physical descriptions)
Variables Description: Items relating to variables in the data collection (logical descriptions)
Other Study-Related Materials: Other study-related material not included in the other sections (bibliography, separate questionnaire file, etc.)
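
As a rough sketch of the five sections, the Python snippet below builds an empty codebook skeleton with one placeholder element per section. The tag names (codeBook, docDscr, stdyDscr, fileDscr, dataDscr, otherMat) are an assumption based on the DDI codebook DTD and are not given on the slide; check them against the Tag Library before relying on them.

```python
import xml.etree.ElementTree as ET

# Section tag names are an assumption based on the DDI codebook DTD;
# verify them against the Tag Library.
SECTIONS = [
    ("docDscr", "Document Description"),
    ("stdyDscr", "Study Description"),
    ("fileDscr", "Data Files Description"),
    ("dataDscr", "Variables Description"),
    ("otherMat", "Other Study-Related Materials"),
]


def build_skeleton():
    """Create an empty codebook with one placeholder element per section."""
    codebook = ET.Element("codeBook")
    for tag, description in SECTIONS:
        section = ET.SubElement(codebook, tag)
        # An XML comment records which section this placeholder stands for.
        section.append(ET.Comment(description))
    return codebook


skeleton = build_skeleton()
print(ET.tostring(skeleton, encoding="unicode"))
```

A skeleton like this lets a newcomer fill in one section at a time while the overall document shape stays fixed.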

Lowering the Learning Curve: Creating customized views and subsets

January 1, 1879
June 1, 1880
November 1, 1989
July 21, 1993
August 1, 1990
July 21, 1998
The resident rural population of the United States on June 1, 1880 living in sampled states and counties. agline > 0. Owners, Tenants, or Managers of farms greater than 3 acres in size or producing and selling at least $500 in product during the year.
census/enumeration data

Number of persons in household. public Respondent Person 9999 missing 23806

Value of farm, including land, fences and building. public Respondent Farm Farm Values. Of farm, including land, fences and buildings. Dollars 2006

The BIGGEST Lesson The importance of the TAG LIBRARY!! “If you could only take one thing to a deserted island to do DDI…make it the Tag Library.”

Using/Exploiting DDI compliant files
The key lies in uniformity and consistency within an XML instance or within a series
Never forget that a computer as well as a human being will be reading this
–Element contents are for people
–Attribute contents are for machines
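
The split between element contents and attribute contents might look like this in practice. This is a minimal sketch with invented tag and attribute names (variable, label, missingCode) and an invented variable name (HHSIZE); none of it is quoted from the DDI DTD. The program reads the attributes for its own logic and the element text only for display.

```python
import xml.etree.ElementTree as ET

# Hypothetical variable description; all names are illustrative only.
SAMPLE = """
<variable ID="V1" name="HHSIZE" missingCode="9999">
  <label>Number of persons in household.</label>
</variable>
"""

variable = ET.fromstring(SAMPLE)


def is_missing(var_element, raw_value):
    """Machine use: compare a raw data value with the coded attribute."""
    return raw_value == var_element.get("missingCode")


def caption(var_element):
    """Human use: build a display caption from the element's text content."""
    label = var_element.findtext("label").strip()
    return f"{var_element.get('name')}: {label}"


print(caption(variable))
print(is_missing(variable, "9999"))
```

Because the software depends on exact attribute values, attribute contents need to be uniform and consistent across a file or series, whereas the wording of a label can vary without breaking anything.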

The Concept of Inheritance
The idea that lower elements within an intellectual tree ‘inherit’ the attributes of the higher levels unless a new value is provided
Inheritance allows you to:
–Increase uniformity
–Reduce entry time
–Speed up processing
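
One way to picture this rule is a lookup that starts at an element and walks up the tree until it finds the attribute. The sketch below uses hypothetical markup (the codebook, section, and variable tags and the source attribute are invented for illustration). An XML parser does not apply inheritance on its own, so the software reading the file has to implement a rule like this one.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: only variable "B" overrides the value set at the top.
SAMPLE = """
<codebook source="producer">
  <section>
    <variable name="A"/>
    <variable name="B" source="archive"/>
  </section>
</codebook>
"""

root = ET.fromstring(SAMPLE)

# ElementTree elements do not record their parent, so build a lookup table.
parent_of = {child: parent for parent in root.iter() for child in parent}


def inherited(element, attribute):
    """Return the nearest value of `attribute`, checking the element
    itself first and then each ancestor in turn; None if never set."""
    node = element
    while node is not None:
        if attribute in node.attrib:
            return node.attrib[attribute]
        node = parent_of.get(node)
    return None


for var in root.iter("variable"):
    print(var.get("name"), inherited(var, "source"))
```

Variable A picks up the value set on the top-level element, while variable B keeps its own, so a value entered once at a high level covers everything beneath it that does not say otherwise.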

Looking for inheritance options
Within a single XML instance
–Within an element type
–Within a section
–Within the ‘codebook’
Within a series of XML instances
–External references
–Cut and paste

The power of the ID attribute
Every element should have an ID
Developing a schema for IDs
IDRef and IDRefs:
–sdatRef
–methRef
–pubRef
–Others (var, nCube, varGrp, locMap…)
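
The ID and IDREF/IDREFS mechanism can be sketched as an index from ID values to elements plus a function that follows a reference attribute. The attribute names methRef and pubRef come from the slide; the surrounding tags and the content are hypothetical, and treating an IDREFS value as a space-separated list follows the XML specification.

```python
import xml.etree.ElementTree as ET

# Hypothetical instance; only methRef and pubRef are names from the slide.
SAMPLE = """
<codebook>
  <method ID="M1">Sample survey</method>
  <publication ID="P1">First report</publication>
  <publication ID="P2">Second report</publication>
  <variable ID="V1" methRef="M1" pubRef="P1 P2"/>
</codebook>
"""

root = ET.fromstring(SAMPLE)

# Index every element that carries an ID so references resolve in one step.
by_id = {el.get("ID"): el for el in root.iter() if el.get("ID") is not None}


def resolve(element, ref_attribute):
    """Follow an IDREF/IDREFS attribute (space-separated IDs) and return
    the referenced elements; an absent attribute yields an empty list."""
    return [by_id[ref] for ref in element.get(ref_attribute, "").split()]


variable = by_id["V1"]
print([el.text for el in resolve(variable, "pubRef")])
```

Only elements that carry an ID can be the target of a reference, which is one practical reason to give every element an ID and to settle on a naming pattern for IDs early.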

Managing large scale coding projects
The order of things: complete a document vs. completing all like parts
Specialization: everyone learn everything vs. creating section experts
Notification: automatic notification of step completion
Training: mid-process training
Contact: established “chain of command”
Models: creating a “Model Book”

The World According to the Unfortunates
Is MADDIE the tool we want to use?
Will there be models to guide our work?
What’s the difference between universe and measurement unit?
How uniform do the lettered/numbered variables need to be?
Are there standard names for geography levels?
When do I use category and when cohort?
At what level do we describe units of measurement?

Tools of the Trade
Free Resources
Commercial Resources
Plug-ins to Word
DDI specific editors
–NESSTAR
–MADDIE

Free Resources
1. XED: Best for small to medium sized XML documents; does not validate
2. MERLOT: Runs on any Java 2 virtual machine; extensible via custom editor interface
3. SIXPACK: Works on Macintosh
4. Others worth checking out:
–LOGILAB’s XML Editor xmleditor.html
–VISUAL XML

Commercial Resources
1. AuthorIT: Ideal for large multi-user documentation projects
2. X-Ray XML Editor: Diagnoses XML errors in real time
3. Xmetal: “open and scriptable” development environment
4. XMLwriter: Customizable interface
5. Morphon XML-Editor (index.shtml): Multi-platform
6. XML Spy 4.0 Document Editor (products_doc.html/): For non-tech types

Plug-ins to Word
1. B-Bop Xfinity Author xW (products_xfinity_author_wX.htm): Unique “Save As” feature allows conversion to any DTD (Industry standard or user-defined)
2. WorX (nforamtion/xml/worxseOverview.xm l&display=information/xsl/default.xsl): Seybold Reports currently rate WorX as “the most sophisticated tool available for creating structured content in a MS Word environment”

DDI Specific Editors
NESSTAR Publisher
MADDIE
Followed by QUESTIONS

Wendy Thomas Bill Block