High-Level View of a Source-Centric Genealogical Model: “The Model with Four Boxes” Randy Wilson March 9, 2005.



Necessary Elements
- Source Authority
- Artifact Archive
- Structured Data Archive
- Family Tree

1. Source Authority
List of all known potential sources of genealogical data.
- Assign unique ID to each source.
- Provide way to find existing sources.
- Provide way to add new sources.
- Assign unique ID to each “page” of source.
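The four responsibilities listed for the Source Authority could be sketched roughly as follows. This is only an illustration, not the author's design: the class name `SourceAuthority`, its method names, the `S1` / `S1-P12` ID format, and the sample census title are all assumptions made for the example.

```python
from itertools import count


class SourceAuthority:
    """Hypothetical in-memory registry of sources and their pages."""

    def __init__(self):
        self._counter = count(1)   # assumption: sequential IDs suffice for a sketch
        self.sources = {}          # source_id -> title
        self.pages = {}            # page_id -> (source_id, page_number)

    def add_source(self, title):
        """Register a new source and return its unique ID."""
        source_id = f"S{next(self._counter)}"
        self.sources[source_id] = title
        return source_id

    def find_sources(self, text):
        """Return IDs of existing sources whose title contains `text`."""
        needle = text.lower()
        return [sid for sid, title in self.sources.items()
                if needle in title.lower()]

    def add_page(self, source_id, page_number):
        """Assign a unique ID to one page of a registered source."""
        if source_id not in self.sources:
            raise KeyError(f"unknown source: {source_id}")
        page_id = f"{source_id}-P{page_number}"
        self.pages[page_id] = (source_id, page_number)
        return page_id


authority = SourceAuthority()
census_id = authority.add_source("Example County census, 1880")  # made-up title
page_id = authority.add_page(census_id, 12)
```

Searching before adding (`find_sources` first, `add_source` only on a miss) is what would keep the same source from being registered twice.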

2. Artifact Archive
- Holds scanned images for each page of each source.
- Uses ID of page and source from Source Authority.
- Enables indexing or deep extraction over the internet.
- Enables verification and permanent preservation.
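A minimal sketch of an archive keyed by the page IDs that the Source Authority issues might look like this. The class `ArtifactArchive`, its methods, and the page ID `S1-P12` are hypothetical. The SHA-256 digest is an added assumption about one way to support "permanent preservation"; the slides do not specify any mechanism.

```python
import hashlib


class ArtifactArchive:
    """Hypothetical store of scanned page images, keyed by page ID."""

    def __init__(self):
        self._images = {}   # page_id -> (image_bytes, sha256 hex digest)

    def store(self, page_id, image_bytes):
        """Save a scan under the page ID issued by the Source Authority."""
        digest = hashlib.sha256(image_bytes).hexdigest()
        self._images[page_id] = (image_bytes, digest)
        return digest

    def fetch(self, page_id):
        """Return the scan after checking it still matches its stored digest."""
        image_bytes, digest = self._images[page_id]
        if hashlib.sha256(image_bytes).hexdigest() != digest:
            raise ValueError(f"image for {page_id} is corrupted")
        return image_bytes


archive = ArtifactArchive()
scan = b"placeholder bytes standing in for a scanned image"
archive.store("S1-P12", scan)
```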

3. Structured Data Archive
a.k.a.: “Raw Data”, “Source Data”, “Evidence Database”, or “Extracted Records”
- Accurately represents what a source says.
- Has unique ID for each “persona” (“name”/“reference to a person”) on each page.
- Contains names, dates, places, relationships, etc., that are clear from the source itself.
- Could possibly contain certain low-level assertions.
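One way to picture a persona record is sketched below. The `Persona` fields, the `extract_page` helper, the `-N1` ID suffix, and the sample rows are all invented for illustration; the slides only require that each persona on a page has a unique ID and carries what the source itself says.

```python
from dataclasses import dataclass, field


@dataclass
class Persona:
    """One reference to a person on one page, recorded as the source states it."""
    persona_id: str
    page_id: str
    name: str
    facts: dict = field(default_factory=dict)          # e.g. {"age": "34"}
    relationships: list = field(default_factory=list)  # (relation, other persona_id)


def extract_page(page_id, rows):
    """Turn transcribed rows into personas whose IDs are unique within the page."""
    return [
        Persona(
            persona_id=f"{page_id}-N{i}",
            page_id=page_id,
            name=row["name"],
            facts={k: v for k, v in row.items() if k != "name"},
        )
        for i, row in enumerate(rows, start=1)
    ]


# Made-up rows standing in for one household on a census page.
rows = [
    {"name": "John Doe", "age": "34", "birthplace": "Ohio"},
    {"name": "Jane Doe", "age": "30", "birthplace": "Ohio"},
]
personas = extract_page("S1-P12", rows)
personas[1].relationships.append(("wife of", personas[0].persona_id))
```

Values are kept as strings exactly as transcribed, so that no interpretation is applied at this stage.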

Census Image

Extracted Data

4. Family Tree
Represents our conclusions about who has lived and how they are related.
- Contains copies of information from the extraction archive, along with persona IDs that point to where the information came from.
- Links “personas” from different records together through relationships or merging/grouping.
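The merging/grouping idea can be sketched as a tree of concluded persons, each holding the persona IDs that back it. `TreePerson`, `FamilyTree`, the `parent_of` relation, and all IDs and names below are hypothetical; a real tree would need more relationship types and a way to undo a merge.

```python
from dataclasses import dataclass, field


@dataclass
class TreePerson:
    """A conclusion: one real person, backed by personas from the evidence."""
    name: str
    persona_ids: set = field(default_factory=set)  # pointers to extracted records


class FamilyTree:
    def __init__(self):
        self.people = {}        # person_id -> TreePerson
        self.parent_of = set()  # (parent person_id, child person_id) pairs

    def add_person(self, person_id, name, persona_id):
        """Create a tree person from a single extracted persona."""
        self.people[person_id] = TreePerson(name, {persona_id})

    def merge(self, keep_id, duplicate_id):
        """Group two tree persons judged to be the same individual."""
        duplicate = self.people.pop(duplicate_id)
        self.people[keep_id].persona_ids |= duplicate.persona_ids
        # Re-point any relationships that referred to the duplicate.
        self.parent_of = {
            (keep_id if p == duplicate_id else p,
             keep_id if c == duplicate_id else c)
            for p, c in self.parent_of
        }


tree = FamilyTree()
tree.add_person("P1", "John Doe", "S1-P12-N1")   # from a census page
tree.add_person("P2", "Jno. Doe", "S2-P3-N7")    # from a parish register
tree.add_person("P3", "Mary Doe", "S2-P3-N8")
tree.parent_of.add(("P2", "P3"))
tree.merge("P1", "P2")
```

After the merge, `P1` still points at both underlying personas, so anyone who disagrees with the conclusion can see exactly which records were combined.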

“To Do” List
- Locate all sources in the world and add them to the Source Authority.
- Extract all genealogical data from all sources (usually from scanned images).
- Link all extracted personas into the Family Tree.
- Verify that all of the above was done right.
- Perform all ordinances.

Tasks for Users
- Enter what they know.
- Extract/index data from images (perhaps in a locality of interest to them).
- Link extracted records into their line in the Family Tree.
- Do verification work on extraction, linking.
- Take a name to the temple.

Source Authority, cont’d
List of all sources, including, for example:
- Compiled family histories
- Records from courthouses and parishes
- Personal holdings (family bible, etc.)
- Cemeteries (pictures and/or transcriptions)
- Census records
- Memory of each user (i.e., personal knowledge)

Structured Data Archive
Important for:
- Computerized searching of data (traditional and record linkage).
- Pulling data into Family Tree for merging (grouping/linking).
- Knowing who on a page still needs work (and thus which sources still need work). Avoid missing people or repeating work forever.
- Simpler browsing of sources than using images.
- Additional context (e.g., who lives next door).
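The point about knowing who on a page still needs work amounts to a set difference between extracted personas and linked personas. A small sketch, assuming a hypothetical function `personas_needing_work` and made-up IDs:

```python
def personas_needing_work(personas_by_page, linked_persona_ids):
    """Map each page ID to the persona IDs on it not yet linked into the tree.

    Pages whose personas are all linked are left out of the result.
    """
    pending = {}
    for page_id, persona_ids in personas_by_page.items():
        unlinked = [pid for pid in persona_ids if pid not in linked_persona_ids]
        if unlinked:
            pending[page_id] = unlinked
    return pending


# Made-up data: two pages, three of four personas already linked.
personas_by_page = {
    "S1-P12": ["S1-P12-N1", "S1-P12-N2"],
    "S2-P3": ["S2-P3-N7", "S2-P3-N8"],
}
linked = {"S1-P12-N1", "S1-P12-N2", "S2-P3-N7"}
pending = personas_needing_work(personas_by_page, linked)
```

Here only `S2-P3` remains in `pending`, so the first page (and any source made up only of finished pages) can be treated as done.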

Extraction Archive, cont’d
Also:
- Having an intermediate step separates extraction work from linking/merging work and other conclusions.
- This makes it simpler to verify each step.
- It also makes it clearer where differences of opinion are coming from.

Verification
- Currently thorough genealogists go back to the original source to confirm anything they find in an electronic database.
- This would take each person forever on the Family Tree.
- We must store the fact that each step has been verified so that eventually we can trust well-verified work and move on to something else.
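Storing "the fact that each step has been verified" could be as simple as a log of who confirmed which item at which step. The sketch below is an assumption throughout: the `VerificationLog` class, the step names, the user names, and especially the threshold of two independent verifiers, which the slides do not specify.

```python
from collections import defaultdict

REQUIRED_VERIFIERS = 2  # assumption: the slides give no threshold


class VerificationLog:
    """Hypothetical record of which users have verified which step of the work."""

    def __init__(self):
        self._verified_by = defaultdict(set)  # (step, item_id) -> set of users

    def record(self, step, item_id, user):
        """Note that `user` has checked `item_id` at the given step."""
        self._verified_by[(step, item_id)].add(user)

    def is_trusted(self, step, item_id):
        """True once enough distinct users have verified this step."""
        verifiers = self._verified_by.get((step, item_id), ())
        return len(verifiers) >= REQUIRED_VERIFIERS


log = VerificationLog()
log.record("extraction", "S1-P12-N1", "alice")
log.record("extraction", "S1-P12-N1", "alice")  # a repeat by the same user adds nothing
trusted_after_one = log.is_trusted("extraction", "S1-P12-N1")
log.record("extraction", "S1-P12-N1", "bob")
trusted_after_two = log.is_trusted("extraction", "S1-P12-N1")
```

Keying the log by step keeps extraction checks separate from linking checks, which matches the separation of the two kinds of work described earlier in the deck.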

“The Model with Four Boxes”
- Source / Evidence / Conclusions
- Record collections / extracted records / people

Summary
- Source-centric approach allows completeness without endless duplication of effort.
- Users can participate in various activities, but all of these can help move data from one point in the process to the next.
- Industry needs to think in a source-centric way to enable true collaboration.

Questions? Ideas?
Blogs:
- eatslikeahuman.blogspot.com
- source-centric-genealogy.blogspot.com