Resource Curation and Automated Resource Discovery.


NIF Resources
NIF is cataloging websites that house information about databases, atlases, software tools, data, transgenic mice, and other things that we consider of value to the neuroscience community.

Definition of Resource
Individual resource boundary: a site shall be considered an individual resource if it is maintained by a single entity and consists of one or more individual web pages that are related by a theme and HTML links.

Resource Nomination
Registry (4500)
Public Registry (2100)
NIF Web (499,952)
Level 2/3 (24)
User Feedback
*Automated tools
Web Crawl
Registry Subset
Nomination Check:
-Links
-Annotation
-Vocabulary
*Automated updates
Level 2 tools
*In Development

Resource is Nominated (NIF Staff, Contact at Meetings, Web Form)
In NIF already?
Assign Metadata
-short name, long name, url
-description (short description 1-3 sentences, longer description)
-parent organization (physical location, university)
-support (grant numbers)
-keywords (species, technique, structure, age, level, disease, topic)
Decision: Should it be included?
Assign resource type
Do not include, Keep Record
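The metadata fields listed above can be pictured as a single record per nominated resource. The following is a minimal sketch only: the class name `ResourceRecord`, the helper names, and the rule that short name, long name and url are all required are assumptions made for illustration, not a description of how the NIF registry is actually implemented. The only checks taken from the slide are the 1-3 sentence short description and the list of keyword facets.

```python
import re
from dataclasses import dataclass, field

# Keyword facets as listed on the slide.
KEYWORD_FACETS = ("species", "technique", "structure", "age",
                  "level", "disease", "topic")


@dataclass
class ResourceRecord:
    """Hypothetical container for the metadata a curator assigns."""
    short_name: str
    long_name: str
    url: str
    short_description: str
    long_description: str = ""
    parent_organization: str = ""                 # physical location, university
    support: list = field(default_factory=list)   # grant numbers
    keywords: dict = field(default_factory=dict)  # facet -> list of terms
    resource_type: str = ""                       # assigned after the inclusion decision


def count_sentences(text):
    """Crude sentence count: split on ., ! or ? followed by space or end."""
    parts = re.split(r"[.!?]+(?:\s+|$)", text.strip())
    return len([p for p in parts if p])


def metadata_problems(record):
    """Return a list of human-readable problems; empty means none found."""
    problems = []
    # Assumption: these three fields are treated as mandatory.
    if not (record.short_name and record.long_name and record.url):
        problems.append("short name, long name and url are all required")
    n = count_sentences(record.short_description)
    if not 1 <= n <= 3:
        problems.append(f"short description has {n} sentences, expected 1-3")
    unknown = sorted(set(record.keywords) - set(KEYWORD_FACETS))
    if unknown:
        problems.append("unknown keyword facets: " + ", ".join(unknown))
    return problems
```

A curator-facing tool could run `metadata_problems` before the "Should it be included?" decision, so that incomplete records are flagged rather than silently stored.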

Resources Difficult to Categorize
Link aggregates
Large organizations (NIH)
Poorly documented databases
Private data sites
Clinical trials that are still recruiting
–Experimental protocol
Commercial entities
Journals
–JOVE
–supplemental materials

CINdy the resource curation tool

Resource Ontology (BRO)
Data Resource: provides access to data; database, atlas, book
Software Resource: software programs or source code
Material Resource: reagents, tissue samples or organisms
Funding Resource: grants or contracts
Training Resource: educational materials, training programs
Job Resource: employment opportunities
People Resource: access to individual people’s web sites
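The seven top-level resource types above form a small closed vocabulary, which is easy to represent as an enumeration. This is an illustrative sketch only: the names `BroResourceType` and `parse_resource_type` are hypothetical, and the real ontology has subclasses that are not modeled here.

```python
from enum import Enum


class BroResourceType(Enum):
    """Top-level resource types, with the descriptions given on the slide."""
    DATA = "provides access to data; database, atlas, book"
    SOFTWARE = "software programs or source code"
    MATERIAL = "reagents, tissue samples or organisms"
    FUNDING = "grants or contracts"
    TRAINING = "educational materials, training programs"
    JOB = "employment opportunities"
    PEOPLE = "access to individual people's web sites"


def parse_resource_type(label):
    """Map a label such as 'Data Resource' or 'software' to a type.

    Raises ValueError for labels outside the seven top-level types.
    """
    key = label.strip().upper()
    suffix = " RESOURCE"
    if key.endswith(suffix):
        key = key[: -len(suffix)]
    try:
        return BroResourceType[key]
    except KeyError:
        raise ValueError(f"not a top-level resource type: {label!r}") from None
```

Rejecting unknown labels, rather than storing them as free text, keeps curator-assigned types comparable across records.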

NIF Service vs BRO Service

Solutions
Consolidating classes
Synonyms where appropriate, e.g. Material storage service vs. Material storage repository
Temporary mapping, where appropriate
–*Deprecated terms must be maintained*
Data loss
Moving forward with a joint descriptive terminology!

Evolution of the NIF Resource Ontology
Object: Materials (-Biomaterials -Reagents), Software, People, Grants, Jobs, Information
Function: Service (-Storage -Production), Funding, Job Service, Community-building
Target Audience: General, Kids, Student, Medical, Researcher
Data Type: Structured (-Database -Atlas), Unstructured (-Journal -Webpage)
Data Format: Text, RDF, Text, Picture, Video

Resource Boundary?
Software Library
–Software tool
Plugin: I2B2
Our solution: use url as a uniqueness qualifier
–Our problem: a single url may house several resources
–Individual plugins can have individual urls
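Using the url as a uniqueness qualifier, and the problem it runs into, can be illustrated with a short sketch. The normalization rules here (lowercase host, drop trailing slash and fragment) and the function names are assumptions chosen for the example; the example.org records are made up.

```python
from collections import defaultdict
from urllib.parse import urlsplit, urlunsplit


def url_key(url):
    """Normalize a url so trivially different spellings compare equal.

    Assumed rules: lowercase scheme and host, drop a trailing slash,
    drop the fragment. Query strings are kept as they are.
    """
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/")
    return urlunsplit((parts.scheme.lower(), parts.netloc.lower(),
                       path, parts.query, ""))


def shared_urls(records):
    """Group (name, url) pairs by url key; return keys used more than once."""
    groups = defaultdict(list)
    for name, url in records:
        groups[url_key(url)].append(name)
    return {key: names for key, names in groups.items() if len(names) > 1}


# Made-up example: a library and one plugin share a url, another does not.
records = [
    ("Library", "http://example.org/lib/"),
    ("Plugin A", "http://example.org/lib#a"),
    ("Plugin B", "http://example.org/lib/plugin-b"),
]
collisions = shared_urls(records)
```

Here `collisions` flags "Library" and "Plugin A" as sharing one key while "Plugin B" stays distinct, which is the case where a human curator still has to decide where the resource boundary lies.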

Boundary cont.
Individual resource boundary: a site shall be considered an individual resource if it is maintained by a single entity and consists of one or more individual web pages that are related by a theme and HTML links.
Solution to random boundary problem: Human Curator

Issues of Scope
Single line or short paragraph + keywords
–Resource discovery problem
*The Stanford ontologies description is very short (as are many); finding this resource by keyword will be difficult unless we index the content of the website.
Data dump
–Small vs. large databases
–Updates

Internal referencing
Stanford example:
–License: “same as bioportal”, which does not match any license type in any list.
–Problem: non-standard terminology and a reference to another project (no url), which can create loops. This is also true in publications, e.g. a paper used the same protocol as paper X, which used the same protocol as paper Y.
–Automated text mining tools have a hard time recognizing these
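The "same as ..." problem above is essentially a chain of references that may end in a real value, dangle, or loop back on itself. The sketch below follows such a chain and reports which of the three happened. It assumes the references have already been recognized and stored under lowercase project names, which is exactly the step the slide says automated tools struggle with; the function name and the placeholder license value are hypothetical.

```python
import re

# Matches values of the form "same as <other project>".
SAME_AS = re.compile(r"^\s*same as\s+(.+?)\s*$", re.IGNORECASE)


def resolve_reference(values, start):
    """Follow 'same as X' references starting from `start`.

    `values` maps a lowercase project name to either a concrete value
    or a string like 'same as other-project'.
    Returns (value, chain). `value` is None when the chain dangles
    (refers to an unknown project) or loops back on itself.
    """
    chain = []
    current = start.lower()
    while True:
        if current in chain:
            return None, chain + [current]   # loop detected
        chain.append(current)
        value = values.get(current)
        if value is None:
            return None, chain               # dangling reference
        match = SAME_AS.match(value)
        if match is None:
            return value, chain              # concrete value found
        current = match.group(1).lower()


# Placeholder data: the license value is invented for illustration.
licenses = {
    "project a": "same as project b",
    "project b": "EXAMPLE-LICENSE",
}
value, chain = resolve_reference(licenses, "project a")
```

Returning the chain alongside the value lets a curator see which intermediate records were relied on, and makes loops visible instead of hanging the tool.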

What can we gain from automated systems?
Basic information: name, url, contact info
Some keywords
Some descriptive text
No resource boundary
No resource description
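As a rough illustration of the "basic information" an automated pass can recover, the sketch below pulls a name, some keywords and some descriptive text out of a page's HTML head using only the Python standard library. It is not the NIF crawler; the class and function names are hypothetical, and it deliberately makes no attempt at the two things the slide says automation does not give us, the resource boundary and a curated description.

```python
from html.parser import HTMLParser


class BasicInfoParser(HTMLParser):
    """Collects <title> text and the description/keywords meta tags."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self.keywords = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attr = dict(attrs)
            name = (attr.get("name") or "").lower()
            content = (attr.get("content") or "").strip()
            if name == "description":
                self.description = content
            elif name == "keywords":
                self.keywords = [k.strip() for k in content.split(",")
                                 if k.strip()]

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_basic_info(html, url):
    """Return the name, url, keywords and descriptive text found in `html`."""
    parser = BasicInfoParser()
    parser.feed(html)
    parser.close()
    return {
        "name": parser.title.strip(),
        "url": url,
        "description": parser.description,
        "keywords": parser.keywords,
    }
```

Pages that omit these tags simply yield empty fields, which is one reason the extracted text is "some" keywords and "some" description rather than a usable catalog entry.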

How do we help the computers?
Common naming project (neurocommons)
Automated URIs
Community building:
–Shared data models
–Shared ontology
–RDF entity tags? (mouse vs mouse)
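The "mouse vs mouse" point can be made concrete with a toy lookup: the same label resolves to different identifiers depending on what kind of thing is meant. This sketch reads the slide as contrasting the organism with the computer device, which is an assumption, and the example.org URIs and function names are invented placeholders rather than identifiers from any real naming project.

```python
# Hypothetical identifiers; a shared naming project would supply real ones.
ENTITY_URIS = {
    ("mouse", "organism"): "http://example.org/entity/organism/mouse",
    ("mouse", "device"): "http://example.org/entity/device/mouse",
}


def tag_entity(label, kind):
    """Return the URI for a (label, kind) pair, or None if unknown."""
    return ENTITY_URIS.get((label.strip().lower(), kind))


def same_entity(uri_a, uri_b):
    """Two mentions refer to the same entity only if both URIs are known and equal."""
    return uri_a is not None and uri_a == uri_b


organism = tag_entity("Mouse", "organism")
device = tag_entity("mouse", "device")
```

Comparing `organism` and `device` by URI rather than by label is what lets software tell the two apart, which plain keyword matching cannot do.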