Chapter 6 Text and Multimedia Languages and Properties


Introduction
A document has a given syntax and structure. It also has semantics, and it may have a presentation style associated with it. Figure 6.1 depicts these relationships. A document can also carry information about itself, called metadata.

The syntax of a document can express different elements, such as structure, presentation style, and semantics. One or more of these elements may be given together. For example, a structural element (e.g. a section) can have a fixed formatting style.

The syntax of a document can be implicit in its content, or expressed in a declarative language or a programming language (PL). The current trend is to use languages that provide information on document structure, format, and semantics, and that are readable by both humans and computers. SGML is one such language.

Metadata
Metadata is data about data. Metadata associated with a text includes: author; date of publication; source of publication; document length (in pages, words, or bytes); document genre (book, article, memo). The Machine Readable Cataloging Record (MARC) is the most widely used format for library records.

On the Web, metadata is used for many purposes: cataloging; content rating (e.g. to protect children from reading some types of documents); intellectual property rights; digital signatures (for authentication); privacy levels (who should or should not have access to a document); applications to electronic commerce (EC); etc.

A new standard for Web metadata is the Resource Description Framework (RDF). RDF allows the description of Web resources. A description consists of nodes and attached attribute/value pairs. A node can be any Web resource, that is, anything identified by a URI (which includes URLs). Attributes are properties of nodes, and their values are text strings or other nodes.
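The node and attribute/value model can be sketched in a few lines of Python. This is only an illustration of the data model, not RDF syntax or any RDF library; the class `Node`, the helper `triples`, the example.org URIs, and the property names are all made up for the example, and cycles between nodes are not handled.

```python
from dataclasses import dataclass, field
from typing import Dict, Union


@dataclass
class Node:
    """A Web resource identified by a URI, with attached attribute/value pairs."""
    uri: str
    # Each value is either a text string or another Node.
    properties: Dict[str, Union[str, "Node"]] = field(default_factory=dict)


def triples(node):
    """Yield (node URI, attribute, value) statements reachable from node."""
    for attribute, value in node.properties.items():
        if isinstance(value, Node):
            # The value is itself a resource: refer to it by URI, then describe it.
            yield (node.uri, attribute, value.uri)
            yield from triples(value)
        else:
            yield (node.uri, attribute, value)


# Hypothetical resources: a document whose author is itself a node.
author = Node("http://example.org/people/jane", {"name": "Jane Doe"})
document = Node(
    "http://example.org/docs/chapter6",
    {"title": "Text and Multimedia Languages and Properties", "author": author},
)

for statement in triples(document):
    print(statement)
```

Each printed statement pairs a node with one attribute and one value, where the value is either a string or the URI of another node.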

Text
With the advent of computers, it became necessary to code text in binary digits. The first coding schemes were EBCDIC and ASCII. For internationalization, including East Asian languages such as Chinese or Japanese (Kanji), 16-bit Unicode (ISO 10646) exists.
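As a rough illustration of these coding schemes, the Python sketch below encodes the same characters in several ways. The codec names (`cp037` for one EBCDIC code page, `utf-16-be`, `utf-8`) are Python's own; the sample strings are chosen only for the example.

```python
# The same letter has different bit patterns in EBCDIC and ASCII.
ebcdic_a = "A".encode("cp037")   # cp037 is one EBCDIC code page
ascii_a = "A".encode("ascii")
print(ebcdic_a.hex(), ascii_a.hex())

kanji = "漢字"

# ASCII defines only 128 characters, so Kanji cannot be encoded with it.
try:
    kanji.encode("ascii")
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False
print("fits in ASCII:", ascii_ok)

# Unicode assigns each character a code point ...
code_points = [hex(ord(ch)) for ch in kanji]
print(code_points)

# ... and an encoding maps code points to bytes.
utf16 = kanji.encode("utf-16-be")  # one 16-bit unit per character here
utf8 = kanji.encode("utf-8")       # three bytes per character here
print(len(utf16), len(utf8))
```

The two Kanji characters take four bytes in UTF-16 and six in UTF-8, while neither fits in ASCII at all.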

Text Formats
There is no single format for text documents. In the past, IR systems would convert a document to an internal format, after which the content of the document cannot be changed. Current IR systems have filters that handle the most popular document formats, in particular Word, WordPerfect, and FrameMaker.
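The idea of format filters can be sketched as a registry that maps a format name to a function extracting indexable text. This is a minimal sketch under stated assumptions: the names `FILTERS`, `to_internal`, `plain_filter`, and `html_filter` are hypothetical, and plain text and HTML stand in for formats such as Word or FrameMaker, which need dedicated parsers not shown here.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the character data of an HTML document, dropping the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def html_filter(raw: bytes) -> str:
    """Extract the text of an HTML document and normalize whitespace."""
    parser = _TextExtractor()
    parser.feed(raw.decode("utf-8"))
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def plain_filter(raw: bytes) -> str:
    """Decode plain text and normalize whitespace."""
    return " ".join(raw.decode("utf-8").split())


# Hypothetical registry: format name -> filter function.
FILTERS = {"txt": plain_filter, "html": html_filter}


def to_internal(raw: bytes, fmt: str) -> str:
    """Run the filter registered for fmt and return text ready for indexing."""
    if fmt not in FILTERS:
        raise ValueError(f"no filter registered for format {fmt!r}")
    return FILTERS[fmt](raw)


print(to_internal(b"<h1>Text</h1><p>Formats vary.</p>", "html"))
```

Supporting another format then means registering one more function, without touching the rest of the system.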

Other text formats exist for document interchange. Rich Text Format (RTF) is used by word processors and has an ASCII syntax. Portable Document Format (PDF) was developed for displaying and printing documents. Multipurpose Internet Mail Extensions (MIME) is used to encode electronic mail.
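To make the role of MIME concrete, the sketch below builds a mail message with a text body and a binary attachment using Python's standard `email` package. The addresses, subject, file name, and attachment bytes are placeholders invented for the example.

```python
from email.message import EmailMessage

message = EmailMessage()
message["From"] = "author@example.org"   # placeholder address
message["To"] = "reader@example.org"     # placeholder address
message["Subject"] = "Chapter 6 notes"
message.set_content("Plain-text body of the mail.")

# Attaching binary data turns the message into a multipart MIME message.
message.add_attachment(
    b"%PDF-1.4 placeholder bytes",       # not a real PDF file
    maintype="application",
    subtype="pdf",
    filename="chapter6.pdf",
)

print(message.get_content_type())
for part in message.iter_parts():
    # Each part declares its own content type and transfer encoding.
    print(part.get_content_type(), part.get("Content-Transfer-Encoding"))
```

Printing `message.as_string()` shows the full encoded mail, including the boundary lines that separate the parts.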