Introduction to XML and TEI for Digital Archives Entities, ROMA, the ODD, and Transcribing Manuscripts.

Slides:



Advertisements
Similar presentations
XML-XSL Introduction SHIJU RAJAN SHIJU RAJAN Outline Brief Overview Brief Overview What is XML? What is XML? Well Formed XML Well Formed XML Tag Name.
Advertisements

CSCI N241: Fundamentals of Web Design Copyright ©2004 Department of Computer & Information Science Introducing XHTML: Module B: HTML to XHTML.
Delivering textual resources. Overview Getting the text ready – decisions & costs Structures for delivery Full text Marked-up Image and text Indexed How.
HTML. The World Wide Web Protocols Addresses HTML.
Intro to HTML Kin 260 Jackie Kiwata.
INTRODUCTION TO HYPERTEXT MARKUP LANGUAGE 1. Outline  Introduction  Markup Languages  Editing HTML  Common Tags  Headers  Text Styling  Linking.
XHTML Basics.
SPECIAL TOPIC XML. Introducing XML XML (eXtensible Markup Language) ◦A language used to create structured documents XML vs HTML ◦XML is designed to transport.
HTML/XML XHTML Authoring. Creating Tables  Table: An arrangement of horizontal rows and vertical columns. The intersection of a row and a column is called.
CS 898N – Advanced World Wide Web Technologies Lecture 21: XML Chin-Chih Chang
XML: New or Old? XML was not an extention of HTML That already existed! SGML (ISO 8879) XML was a simplification of SGML  80 / 20 rule  (80% of the features.
XML Primer. 2 History: SGML vs. HTML vs. XML SGML (1960) XML(1996) HTML(1990) XHTML(2000)
Document Type Definitions. XML and DTDs A DTD (Document Type Definition) describes the structure of one or more XML documents. Specifically, a DTD describes:
Introducing XHTML: Module B: HTML to XHTML. Goals Understand how XHTML evolved as a language for Web delivery Understand the importance of DTDs Understand.
Unit 4 – XML Schema XML - Level I Basic.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
XML Basics Hope Greenberg Center for Teaching & Learning.
Tutorial 1: Getting Started with HTML5
Web page - A Web page is a simple text file that contains HTML tags (code) that describe what should be displayed on the browser. -The Web browser interprets.
XP New Perspectives on XML Tutorial 4 1 XML Schema Tutorial – Carey ISBN Working with Namespaces and Schemas.
XML introduction to Ahmed I. Deeb Dr. Anwar Mousa  presenter  instructor University Of Palestine-2009.
Week 1.  Phillip Chee   Ext.1214 
XML - DTD. The building blocks of XML documents Elements, Tags, Attributes, Entities, PCDATA, and CDATA.
E0262 – MIS – Multimedia Storage Techniques XML (Extensible Markup Language)  XML is a markup language for creating documents containing structured information.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
 2002 Prentice Hall, Inc. All rights reserved.2 Chapter 2 — Introduction to HyperText Markup Language 4: Part I Outline 2.1Introduction 2.2Markup Languages.
1 HTML intro The development of HTMLThe development of HTML The transition from HTML to XHTMLThe transition from HTML to XHTML XHTML syntax, tags, and.
1 Web Developer Foundations: Using XHTML Chapter 2 Key Concepts.
XHTML. Introduction to XHTML What Is XHTML? – XHTML stands for EXtensible HyperText Markup Language – XHTML is almost identical to HTML 4.01 – XHTML is.
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
What is XML?  XML stands for EXtensible Markup Language  XML is a markup language much like HTML  XML was designed to carry data, not to display data.
15.1 Fundamentals of HTML.
HTML,DHTML & Javascript/Session1/1 of 39 Introduction and Basic Tags Session 1 of Using HTML, DHTML & JavaScript.
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
E0262 – MIS – Multimedia Storage Techniques XML (Extensible Markup Language  XML is a markup language for creating documents containing structured information.
XML - DTD Week 4 Anthony Borquez. What can XML do? provides an application independent way of sharing data. independent groups of people can agree to.
XP 2 HTML Tutorial 1: Developing a Basic Web Page.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation An Introduction to XML.
HTML: Hyptertext Markup Language Doman’s Sections.
Introduction to XML This presentation covers introductory features of XML. What XML is and what it is not? What does it do? Put different related technologies.
XML Instructor: Charles Moen CSCI/CINF XML  Extensible Markup Language  A set of rules that allow you to create your own markup language  Designed.
HTML ( HYPER TEXT MARK UP LANGUAGE ). What is HTML HTML describes the content and format of web pages using tags. Ex. Title Tag: A title It’s the job.
An Introduction to XML Sandeep Bhattaram
1 Dr Alexiei Dingli XML Technologies DTD. 2 Document Type Definition Defines –the legal building blocks of an XML document –the document structure –The.
XML for Text Markup An introduction to XML markup.
XML Design Goals 1.XML must be easily usable over the Internet 2.XML must support a wide variety of applications 3.XML must be compatible with SGML 4.It.
XML Basics A brief introduction to XML in general 1XML Basics.
Introduction to XML February 07, From HTML to XML As mentioned in previous classes, if you know HTML, then you already know XML… really! In this.
Kevin Murphy Basics of XML Masters Project CS 490.
Unit 10 Schema Data Processing. Key Concepts XML fundamentals XML document format Document declaration XML elements and attributes Parsing Reserved characters.
L. Anne Spencer (c) 2001 Basic Web Design Document, text, & layout formatting tags & attributes.
HTML Basics Computers. What is an HTML file? *HTML is a format that tells a computer how to display a web page. The documents themselves are plain text.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
ENG 5933 Humanities Computing Introduction to XML
225 City Avenue, Suite 106 Bala Cynwyd, PA , phone , fax presents… XML Syntax v2.0.
Well Formed XML The basics. A Simple XML Document Smith Alice.
Introduction to DTD A Document Type Definition (DTD) defines the legal building blocks of an XML document. It defines the document structure with a list.
Presentation On HTML & Podcast Done by: Shamelia Young & Sheriece Williamson.
DTD Document Type Definition. Agenda Introduction to DTD DTD Building Blocks DTD Elements DTD Attributes DTD Entities DTD Exercises DTD Q&A.
XP 2 HTML Tutorial 1: Developing a Basic Web Page.
XML CORE CSC1310 Fall XML DOCUMENT XML document XML document is a convenient way for parsers to archive data. In other words, it is a way to describe.
Introduction to XML Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
TEI presentation for IS 590 Robert Patrick Waltz July 10 th, 2012.
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML.
Collaborating with Co-Workers Microsoft Office Word 2007 Illustrated Complete.
HTML is about making documents. Simple Code for Simple Layout My Document This is an example HTML document First paragraph Second paragraph This is the.
WEBSITE DESIGN Chp 1
Javascript & jQuery XML.
Creating Web Documents
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML
Presentation transcript:

Introduction to XML and TEI for Digital Archives Entities, ROMA, the ODD, and Transcribing Manuscripts

Character Entities Character entities are special characters not included in the basic ASCII set. These must be defined in a “DOCTYPE” declaration at the beginning of a document. Two characters that are predefined in XML (and must be transcribed as character entities so that they aren’t mistaken for markup) are “&” (“&”) and “<“ (“<”). <!DOCTYPE rootelement [ ]>

Character Entity Codes Character entities can be represented in a number of ways, most of which can be read by most browsers that display them. The simplest representation is a decimal representation, which takes the format Ӓ. Hexadecimal representations use the format ሴ. You can find lists of character entities online: And an exhaustive series of lists here:

Marking Up Manuscripts There are a number of special tags designed to represent features in manuscripts. The most important of these are those tagging abbreviations, additions, and deletions. Abbreviations can be marked using an tag. Using this in combination with a tag and a tag permits one to expand the abbreviation. HMS Her Majesty’s Ship

Manuscript Additions and Deletions Manuscript additions are shown using (surprise surprise!) an tag. There are a number of useful attributes which can be added to this, most useful of which is the “place” attribute, used to indicate where the addition appears: Stuff added Manuscript deletions are shown in a similar fashion, using a tag. Stuff that’s been deleted

Combined Additions and Deletions Where additions and deletions occur together, as in a substitution of one word for another, they can be nested within a tag: hastle hassle

Indicating Responsibility There are a number of forms of “responsibility” that you may wish to indicate when tagging manuscripts. Most obviously, you may wish to indicate your best estimation as to the author of additions and deletions made to the manuscript. The best way to indicate responsibility is by using an “xml:id” attribute, which is set in the document header, inside a “profile description.” The element “provides a detailed description of non- bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting.”

Indicating Responsibility II The tag appears in the, which is in turn a child of the element. … <handNote xml:id=“RH" scribe=“RobertHooke" script="handwritten" medium=“pen“> the document's main hand, Robert Hooke …

Indicating Responsibility III Once you’ve established the identity of your “hands,” you can make abbreviated reference to this in your markup of the manuscript deletions and insertions. hastle hassle

Showing Restored Text In some instances, there may be an indication that deleted text has been “restored.” You can indicate this as well. out off This would indicate that the original deletion has been deemed acceptable after all, and is again a legitimate part of the “reading” of the text.