Sekimo Solutions mentioned by the TEI  CONCUR: an optional feature of SGML (not XML) that allows multiple.

Slides:



Advertisements
Similar presentations
CSCI N241: Fundamentals of Web Design Copyright ©2004 Department of Computer & Information Science Introducing XHTML: Module B: HTML to XHTML.
Advertisements

3.02B Authoring Languages 3.02 Develop webpages..
Delivering textual resources. Overview Getting the text ready – decisions & costs Structures for delivery Full text Marked-up Image and text Indexed How.
What is XML? a meta language that allows you to create and format your own document markups a method for putting structured data into a text file; these.
XML: text format Dr Andy Evans. Text-based data formats As data space has become cheaper, people have moved away from binary data formats. Text easier.
SPECIAL TOPIC XML. Introducing XML XML (eXtensible Markup Language) ◦A language used to create structured documents XML vs HTML ◦XML is designed to transport.
Authoring Languages and Web Authoring Software 4.01 Examine web page development and design.
XML: Managing Data Exchange Stylesheets. Lesson Contents CSS The basic XSL file XSL transforms Templates Sort Numbering Parameters and Variables Datatypes.
Copyright © 2004 ProsoftTraining, All Rights Reserved. Lesson 3: XHTML Coding © 2007 Prosoft Learning Corporation All rights reserved ITD 110 Web Page.
Site Development Foundations © 2004 ProsoftTraining All rights reserved.
Stand-off Annotation Further details and examples: Durusau and O’Donnell’s (2001) powerpoint presentation Thompson and McKelvie’s (1997) “Hyperlink semantics.
Basics of HTML What is HTML?  HTML or Hyper Text Markup Language is the standard markup language used to create Web pages.  HTML is.
1 COS 425: Database and Information Management Systems XML and information exchange.
Tutorial 11 Creating XML Document
Chapter 1 Understanding the Web Design Environment
Introducing XHTML: Module B: HTML to XHTML. Goals Understand how XHTML evolved as a language for Web delivery Understand the importance of DTDs Understand.
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
XML Basics Hope Greenberg Center for Teaching & Learning.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
HTML & XHTML Introduction. First HTML 1992 Marked-up text to represent a hypertext document for transmission over the network The hypertext mark-up language.
Modern Information Retrieval Chap. 02: Modeling (Structured Text Models)
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
XML introduction to Ahmed I. Deeb Dr. Anwar Mousa  presenter  instructor University Of Palestine-2009.
EAD: A Technical Introduction Julie Hardesty, Metadata Analyst June 3, 2014.
Week 1.  Phillip Chee   Ext.1214 
Copyright © 2012 Accenture All Rights Reserved.Copyright © 2012 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are.
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
NetTech Solutions Working with Web Elements Lesson 6.
1 © Netskills Quality Internet Training, University of Newcastle Introducing XML © Netskills, Quality Internet Training University.
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 6 XSLT (Based on Møller and Schwartzbach,
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
Leiden University. The university to discover. DMT Week 3 Adriaan van der Weel and Peter Verhaar.
Session IV Chapter 9 – XML Schemas
Processing of structured documents Spring 2002, Part 2 Helena Ahonen-Myka.
Avoid using attributes? Some of the problems using attributes: Attributes cannot contain multiple values (child elements can) Attributes are not easily.
1 Chapter 10: XML What is XML What is XML Basic Components of XML Basic Components of XML XPath XPath XQuery XQuery.
E0262 – MIS – Multimedia Storage Techniques XML (Extensible Markup Language  XML is a markup language for creating documents containing structured information.
XML 2nd EDITION Tutorial 1 Creating An Xml Document.
XML Documents Chao-Hsien Chu, Ph.D. School of Information Sciences and Technology The Pennsylvania State University Elements Attributes Comments PI Document.
ECA 228 Internet/Intranet Design I Intro to Markup.
History Day Interpretive Web Site Category Notes on Web Site Category  Fourth year as a full NHD category.  Individual and Group entries are now.
Introduction to XML This presentation covers introductory features of XML. What XML is and what it is not? What does it do? Put different related technologies.
XP 1 Creating an XML Document Developing an XML Document for the Jazz Warehouse XML Tutorial.
Introduction to Markup David J. Birnbaum University of Pittsburgh Slavic Digital Text Workshop University.
The eXtensible Markup Language (XML). Presentation Outline Part 1: The basics of creating an XML document Part 2: Developing constraints for a well formed.
XML Design Goals 1.XML must be easily usable over the Internet 2.XML must support a wide variety of applications 3.XML must be compatible with SGML 4.It.
1 XML eXtensible Markup Language. 2 XML vs. HTML HTML is a HyperText Markup language HTML is a HyperText Markup language Designed for a specific application,
XLIFF 2.0 vs XLIFF 1.2 FEISGILTT Dublin June 2014 Yves Savourel ENLASO Corporation This presentation was made possible by.
1 Indexing The syntax for creating a index is: CREATE [UNIQUE] INDEX index_name ON table_name (column1, column2,... column_n) [ COMPUTE STATISTICS ]; Why.
Introduction to Markup Languages January 31, 2002.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Web Technologies Lecture 4 XML and XHTML. XML Extensible Markup Language Set of rules for encoding a document in a format readable – By humans, and –
Jan Christoph Meister University of Hamburg
XML CSC1310 Fall HTML (TIM BERNERS-LEE) HyperText Markup Language  HTML (HyperText Markup Language): December  Markup  Markup is a symbol.
Module Road Map Assignment Road Map Notice we have linked the conduit directly to the presentation layer. This is normally a bad idea!
Lifecycle Metadata for Digital Objects October 2, 2006 Implementing Metadata in XML.
Delivering textual and visual resources. Overview Case studies Methods for providing access Structures for delivery Full text Marked-up Image and text.
XML – Basic Concepts (modified version from Dr. Praveen Madiraju) 2015, Fall Pusan National University Ki-Joune Li.
Web Design Principles 5 th Edition Chapter 3 Writing HTML for the Modern Web.
1 XML eXtensible Markup Language. 2 Introduction and Motivation Dr. Praveen Madiraju Modified from Dr.Sagiv’s slides.
XML QUESTIONS AND ANSWERS
Introduction to XHTML.
Microsoft Office Illustrated
Creating an XML Document
Introducing HTML & XHTML:
Part of the Multilingual Web-LT Program
XML Data Introduction, Well-formed XML.
More Sample XML By Sadia Anjum.
Presentation transcript:

Sekimo Solutions mentioned by the TEI  CONCUR: an optional feature of SGML (not XML) that allows multiple hierarchies to be marked up concurrently in the same document  milestone elements: empty elements that mark the boundaries between elements in a non-nesting structure  fragmentation of an item: the division of a single element into two or more parts, each of which nests properly within its context  virtual joins: the re-creation of a virtual element from fragments of text  redundant encoding: information encoded in multiple forms

Sekimo Problems with milestones  milestones are empty elements  milestones elements have no content  consequences:  no content model restriction can be stated by a document grammar  standard SG/XML editors cannot annotate these regions  SG/XML parsers cannot ensure proper nesting of the milestone elements  to process these regions by means of a style sheet is  more difficult (XSLT) or  impossible (CSS)

Sekimo CLIX/Horse-milestones  Differing type of milestones …...  CLIX Non-XML: s xyz t Would be : b xyz t

Sekimo Problems with the other TEI-solutions  CONCUR:  (de facto) not implemented (and not part of XML)  fragmentation of an item:  results in 'containers' containing only a part of the text, e.g. a fragmented sentence or para would not contain an entire sentence or paragraph, as implied  virtual joins:  requires a separate interpretation of the SGML document  redundant encoding:  results in multiple files  the files are not integrated in a larger unit  it exists no unit containing all the information

Sekimo Stand-off annotation  new layers of annotation are added by building a new tree whose nodes are SGML elements which do not contain textual content, but links to another layer  in some respects a generalization of the virtual joins (although not mentioned by the TEI), because  not only contents of elements are joined, but also ranges between points within the document  link base:  Distinction 1: markup already contained in an annotation layer vs. text content, addressed by character offsets  Distinction 2: one (dedicated) layer as the link target vs. (free) interlinking of several layers

Sekimo Advantages of stand-off annotation  Thompson & McKelvie (1997)  the source document might be read-only  annotation files can be distributed without distributing the source text  Michael Glass & Barbara Di Eugenio (2002)  discontinuous segments of text can be combined in a single annotation  independent parallel coders can produce independent annotations  different annotation files can contain different layers of information  Pianta & Bentivogli (2004)  elegance and clarity  processing conceptually simple

Sekimo Drawbacks of stand-off annotation  new layers require a separate interpretation  the layers, although separate, depend on each other  the information, although included, is difficult to access using generic methods  standard parsing or editing software cannot be employed  standard document grammars can only be used for the level, containing both markup and textual data  linking at a sub-element range is difficult  the primary layer should be a (primary) level

Sekimo Non SGML-based Markup Languages  some non-SGML-based markup languages have been proposed, e.g. Multi-Element Code System (MECS) or TexMECS  its major extension with respect to SGML and XML is that overlapping ranges are admitted within documents.  in 2002 the Layered Markup and Annotation Language (LMNL) was proposed Tennison and Piez 2002  LMNL is a markup language which not only allows to annotate overlapping elements but also to connect the element names to corresponding annotation levels.  LMNL solves both problems, but  (full) LMNL is not SGML-based