DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?

Slides:



Advertisements
Similar presentations
3.02B Authoring Languages 3.02 Develop webpages..
Advertisements

HTML I. HTML Hypertext mark-up language. Uses tags to identify elements of a page so that a browser such as Internet explorer can render the page on a.
Delivering textual resources. Overview Getting the text ready – decisions & costs Structures for delivery Full text Marked-up Image and text Indexed How.
Computers: Tools for an Information Age Writing Your Own Web Page: Using HTML and Web Authoring Tools.
XML and Enterprise Computing. What is XML? Stands for “Extensible Markup Language” –similar to SGML and HTML –document “tags” are used to define content.
Authoring Languages and Web Authoring Software 4.01 Examine web page development and design.
1 eVenzia Technologies Learning HTML, XHTML & CSS Chapter 1.
Internet Research Internet Applications. The Internet is not the Web Because of the great popularity of the World Wide Web, people think the Internet.
Project 1 Introduction to HTML.
WMES3103 : INFORMATION RETRIEVAL
1 Pertemuan 17 Programming Languages for E-Business/E-commerce Matakuliah: M0284/Teknologi & Infrastruktur E-Business Tahun: 2005 Versi: >
Chapter 1 Understanding the Web Design Environment
1 Chapter 11 Developing Custom Help. 11 Chapter Objectives Use HTML to create customized Help topics for an application Use the HTML Help Workshop to.
Developing a Basic Web Page with HTML
1st Project Introduction to HTML.
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
Introduce of XML Xiaoling Song CS157A. What is XML? XML stands for EXtensible Markup Language XML stands for EXtensible Markup Language XML is a markup.
HTML 1 Introduction to HTML. 2 Objectives Describe the Internet and its associated key terms Describe the World Wide Web and its associated key terms.
Chapter ONE Introduction to HTML.
Web Design Basic Concepts.
ECA 228 Internet/Intranet Design I Intro to XML. ECA 228 Internet/Intranet Design I HTML markup language very loose standards browsers adjust for non-standard.
Chapter 1 Variables in the Web Design Environment.
Chapter 1 Variables in the Web Design Environment
1 Networks and the Internet A network is a structure linking computers together for the purpose of sharing resources such as printers and files Users typically.
CREATED BY ChanoknanChinnanon PanissaraUsanachote
Introduction technology XSL. 04/11/2005 Script of the presentation Introduction the XSL The XSL standard Tools for edition of codes XSL Necessary resources.
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
Introduction to XML Eugenia Fernandez IUPUI. What is XML? From the World Wide Web Consortium (W3C) The Extensible Markup Language (XML) is the universal.
Chapter 1 Understanding the Web Design Environment Principles of Web Design, 4 th Edition.
Week 1 Understanding the Web Design Environment. 1-2 HTML: Then and Now HTML is an application of the Standard Generalized Markup Language Intended to.
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
Introduction to HTML. What is HTML? Hyper Text Markup Language (HTML) is a language for describing web pages. HTML is not a programming language, it is.
Introduction to HTML Tutorial 1 eXtensible Markup Language (XML)
Presentation Topic: XML and ASP Presented by Yanzhi Zhang.
XML 2nd EDITION Tutorial 1 Creating An Xml Document.
XP 1 Creating an XML Document Developing an XML Document for the Jazz Warehouse XML Tutorial.
CEAL 2003 XML for CJK Wooseob Jeong School of Information Studies University of Wisconsin - Milwaukee.
XML Design Goals 1.XML must be easily usable over the Internet 2.XML must support a wide variety of applications 3.XML must be compatible with SGML 4.It.
Lecture 3 Creating a Web Page with HTML. Objectives §Hypertext Document in WWW p §The HTML language p l Definition l Web browsers and.
Introducing the World Wide Web Internet- a structure made up of millions of interconnected computers whose users communicate with each other and share.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Document Computing Technologies for Managing Electronic Document Collections Ross Wilkinson... [et al.] Circulation Counter [RES3H] ZA4080.D
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
XML The Extensible Markup Language (XML ), which is comparable to SGML and modeled on it, describes how to describe a collection of data. A standard way.
HTML Concepts and Techniques Fifth Edition Chapter 1 Introduction to HTML.
Martin Kruliš by Martin Kruliš (v1.1)1.
Writing Your Own Web Page: Using HTML and FrontPage Chapter 10.
Rendering XML Documents ©NIITeXtensible Markup Language/Lesson 5/Slide 1 of 46 Objectives In this session, you will learn to: * Define rendering * Identify.
Web Design Principles 5 th Edition Chapter 3 Writing HTML for the Modern Web.
XML BASICS and more…. What is XML? In common:  XML is a standard, simple, self-describing way of encoding both text and data so that content can be processed.
Pertemuan 17 Programming Languages for E-Business/E-commerce
2.01 Understand Digital Raster Graphics
2.01 Understand Digital Raster Graphics
Project 1 Introduction to HTML.
4.01B Authoring Languages and Web Authoring Software
Module: Software Engineering of Web Applications
Chapter 1 Introduction to HTML.
2.01 Understand Digital Raster Graphics
XML QUESTIONS AND ANSWERS
Project 1 Introduction to HTML.
XML in Web Technologies
Using Access and the Web
Microsoft Office Illustrated
Prepared for Md. Zakir Hossain Lecturer, CSE, DUET Prepared by Miton Chandra Datta
4.01B Authoring Languages and Web Authoring Software
2.01 Understand Digital Raster Graphics
2.01 Understand Digital Raster Graphics
Understand basic HTML and CSS terminology, concepts, and basic operations. Objective 3.01.
Unit 6 - XML Transformations
Presentation transcript:

DOCUMENT TYPES

Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized? Converting documents to an electronic format will preserve those documents, but how would such a process be organized? And then, how could the electronic documents be distributed? And then, how could the electronic documents be distributed? Building a digital library for books and articles by: Building a digital library for books and articles by: Digitizing books and articles Digitizing books and articles Storing them in an indexed database Storing them in an indexed database

Mark-ups Mark-up is everything in a document that is not content. Mark-up is everything in a document that is not content.

Procedural mark-up Procedural mark-up are codes that contain information on how a specific application should process the document (example of procedural mark-up formats: Microsoft Word). Procedural mark-up are codes that contain information on how a specific application should process the document (example of procedural mark-up formats: Microsoft Word).

Presentational mark-up Presentational mark-up are codes that describe how the document should be presented or laid out, either on a computer screen or on a Presentational mark-up are codes that describe how the document should be presented or laid out, either on a computer screen or on a

Descriptive mark-up Descriptive mark-up are codes that describe the logical structure Descriptive mark-up are codes that describe the logical structure

printed page (example of presentational mark-up language: HTML). printed page (example of presentational mark-up language: HTML). by many different software applications (example of descriptive markup by many different software applications (example of descriptive markup meta-language: XML). Competition meta-language: XML). Competition

documents Microsoft Word Microsoft Word Rich Text Format Rich Text Format templates templates To reduce the time of creating documents of the same type or class, like memos, letters, technical reports, research articles and invoices, document can help you. To reduce the time of creating documents of the same type or class, like memos, letters, technical reports, research articles and invoices, document can help you.

Template contains styles sheet that will be used to format this type of document and framework with elements such as a standard front page, headers and footers, a standard set of sections and headings, etc. Template contains styles sheet that will be used to format this type of document and framework with elements such as a standard front page, headers and footers, a standard set of sections and headings, etc.

Word processing software uses the most common form of procedural mark-up. Word processing software uses the most common form of procedural mark-up. Word processing format, such as Word, is useful when you have to create or edit a document. Word processing format, such as Word, is useful when you have to create or edit a document. The mark-up in a word processor serves to specify how the document should be laid out when printed, and to control the functions of the word processing application. The mark-up in a word processor serves to specify how the document should be laid out when printed, and to control the functions of the word processing application.

Using a word processor such as Microsoft Word, you can set the style sheet, apply templates and create a visual structure for your document. Using a word processor such as Microsoft Word, you can set the style sheet, apply templates and create a visual structure for your document. Microsoft Word uses a proprietary, binary format: this causes problems in terms of standardization. Microsoft Word uses a proprietary, binary format: this causes problems in terms of standardization.

To resolve these problems, Microsoft have created another procedural format, RTF, that is a plain text format used as the exchange format between word processing applications. To resolve these problems, Microsoft have created another procedural format, RTF, that is a plain text format used as the exchange format between word processing applications.

HTML is an acronym, standing for Hypertext Markup Language. It is a language that can be transferred around the Internet and read by a Web Browser HTML is an acronym, standing for Hypertext Markup Language. It is a language that can be transferred around the Internet and read by a Web Browser

Simple HTML documents can be created easily using any text editor. Simple HTML documents can be created easily using any text editor. All content is defined by the markup "tags" of HTML, that are containers for whatever you put in the document. All content is defined by the markup "tags" of HTML, that are containers for whatever you put in the document. Using HTML you can define basic presentation of a document (headers, paragraphs, lists and tables), hyperlinks and multimedia information. from Word (doc) to HTML/PDF, from Word (doc) to XML, and XML to HTML/PDF. a rendition in a word processing format, such as Microsoft Word, is useful when creating or editing the document, Using HTML you can define basic presentation of a document (headers, paragraphs, lists and tables), hyperlinks and multimedia information. from Word (doc) to HTML/PDF, from Word (doc) to XML, and XML to HTML/PDF. a rendition in a word processing format, such as Microsoft Word, is useful when creating or editing the document, an HTML rendition is useful when viewing it on the Web, and an HTML rendition is useful when viewing it on the Web, and

a page rendition as a bitmap graphic or PDF format may be useful when a read-only page layout view is required. Conversion can be carried out: manually, when a person creates the rendition by re-keying the document content, and inserting the mark-up necessary. a page rendition as a bitmap graphic or PDF format may be useful when a read-only page layout view is required. Conversion can be carried out: manually, when a person creates the rendition by re-keying the document content, and inserting the mark-up necessary.

using one or more computer programs that automatically convert the document from Document one format to another. Microsoft Word is often chosen as the original document creation application using one or more computer programs that automatically convert the document from Document one format to another. Microsoft Word is often chosen as the original document creation application

However, many organizations are beginning to use XML to hold the source documents because it is easy to transform to other renditions; moreover, its mark-up captures the logical meaning of the content, it is open source and well defined with public specifications. However, many organizations are beginning to use XML to hold the source documents because it is easy to transform to other renditions; moreover, its mark-up captures the logical meaning of the content, it is open source and well defined with public specifications. There are a number of tools available on the market which can plug in to Word to help make the transformation to XML. There are a number of tools available on the market which can plug in to Word to help make the transformation to XML.

They generally use Word styles to make the transformation and rely on users of the word processor applying word styles in a consistent manner. They generally use Word styles to make the transformation and rely on users of the word processor applying word styles in a consistent manner. In this case it is necessary that users have created Word documents using styles and templates correctly. If not, it is quite difficult to make a fully automated transformation from Word to XML. In this case it is necessary that users have created Word documents using styles and templates correctly. If not, it is quite difficult to make a fully automated transformation from Word to XML.

One of the great advantages of XML is that it is very easy to transform XML mark-up to another format. The Extensible Style sheet Language for Transformations (XSLT) offers a standard way to transform XML and there are many XSLT transformation processors available, both as open source and as commercial products. One of the great advantages of XML is that it is very easy to transform XML mark-up to another format. The Extensible Style sheet Language for Transformations (XSLT) offers a standard way to transform XML and there are many XSLT transformation processors available, both as open source and as commercial products.

There is also a standard way to transform XML into page- formatted renditions such as PDF, Postscript or RTF, the XSL- FO. There is also a standard way to transform XML into page- formatted renditions such as PDF, Postscript or RTF, the XSL- FO. XSL-FO (XSL Formatting Objects) is a set of XML elements that represent objects such as pages, text blocks, tables, lists, footnotes, etc. XSL-FO (XSL Formatting Objects) is a set of XML elements that represent objects such as pages, text blocks, tables, lists, footnotes, etc.

GIF, JPG, PNG The photograph or scanned image is sampled and The photograph or scanned image is sampled and mapped as a grid of dots or picture elements (pixels). mapped as a grid of dots or picture elements (pixels). GIF, JPG, PNG GIF, JPG, PNG

PDF (Portable Document Format ) is a procedural mark-up language that allows page-formatted documents to be viewed and printed in their original format on almost any software platform. PDF (Portable Document Format ) is a procedural mark-up language that allows page-formatted documents to be viewed and printed in their original format on almost any software platform. PDF is an ideal format for scientific documents that contain unusual symbols, and for multilingual documents. PDF is an ideal format for scientific documents that contain unusual symbols, and for multilingual documents.

The compression and incremental loading features of PDF make it well suited for transmission of documents over the Internet. The compression and incremental loading features of PDF make it well suited for transmission of documents over the Internet. Many software packages can be used to create PDF documents, and PDF viewers are available free of charge. Many software packages can be used to create PDF documents, and PDF viewers are available free of charge.

A PDF document contains a set of pages which are described by three main object types: path objects, image objects and text objects. A PDF document contains a set of pages which are described by three main object types: path objects, image objects and text objects. Embedded TIFFs are PDF documents where the entire pages are TIFF images. Embedded TIFFs are PDF documents where the entire pages are TIFF images. XML, born as a profile of SGML, is an open standard for descriptive mark-up, used as exchange format between applications. XML, born as a profile of SGML, is an open standard for descriptive mark-up, used as exchange format between applications.

An XML document is well formed if it follows the basic rules of XML syntax. An XML document is well formed if it follows the basic rules of XML syntax. A Document Type Definition (DTD) and XML Schema are sets of rules which specify the logical structure that is allowable for a particular type of document. A Document Type Definition (DTD) and XML Schema are sets of rules which specify the logical structure that is allowable for a particular type of document. An XML document is valid if it complies with the rules set out in a DTD or XML Schema with which it is associated. An XML document is valid if it complies with the rules set out in a DTD or XML Schema with which it is associated.

A Cascading Style Sheet (CSS) is a separate style sheet which contains simple rendering instructions for a XML document. A Cascading Style Sheet (CSS) is a separate style sheet which contains simple rendering instructions for a XML document. Extensible Style sheet Language for Transformations (XSLT) is used to create style sheets which define transformations from XML to other XML or non-XML formats. Extensible Style sheet Language for Transformations (XSLT) is used to create style sheets which define transformations from XML to other XML or non-XML formats.