Presentation is loading. Please wait.

Presentation is loading. Please wait.

An overview of the TEI vocabulary ➢ markup makes explicit a theory about some aspect of a document ➢ some theories are more useful or generalizable than.

Similar presentations


Presentation on theme: "An overview of the TEI vocabulary ➢ markup makes explicit a theory about some aspect of a document ➢ some theories are more useful or generalizable than."— Presentation transcript:

1

2 An overview of the TEI vocabulary ➢ markup makes explicit a theory about some aspect of a document ➢ some theories are more useful or generalizable than others ➢ … so no markup language can reasonably claim to be exhaustive ➢ … so are we doomed to a further confusion of tongues?

3 Basic concepts The TEI is a modular system consisting of several modules which can be combined ad lib ● Each module defines groups of related elements and attributes ● Elements are also classified semantically and structurally This presentation gives an overview of some of the components of one view of the TEI: TEI Lite

4 Basic structure(s) ● Every TEI-conformant document comprises a header followed by (at least one) text ● the header contains: ● mandatory file description ● optional encoding, profile and revision descriptions ● the header is essential for: ● bibliographic control and identification ● resource documentation and processing

5 Structure of a TEI text ● A text may be unitary or composite ● a unitary text contains ● front matter ● back matter ● a body ● in a composite text, the body is a group of texts (or nested groups)

6 TEI basic structure teiHeader tei.2 teiCorpus.2 tei.2 teiHeader TEI.2 back front text body div group div back front text body s

7 TEI global attributes ● Available on all elements in all modules... ● id for unique identification ● n for (non-unique) name or number ● rend for rendition (appearance) ● lang for language ● Can be extended in some modules ● corresp, synch, ana for specific association types ● next, prev for aggregating fragmented elements

8 A text usually has divisions ● generic, hierarchic subdivisions ● vanilla or numbered ● type attribute ● associated head and trailer elements from the divtop class

9 for example... Book I. Of writing lives in general,...

10 Text components (prose base) ● What are divisions composed of? ● prose is mostly paragraphs ( ) ● verse is mostly lines ( ), sometimes in hierarchic groups ( ) ● drama is mostly speeches ( ) containing or and interspersed with stage directions ( ) ● These may be mixed, and may also appear directly within undivided texts.

11 Verse: an example Summer grass — all that's left of warriors' dreams.

12 Drama: an example Enter Barnardo and Francisco, two Sentinels, at several doors Barnardo: Who's there? Francisco: Nay, answer me. Stand and unfold yourself. Barnardo: Long live the king! Francisco: Barnardo? Barnardo: He. Enter Barnardo and Francisco, two Sentinels,at several doors Who's there? Nay, answer me. Stand and unfold yourself. Long live the king! Barnardo? He.

13 Texts are not just words... ● … but probably only people know that ● an encoding may claim to capture ● just visual salience, ● just its assumed causes ● both ● encoding makes explicit one (or more) sets of interpretations

14 For example... And this Indenture further witnesseth that the said Walter Shandy, merchant, in consideration of the said intended marriage...

15 …or... And this Indenture further witnesseth that the said Walter Shandy, merchant, in consideration of the said intended marriage...

16 Who does the work? ● the TEI scheme allows for close reading -- and the reverse ● you can tag very detailed features of discourse function ● you can normalise or simplify (e.g. dates numbers, names) ● … or leave well alone

17 Core phrase level elements include... ● phrases that are conventionally typographically distinct ● “data-like” (names, numbers, dates, times, addresses) ● editorial intervention (corrections, regularizations, additions, omissions...) ● cross references and links

18 for example... Of writing lives in general,and particularly of Pamela, with a word by the bye of Colley Cibber and others. It is a trite but true observation, that examples work more forcibly on the mind than precepts.… Mr. Joseph Andrews, the hero of our ensuing history, was esteemed to be... Of writing lives in general,and particularly of Pamela, with a word by the bye of Colley Cibber and others. It is a trite but true observation, that examples work more forcibly on the mind than precepts.… Mr. Joseph Andrews, the hero of our ensuing history, was esteemed to be...

19 Spaulding, he came down into the office just this day eight weeks with this very paper in his hand, and he says:— I wish to the Lord, Mr. Wilson, that I was a red-headed man. Direct speech ● Use the who attribute to show speakers ● Speeches can be nested in other speeches ●.. but not across paragraph breaks

20 Have you read Die Dreigroschenoper ? Savoir-faire is French for know-how. John has real savoir- faire. Foreign language phrases ● The lang attribute may be attached to any element ● Use if nothing else is available ● Use Unicode!

21 My dear Mr. Bennet, said his lady to him one day, have you heard that Netherfield Park is let at last? Names and other referring strings ● The (referring string) element is used for any kind of name or reference

22 Today is Tuesday 29th. One afternoon in late November.. One afternoon in <dateRange from='1994-11-15' to='1994-11-30 exact='to'> late November.. Dates, times, numbers ● attributes can be used to quantify and expressions ● similarly, times, and numbers ● [AT P5 better validation will be available]

23 The multiple hierarchy problem ● XML allows only one hierarchy at a time ● Is a document ● chapter-paragraph-phrase ● gathering-page-leaf ● or both? ● discontinuous segments ● links and milestones

24 Diana and Mary approved the step unreservedly. Dia na announced that... Boundary markers ● page, column, and line breaks (,, ) ● generic

25 Some chunks are also phrases ● lists of all kinds ● notes (authorial or editorial) ● pictures or figures ● formulae ● tables ● bibliographic descriptions

26 Lists ● use for lists of any kind (use type attribute to distinguish) ● use in two-column lists as alternative to n attribute ● may be nested as necessary

27 for example... For my true love: * three calling birds * two french hens * a partridge in a pear tree For Uncle Joe: socks as usual For my true love three calling birds> two french hens a partridge in a pear tree For Uncle Joe socks as usual

28 Figures and graphics ● The presence of a graphic is indicated by the element ● The title of the graphic is tagged as a ● A description of the graphic may be supplied (as a ) for use by software unable to render the graphic ● The graphic itself is a separate object ● [At P5, it will be possible to embed SVG]

29 Mr Fezziwig's Ball A Cruikshank engraving showing Mr Fezziwig leading a group of revellers. for example...

30 Notes ● Use for notes of any kind (editorial or authorial) ● if in-line, use place attribute to specify location ● if out of line, either ● point from note into text (use target attribute) ● or point from text out to note (use

31 for example... The self-same moment I could pray> And from my neck so free The albatross fell off, and sank Like lead into the sea. The spell begins to break. The self-same moment I could pray> And from my neck so free The albatross fell off, and sank Like lead into the sea. The spell begins to break.

32 Bibliography ● Use simple with optional subcomponents: ● (for any kind of responsibility) or,, etc. ● with optional level attribute ● groups publication details ● adds page references etc. ● Use for list of references

33 Bibliography Ed Regis Great Mambo Chicken and the Trans-Human Experience London Penguin Books 1992 pp 144 ff for example... See for example Regis (1992)....

34 Generic problems call for generic solutions Links and pointers ● cross-referencing ● association of text and annotation ● association of image and text or audio and transcript ● alignment of text and translation...

35 ● Use (empty element) or (with content) ● Use target to specify an identifier (ID value) Cross References See especially section 12 on page 34. See especially.... Concerning Identifiers But what if the target is not in the current document?

36 TEI X-pointers ● TEI defined a "location ladder" style syntax later adapted by W3C as Xpath ● Syntax now under review ● Basic notion: tree navigation see especially see especially <xptr doc='doc2' from="DESCENDANT (2 DIV1) (4 P) CHILD (1 QUOTE LANG LAT)"/>

37 Also in TEI Lite... ● specialised front and back matter ● analytic tagging ● segmentation ● interpretations ● the header ● tags for editorial work ● tags for documentation

38 Further reading ● http://www.tei-c.org/Lite/ http://www.tei-c.org/Lite/ ● (also available in French, Italian, Korean, Russian, Spanish, and Japanese)


Download ppt "An overview of the TEI vocabulary ➢ markup makes explicit a theory about some aspect of a document ➢ some theories are more useful or generalizable than."

Similar presentations


Ads by Google