Presentation is loading. Please wait.

Presentation is loading. Please wait.

13-Jun-15 RELAX NG. 2 Caveat I did not have a RELAX NG validator when I wrote these slides. Therefore, if an example appears to be wrong, it probably.

Similar presentations


Presentation on theme: "13-Jun-15 RELAX NG. 2 Caveat I did not have a RELAX NG validator when I wrote these slides. Therefore, if an example appears to be wrong, it probably."— Presentation transcript:

1 13-Jun-15 RELAX NG

2 2 Caveat I did not have a RELAX NG validator when I wrote these slides. Therefore, if an example appears to be wrong, it probably is.

3 3 What is RELAX NG? RELAX NG is a schema language for XML It is an alternative to DTDs and XML Schemas It is based on earlier schema languages, RELAX and TREX It is not a W3C standard, but is an OASIS standard OASIS is the Organization for the Advancement of Structured Information Standards ebXML (Enterprise Business XML) is a joint effort of OASIS and UN/CEFACT (United Nations Centre for Trade Facilitation and Electronic Business) OASIS developed the highly popular DocBook DTD for describing books, articles, and technical documents RELAX NG has recently been adopted as an ISO/IEC standard

4 4 Design goals Simple and easy to learn Uses XML syntax But there is also a “concise” (non-XML) syntax Does not change the information set of an XML document (I’m not sure what this means) Supports XML namespaces Treats attributes uniformly with elements so far as possible Has unrestricted support for unordered content Has unrestricted support for mixed content Has a solid theoretical basis Can make use of a separate datatyping language (such W3C XML Schema Datatypes)

5 5 RELAX NG tools Jing An open source validator written in Java Sun’s MSV Another validator DTDinst Translates from DTDs into RNG (RELAX NG) syntax or RNG “compact” syntax Trang Translates RNG compact syntax into RNG syntax Translates RNG or RNG compact syntax into DTDs Sun’s RELAX NG Converter Translates DTDs into RNG syntax (but not well) Translates an XML Schema subset into RNG syntax (imperfectly)

6 6 Basic structure A RELAX NG specification is written in XML, so it obeys all XML rules The RELAX NG specification has one root element The document it describes also has one root element The root element of the specification is element If the root element of your document is book, then the RELAX NG specifications begins: and ends:

7 7 Data elements RELAX NG makes a clear separation between: the structure of a document (which it describes) the datatypes used in the document (which it gets from somewhere else, such as from XML Schemas) For starters, we will use the two (XML-defined) elements:... (usually written ) Plain character data, not containing other elements (usually written ) Does not contain anything Other datatypes, such as... are not defined in RELAX NG To inherit datatypes from XML Schemas, use: datatypeLibrary="http://www.w3.org/2001/XMLSchema-datatypes" as an attribute of the root element

8 8 Data types from XML Schemas Here are some of the predefined numeric types: Here are some of the predefined date/time types: xs:date -- A date in the format CCYY-MM-DD, for example, 2002-11-05 xs:time -- A date in the format hh:mm:ss (hours, minutes, seconds) xs:dateTime -- Format is CCYY-MM-DD T hh:mm:ss The T is part of the syntax xs:decimalxs:positiveInteger xs:bytexs:negativeInteger xs:shortxs:nonPositiveInteger xs:intxs:nonNegativeInteger xs:long

9 9 Predefined date and time types xs:date -- A date in the format CCYY-MM-DD, for example, 2002-11-05 xs:time -- A date in the format hh:mm:ss (hours, minutes, seconds) xs:dateTime -- Format is CCYY-MM-DD T hh:mm:ss The T is part of the syntax Allowable restrictions on dates and times: enumeration, minInclusive, minExclusive, maxInclusive, maxExclusive, pattern, whiteSpace

10 10 Defining tags To define a tag (and specify its content), use Example: The DTD Translates to: Note: As in the DTD, the components must occur in order

11 11 RELAX NG describes patterns Your RELAX NG document specifies a pattern that matches your valid XML documents For example, the pattern: Will match the XML: David Matuszek

12 12 Easy tags... The enclosed content occurs zero or more times... The enclosed content occurs one or more times... The enclosed content occurs once or not at all... Any one of the enclosed elements may occur

13 13 Example

14 14 Enumerations The... pattern matches a specified value Example: male female The contents of are subject to whitespace normalization: Leading and trailing whitespace is removed Internal sequences of whitespace characters are collapsed to a single blank

15 15 More about data Remember: To inherit datatypes from XML Schemas, add this attribute to the root element: datatypeLibrary = "http://www.w3.org/2001/XMLSchema-datatypes" You can access the inherited types with the tag, for instance, The pattern must match the entire content of the enclosing tag, not just part of it If you don't specify a datatype library, RELAX NG defines the following for you (along with and ): : No whitespace normalization is done : A sequence of characters containing no whitespace

16 16... is used as “fat parentheses” Example: choice #1 choice #2

17 17 Attributes Attributes are defined practically the same way as elements:... Example: Matches: David Matuszek

18 18 More about attributes With attributes, as with elements, you can use,, and It doesn’t make sense to use or with attributes In keeping with the usual XML rules, The order in which you list elements is significant The order in which you list attributes is not significant

19 19 Still more about attributes can be (and usually is) abbreviated as However, can not be abbreviated as If an element has no attributes and no content, you must use explicitly

20 20 pattern matches a whitespace- separated list of tokens, and applies the pattern to those tokens Example:

21 21... allows the contained elements to occur in any order is more sophisticated than you might expect If a contained element can occur more than once, the various instances do not need to occur together

22 22 Interleave example dave@acm.org 215-898-8122 matuszek@central.cis.upenn.edu

23 23 allows mixed content, that is, both text and patterns If pattern is a RELAX NG pattern, then pattern is shorthand for pattern

24 24 Example of Pattern: Matches: This is not a great example, but it should suffice. Without this we get one bold or one italic

25 25 The need for named patterns So far, we have defined elements exactly at the point that they can be used There is no equivalent of:...use person several places in the DTD... With the RELAX NG we have discussed so far, each time we want to include a person, we would need to explicitly define both person and name at that point: The element solves this problem

26 26 Syntax of...usual RELAX NG elements, which may include:...usual RELAX NG elements, attributes, groups, etc.

27 27 Use of To write a, Make the root element of your specification Hence it should say xmlns="http://relaxng.org/ns/structure/1.0" Use, as the element, a pattern that matches the entire (valid) XML document In each element, write a pattern that you want to use other places in the specification Wherever you want to use a defined element, put Note that defined elements may be used in definitions, not just in the element Definitions may even be recursive, but Recursive references must be in an element, not an attribute

28 28 Long example of XML is case sensitive-- Note that defined terms are capitalized differently

29 29 Common usage I A typical way to use RELAX NG is to use a with just the root element in and every element described by a...more...

30 30 Common usage II

31 31 Replacing DTDs With and multiple s, we can do essentially the same things as a DTD Advantages: RELAX NG is more expressive than a DTD; we can interleave elements, specify data types, allow specific data values, use namespaces, and control the mixing of data and patterns RELAX NG is written in XML RELAX NG is relatively easy to understand Disadvantages RELAX NG is extremely verbose But there is a “compact syntax” that is much shorter RELAX NG is not (yet) nearly as well known Hence there are fewer tools to work with it This situation seems to be changing

32 32 The End So by this maxim be impressed, USE THE TOOLS THAT WORK THE BEST. Do not yield your sovereign judgment, To any sort of political fudgement. The criterion of sound design Should be, must be, your guideline. And if you're designing documents, Try RNG. We charge no rents. -- John Cowan


Download ppt "13-Jun-15 RELAX NG. 2 Caveat I did not have a RELAX NG validator when I wrote these slides. Therefore, if an example appears to be wrong, it probably."

Similar presentations


Ads by Google