DOC PREVIEW
Penn CIT 594 - Extensible Markup Language

This preview shows page 1-2-3-27-28-29 out of 29 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 29 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 29 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 29 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 29 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 29 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 29 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 29 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

XMLHTML and XML, IHTML and XML, IIHTML and XML, IIIXML-related technologiesExample XML documentOverall structureXML building blocksElements and attributesWell-formed XMLEntitiesXML declarationProcessing instructionsCommentsCDATANames in XMLNamespacesNamespaces and URIsNamespace syntaxReview of XML rulesAnother well-structured exampleXML as a treeValid XMLMixed contentExample XML document, revisedViewing XMLExtended document standardsVocabularyThe EndJan 14, 2019XMLeXtensible Markup Language2HTML and XML, IXML stands for eXtensible Markup LanguageHTML is used to mark up text so it can be displayed to usersXML is used to mark up data so it can be processed by computersHTML describes both structure (e.g. <p>, <h2>, <em>) and appearance (e.g. <br>, <font>, <i>)XML describes only content, or “meaning”HTML uses a fixed, unchangeable set of tagsIn XML, you make up your own tags3HTML and XML, IIHTML and XML look similar, because they are both SGML languages (SGML = Standard Generalized Markup Language) Both HTML and XML use elements enclosed in tags (e.g. <body>This is an element</body>)Both use tag attributes (e.g.,<font face="Verdana" size="+1" color="red">)Both use entities (&lt;, &gt;, &amp;, &quot;, &apos;)More precisely,HTML is defined in SGMLXML is a (very small) subset of SGML4HTML and XML, IIIHTML is for humansHTML describes web pagesYou don’t want to see error messages about web pages you visitBrowsers ignore and/or correct as many HTML errors as they can, so HTML is often sloppyXML is for computersXML describes dataThe rules are strict and errors are not allowedIn this way, XML is like a programming languageCurrent versions of most browsers can display XMLHowever, browser support of XML tends to be inconsistent5XML-related technologiesDTD (Document Type Definition) and XML Schemas are used to define legal XML tags and their attributes for particular purposesCSS (Cascading Style Sheets) describe how to display HTML or XML in a browserXSLT (eXtensible Stylesheet Language Transformations) and XPath are used to translate from one form of XML to anotherDOM (Document Object Model), SAX (Simple API for XML, and JAXP (Java API for XML Processing) are all APIs for XML parsing6Example XML document<?xml version="1.0"?><weatherReport> <date>7/14/97</date> <city>North Place</city> <state>NX</state> <country>USA</country> High Temp: <high scale="F">103</high> Low Temp: <low scale="F">70</low> Morning: <morning>Partly cloudy, Hazy</morning> Afternoon: <afternoon>Sunny &amp; hot</afternoon> Evening: <evening>Clear and Cooler</evening></weatherReport>From: XML: A Primer, by Simon St. Laurent7Overall structure An XML document may start with one or more processing instructions (PIs) or directives: <?xml version="1.0"?><?xml-stylesheet type="text/css" href="ss.css"?>Following the directives, there must be exactly one tag, called the root element, containing all the rest of the XML: <weatherReport> ...</weatherReport>8XML building blocksAside from the directives, an XML document is built from:elements: high in <high scale="F">103</high>tags, in pairs: <high scale="F">103</high>attributes: <high scale="F">103</high>entities: <afternoon>Sunny &amp; hot</afternoon>character data, which may be:parsed (processed as XML)--this is the defaultunparsed (all characters stand for themselves)9Elements and attributesAttributes and elements are somewhat interchangeableExample using just elements: <name> <first>David</first> <last>Matuszek</last></name>Example using attributes: <name first="David" last="Matuszek"></name>You will find that elements are easier to use in your programs--this is a good reason to prefer themAttributes often contain metadata, such as unique IDsGenerally speaking, browsers display only elements (values enclosed by tags), not tags and attributes10Well-formed XMLEvery element must have both a start tag and an end tag, e.g. <name> ... </name>But empty elements can be abbreviated: <break />.XML tags are case sensitiveXML tags may not begin with the letters xml, in any combination of casesElements must be properly nested, e.g. not <b><i>bold and italic</b></i>Every XML document must have one and only one root elementThe values of attributes must be enclosed in single or double quotes, e.g. <time unit="days">Character data cannot contain < or &11EntitiesFive special characters must be written as entities: &amp; for & (almost always necessary) &lt; for < (almost always necessary) &gt; for > (not usually necessary) &quot; for " (necessary inside double quotes) &apos; for ' (necessary inside single quotes)These entities can be used even in places where they are not absolutely requiredThese are the only predefined entities in XML12XML declarationThe XML declaration looks like this:<?xml version="1.0" encoding="UTF-8" standalone="yes"?>The XML declaration is not required by browsers, but is required by most XML processors (so include it!)If present, the XML declaration must be first--not even whitespace should precede itNote that the brackets are <? and ?>version="1.0" is required (this is the only version so far)encoding can be "UTF-8" (ASCII) or "UTF-16" (Unicode), or something else, or it can be omittedstandalone tells whether there is a separate DTD13Processing instructionsPIs (Processing Instructions) may occur anywhere in the XML document (but usually first)A PI is a command to the program processing the XML document to handle it in a certain wayXML documents are typically processed by more than one programPrograms that do not recognize a given PI should just ignore itGeneral format of a PI: <?target instructions?>Example: <?xml-stylesheet type="text/css" href="mySheet.css"?>14Comments<!-- This is a comment in both HTML and XML -->Comments can be put anywhere in an XML documentComments are useful for:Explaining the structure of an XML documentCommenting out parts of the XML during development and testingComments are not elements and do not have an end tagThe blanks after <!-- and before --> are optionalThe character sequence -- cannot occur in the commentThe closing bracket must be -->Comments are not displayed by browsers, but can be seen by anyone who looks at the source code15CDATABy default, all text inside an XML document is


View Full Document

Penn CIT 594 - Extensible Markup Language

Documents in this Course
Trees

Trees

17 pages

Searching

Searching

24 pages

Pruning

Pruning

11 pages

Arrays

Arrays

17 pages

Stacks

Stacks

30 pages

Recursion

Recursion

25 pages

Hashing

Hashing

24 pages

Recursion

Recursion

24 pages

Graphs

Graphs

25 pages

Storage

Storage

37 pages

Trees

Trees

21 pages

Arrays

Arrays

24 pages

Hashing

Hashing

24 pages

Recursion

Recursion

25 pages

Graphs

Graphs

23 pages

Graphs

Graphs

25 pages

Stacks

Stacks

25 pages

Recursion

Recursion

25 pages

Quicksort

Quicksort

21 pages

Quicksort

Quicksort

21 pages

Graphs

Graphs

25 pages

Recursion

Recursion

25 pages

Searching

Searching

24 pages

Counting

Counting

20 pages

HTML

HTML

18 pages

Recursion

Recursion

24 pages

Pruning

Pruning

11 pages

Graphs

Graphs

25 pages

Load more
Download Extensible Markup Language
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Extensible Markup Language and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Extensible Markup Language 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?