![]() Capt. Horatio T.P. Webb | (SGML, HTML, XML, CSS, XSL, XHTML, DOM (DHTML), SHTML) Last Updated 10:30AM 11/9/99 |
To simplify, the following references rely primarily on W3C (World Wide Web Consortium) at www.w3.org. This is the authority on the entirety of markup languages currently evolving from SGML. The evolution goes like this:
SGML is the "mother" of markup languages and was defined in the original international standard (ISO 8879) in 1986. SGML defines how you embed descriptive markup in a document so that the content is treated as data rather than as characters AND provides a way to describe the structure of the document.There are three basic parts:
Arbortext says: "The OS is in the form of a particular DTD that allows the user to create a Formatting Output Specification Instance, or FOSI , for both printed and electronic output. A FOSI is essentially a powerful style sheet that specifies the formatting for each tag in a DTD. With the FOSI, the document, and the DTD, you have a complete interchange package for printed documents.
In 1996, the International Standards Organization (ISO) approved the final draft of the Document Style Semantics and Specification Language (DSSSL) for SGML-based documents. The complete DSSSL standard covers a broad scope, so subsets are being developed to handle varying levels of functionality. A subset whose functionality is approximately equivalent to FOSIs is expected, and work on tools to convert FOSIs to and from DSSSL is under way."
See the short tutorial at: Arbortext
Now see the SGML/XML Bibliography
What we know and love is derived from SGML and first proposed in 1992.
See W3C's HTML page
and their HTML 4 Reference
W3C says:
Abstract
The Extensible Markup Language (XML) is a subset of SGML ... Its goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML. XML has been designed for ease of implementation and for interoperability with both SGML and HTML. Introduction Extensible Markup Language, abbreviated XML, describes a class of data objects called XML documents and partially describes the behavior of computer programs which process them. XML is an application profile or restricted form of SGML, the Standard Generalized Markup Language [ISO 8879]. By construction, XML documents are conforming SGML documents. XML documents are made up of storage units called entities, which contain either parsed or unparsed data. Parsed data is made up of characters, some of which form character data, and some of which form markup. Markup encodes a description of the document's storage layout and logical structure. XML provides a mechanism to impose constraints on the storage layout and logical structure. A software module called an XML processor is used to read XML documents and provide access to their content and structure. It is assumed that an XML processor is doing its work on behalf of another module, called the application. This specification describes the required behavior of an XML processor in terms of how it must read XML data and the information it must provide to the application. Origin and Goals XML was developed by an XML Working Group (originally known as the SGML Editorial Review Board) formed under the auspices of the World Wide Web Consortium (W3C) in 1996. It was chaired by Jon Bosak of Sun Microsystems with the active participation of an XML Special Interest Group (previously known as the SGML Working Group) also organized by the W3C. The membership of the XML Working Group is given in an appendix. Dan Connolly served as the WG's contact with the W3C. The design goals for XML are:
This specification, together with associated standards (Unicode and ISO/IEC 10646 for characters, Internet RFC 1766 for language identification tags, ISO 639 for language name codes, and ISO 3166 for country name codes), provides all the information necessary to understand XML Version 1.0 and construct computer programs to process it.
|
See XML at W3C (document date: 2/10/98).
Allows specification of significantly greater control over presentation with the creation of the >lt;STYLE> object. This allows control over object's font, color, text and the new BOX object. The style can be controlled both at the document and element level. A heirarchy defines how the objects inherit their properties. Most important is the ability to access and modify these object parameters in the script languages like VBScript and Javascript.
Adds things like:
Will (may) add:
Here is what WC3 says on XSL vs CSS 11/10/99:
CSS | XSL | |
Can be used with HTML? | yes | no |
Can be used with XML? | yes | yes |
Transformation language? | no | yes |
Syntax | CSS | XML |
The unique features are that CSS can be used to style HTML documents. XSL, on the other hand, is able to transform documents. For example, XSL can be used to transform XML data into HTML/CSS documents on the Web server. This way, the two languages complement each other and can be used together.
Both languages can be used to style XML documents.
CSS and XSL will use the same underlying formatting model and designers will therefore have access to the same formatting features in both languages. W3C will work hard to ensure that interoperable implementations of the formatting model is available.
A W3C Note on Using XSL and CSS together is available."
We are now attempting to resolve some of the issues of using HTML as a subset of SGML rather than HTML as a subset of XML. To make web things conform to XML, we have to reformat the HTML into XHTML by doing things like:
Note that all the discussion above reduces to:
For all the fuss and furor, as programmers, we have to be able to access all this marked up data in the programming languages (i.e., how can we create, access or modify the objects...
so, we get...
DHTML (Dynamic HTML) describes HTML pages with dynamic content. This just implies that HTML, CSS and the scripting languages (VBScript and Javascipt) operate together to create dynamic pages rather than static ones. There is no standard here -- it is just a phrase that denotes the interoperation of the three elements.
However, most important to the concept of DHTML is the ability to access the document elements from a scripting language. As we have seen, in order to talk about the document in the script languages we have to have a way to access and modify their properties. Thus far we have seen only a limited portion of the ways to access the HTML objects. To expose ALL the objects in a document to the script language, W3C proposed DOM (the Document Object Model) as a way to programmatically reference the document objects. See the current DOM recommendations at:
Used to indicate Server Side Includes are being added by the server to the HTML data stream shipped to the client. An "include" is a file or the output of a command that is "inserted" in the HTML as it is being sent to the client. SHTML is not a language, it is a prefix for the HTML file type that indicates "SSI" is being used.
A standard way to express syntax rules. Much of the above documentation is shown in BNF. See a short decription at: Th. Estier's BNF page or Lars Marius Garshol's page on EBNF (that is "E"xtended BNF).