HTML
Click on the red underlined text to get to the source
... create hypertext documents that are portable from one platform to
another. HTML documents are SGML documents with generic semantics
...
... domains.
As HTML is an application of SGML, this specification assumes a
working knowledge of [SGML ...
...
HTML has been in use by the World-Wide Web (WWW) global information
initiative since 1990. Previously, informal documentation on HTML has
...
... HTML has been in use by the World-Wide Web (WWW) global information
initiative since 1990. Previously, informal documentation on HTML has
been available from a number of sources on the Internet. This
...
... Internet. This
specification brings together, clarifies, and formalizes a set of
features that roughly corresponds to the capabilities of HTML in
common use prior to June 1994. A number of new features to HTML are
...
... features that roughly corresponds to the capabilities of HTML in
common use prior to June 1994. A number of new features to HTML are
being proposed and experimented in the Internet community.
...
... Internet community.
This document thus defines a HTML 2.0 (to distinguish it from the
previous informal specifications). Future (generally upwardly
compatible) versions ...
... previous informal specifications). Future (generally upwardly
compatible) versions of HTML with new features will be released with
higher version numbers.
...
... version numbers.
HTML is an application of ISO Standard 8879:1986, "Information
Processing Text and Office Systems; Standard Generalized Markup
Language ...
... Processing Text and Office Systems; Standard Generalized Markup
Language" (SGML). The HTML Document Type Definition (DTD) is a formal
...
... MIME] called `text/html'. As such,
it defines the semantics of the HTML syntax and how that syntax
should be interpreted by user agents.
...
...
This specification governs the syntax of HTML documents and aspects
of the behavior of HTML user agents ...
... This specification governs the syntax of HTML documents and aspects
of the behavior of HTML user agents.
...
...
A document is a conforming HTML document if:
* It is a conforming SGML ...
... agrees with [ISO-10646]; that is, each code position listed
in 13, "The HTML Coded Character Set" is included, and each
code position in the document character set ...
... ISO-2022-JP'
character encoding scheme can be used for HTML
documents, since its repertoire is a subset of the
[ISO-10646] repertoire. The critical ...
... The HTML DTD defines a standard HTML document type and several
variations, by way of feature test entities. Feature test entities
are declarations in the HTML ...
... HTML document type and several
variations, by way of feature test entities. Feature test entities
are declarations in the HTML DTD that control the inclusion or
exclusion of portions of the DTD ...
... For example, in order to preserve the structure of a
document, an editing user agent may translate HTML
documents to the recommended subset, or it may require
that the documents be in the recommended subset for
import.
...
...
An HTML user agent conforms to this specification if:
...
... user agent conforms to this specification if:
* It parses the characters of an HTML document into data
characters and markup according to [SGML].
...
... processes each character in the ISO Latin Alphabet No. 1 as
specified in 6.1, "The HTML Document Character Set".
...
... Character Set".
NOTE - To support non-western writing systems, HTML
user agents are encouraged to support
...
... traverse, resources permitting) all hyperlinks from <A>
elements in an HTML document.
An HTML ...
...
* It allows the user to express all form field values
specified in an HTML document and to (attempt to) submit the
values as requests to information services.
...
... form data set
a sequence of name/value pairs; the names are given by
an HTML document and the values are given by a user.
HTML document ...
... HTML document and the values are given by a user.
HTML document
An SGML document conforming to this document type
definition ...
... A user agent that conforms to this specification except
for form processing. It may only process level 1 HTML
documents.
must
...
... DTD and SGML declaration --
of the HTML document type definition are provided in 9, "HTML Public
...
... Text".
The term "HTML" refers to both the document type defined here and the
markup language ...
...
An HTML document is an SGML document; that is, a sequence of
characters organized physically into a set of entities, and logically
...
... <p>Some text. <em>*wow*</em></p>
An HTML user agent should use the SGML declaration that is given in
...
... SGML declaration that is given in
9.5, "SGML Declaration for HTML". According to its document character
set, `*' refers to an asterisk character, `*'.
...
... The start symbol of the DTD grammar is HTML, and the productions are
given in the public text identified by `-//IETF//DTD ...
... elements are delimited explicitly by tags, while the
boundaries of others are inferred. The <HTML> element contains a
<HEAD> element ...
... HTML Lexical Syntax ...
... SGML specifies an abstract syntax and a reference concrete syntax.
Aside from certain quantities and capacities (e.g. the limit on the
length of a name), all HTML documents use the reference concrete
syntax. In particular, all markup characters are in the repertoire of
[ISO-646 ...
... document character set. Each reference to one of the general entities
defined in the HTML DTD maps to a single-character string.
...
... abc &# 60 def => "abc &# 60 def"
A useful technique for translating plain text to HTML is to replace
each '<', '&', and '>' by an entity reference or numeric character
...
... implemented inconsistently, and because they conflict
with techniques for reducing HTML to 7 bit ASCII for
...
... transport, they are deprecated in this version of HTML.
See 5.5.2.1, "Example and Listing: XMP, LISTING".
...
... elements such as headings, paragraphs, lists, character
highlighting, and links. Most HTML elements are identified in a
document as a start ...
... Anchors and character highlighting may be put inside other
constructs. See the HTML DTD, 9.1, "HTML DTD ...
...
NOTE - The SGML declaration for HTML specifies SHORTTAG YES, which
means that there are other valid syntaxes for tags ...
... hyphens. The length of a name is limited to 72 characters by the
`NAMELEN' parameter in the SGML declaration for HTML, 9.5, "SGML
Declaration for HTML ...
... SGML declaration (9.5, "SGML
Declaration for HTML") limits the length of an attribute value to
1024 characters.
...
...
To include comments in an HTML document, use a comment declaration. A
comment declaration consists of `<!' followed by zero or more
comments followed by `>'. Each comment starts ...
... ignored.
NOTE - Some historical HTML implementations incorrectly consider
any `>' character to be the termination of a comment.
...
... EN">
<HEAD>
<TITLE>HTML Comment Example</TITLE>
<!-- Id: html-sgml.sgm,v 1.5 1995/05/26 21:29:50 connolly Exp -->
<!-- another -- -- comment -->
...
... HTML Public Text Identifiers ...
...
To identify information as an HTML document conforming to this
specification, each document must start with one of the following
...
... entity does not begin
with a document type declaration, an HTML user agent should infer
the above document type ...
... "Strict HTML DTD" and 9.4, "Strict Level 1 HTML DTD". They refer to
the more structurally rigid definition of HTML ...
... Example HTML Document ...
... <H1>First Header</H1>
<P>This is a paragraph in the example HTML file. Keep in mind
that the title does not appear in the document text, but that
...
... <LI> Note that lists can be nested;
<LI> Whitespace may be used to assist in reading the
HTML source.
</UL>
<LI>Third item in an ordered list.
...
... <IMG SRC ="triangle.xbm" alt="Warning: ">
Be sure to read these <b>bold instructions</b>.
</BODY></HTML>
...
... HTML user agent allows users to interact with resources which have
HTML representations. At a minimum, it must allow users to examine
and navigate the content of HTML level 1 documents. HTML ...
... HTML representations. At a minimum, it must allow users to examine
and navigate the content of HTML level 1 documents. HTML user agents
...
... HTML representations. At a minimum, it must allow users to examine
and navigate the content of HTML level 1 documents. HTML user agents
should be able to preserve all formatting distinctions represented in
...
... user agents
should be able to preserve all formatting distinctions represented in
an HTML document, and be able to simultaneously present resources
referred to by IMG elements (they may ignore some formatting
...
... elements (they may ignore some formatting
distinctions or IMG resources at the request of the user). Level 2
HTML user agents should support form entry and submission.
...
... 1521(-> 2049draft | 2048(-> 4289 | 4288) | 2047draft | 2046draft | 2045draft)[MIME]) may be given to specify the character
encoding scheme used to represent the HTML document as a
sequence of octets. The default value is outside the
...
... HTML Document Representation ...
... entity with a content type of `text/html' represents an
HTML document, consisting of a single text entity. The `charset'
...
... interoperability between
implementations of various versions of HTML, the installed base of
HTML user agents ...
... versions of HTML, the installed base of
HTML user agents supports a superset of the HTML 2.0 language ...
... CRLF, that is, octets 13, 10.
In practice, HTML documents are frequently represented and
transmitted using an end of line convention that depends on the
conventions of the source of the document; frequently, that
...
... start and record end characters.
Since there is no ambiguity, HTML user agents are encouraged to infer
the missing record start ...
... start and end characters.
An HTML user agent should treat end of line in any of its variations
as a word space in all contexts ...
... as a word space in all contexts except preformatted text. Within
preformatted text, an HTML user agent should treat any of the three
common representations of end-of-line as starting ...
...
The head of an HTML document is an unordered collection of
information about the document. For example:
...
...
Every HTML document must contain a <TITLE> element.
...
... context. A short title, such as "Introduction" may be meaningless out
of context. A title such as "Introduction to HTML Elements" is more
appropriate.
...
... LINK elements may occur in the <HEAD> element of an HTML
document. It has the same attributes as the <A> element (see 5.7.3,
"Anchor ...
... meta-information --
how it may be used is up to the individual server
implementation and the HTML user agent.
...
...
The <NEXTID> element is included for historical reasons only. HTML
documents should not contain <NEXTID> elements.
...
... hint for the name to use for a new <A>
element when editing an HTML document. It should be distinct from all
NAME attribute values on <A> elements. For example:
...
... elements, <H1> through <H6>, denote section headings.
Although the order and occurrence of headings is not constrained by
the HTML DTD, documents should not skip levels (for example, from H1
to H3 ...
... WIDTH attribute specifies the maximum number of characters for a line
and allows the HTML user agent to select a suitable font and
indentation.
...
... NOTE - Constraints on the processing of <PRE>
content may limit or prevent the ability of the HTML
user agent to faithfully render phrase markup.
...
... line break plus a blank line.
* The horizontal tab character (code position 9 in the HTML
document character set) must be interpreted as the smallest
positive nonzero number of spaces which will leave the
...
... SGML]).
NOTE - In a previous draft of the HTML specification, the syntax
of <XMP> and <LISTING> elements allowed closing tags ...
... Since CDATA declared content has a number of unfortunate interactions
with processing techniques and tends to be used and implemented
inconsistently, HTML documents should not contain <XMP> nor <LISTING>
elements -- the <PRE> tag ...
...
HTML includes a number of list elements. They may be used in
combination; for example, a <OL> may be nested in an <LI> element ...
... large.
Unless the COMPACT attribute is present, an HTML user agent may leave
white space between successive DT, DD ...
... elements may be nested within the content of other phrase
elements; however, HTML user agents may render nested phrase elements
...
... information in advance in the link. For example, the
HTML user agent may chose a different rendering as a
function of the methods ...
... Image Resources").
HTML user agents may process the value of the ALT attribute as an
alternative to processing the image ...
... An HTML user agent should present the body of an HTML document as a
collection of typeset paragraphs and preformatted text. Except for
preformatted elements ...
... The HTML Document Character Set ...
... character set specified in 9.5, "SGML Declaration for
HTML" must be supported by HTML user agents. It includes the graphic
...
... SGML Declaration for
HTML" must be supported by HTML user agents. It includes the graphic
characters of Latin Alphabet No. 1, or simply Latin-1. Latin-1
...
... NOTE - To support non-western writing systems, a larger character
repertoire will be specified in a future version of HTML. The
document character set will be [ISO-10646 ...
... heterogeneous networks and operating systems. In the HTML document
character set only three control characters ...
... Tab, Carriage Return, and Line Feed (code positions 9, 13, and 10).
The HTML DTD references the Added Latin 1 entity set, to allow
...
... ISO Latin 1 Character Entity Set" for a table of the
"Added Latin 1" entities, and 13, "The HTML Coded Character Set" for
a table of the code positions of [ISO-8859-1 ...
... a table of the code positions of [ISO-8859-1] and the control
characters in the HTML document character set.
...
... In addition to general purpose elements such as paragraphs and lists,
HTML documents can express hyperlinks. An HTML user agent allows the
...
... elements such as paragraphs and lists,
HTML documents can express hyperlinks. An HTML user agent allows the
user to navigate these hyperlinks.
...
... a variety of information retrieval protocols to obtain an entity that
represents the resource, such as an HTML document. The fragment
identifier, if present, refers to some view on, or portion of the
resource.
...
...
An HTML user agent allows the user to navigate the content of the
document and request activation of hyperlinks denoted by <A>
...
... document and request activation of hyperlinks denoted by <A>
elements. HTML user agents should also allow activation of <LINK>
...
... address of the head anchor. If the
representation is another HTML document, navigation may begin again
with this new document.
...
... Typically, the ROWS and COLS attributes determine the visible
dimension of the field in characters. The field is typically rendered
in a fixed-width font. HTML user agents should allow text to extend
beyond these limits by scrolling as needed.
...
...
An HTML user agent begins processing a form by presenting the
document with the fields in their initial state ...
... some historical HTTP server implementations to exhibit defective
behavior. As a result, some HTML forms are written using
`METHOD=POST' even though the form submission has no side-effects ...
... HTML 2.0//EN">
<title>Sample of HTML Form Submission</title>
<H1>Sample Questionnaire</H1>
<P>Please fill out this questionnaire:
...
... HTML Public Text ...
... SGML Document Access (SDA) Parameter Entities =====-->
<!-- HTML 2.0 contains SGML Document Access (SDA) fixed attributes
in support of easy transformation to the International Committee ...
... Flows ======================-->
<![ %HTML.Forms [
<!ENTITY % block.forms "BLOCKQUOTE | FORM | ISINDEX">
...
... ENTITY % block.forms "BLOCKQUOTE">
<![ %HTML.Deprecated [
<!ENTITY % preformatted "PRE | XMP | LISTING">
...
... <!--========== Document Body ===================-->
<![ %HTML.Recommended [
<!ENTITY % body.content "(%heading|%block|HR|ADDRESS ...
... <!--======= Document Structure =================-->
<![ %HTML.Deprecated [
<!ENTITY % html.content "HEAD, BODY, PLAINTEXT ...
... >
<!-- <HTML> HTML Document -->
...
...
<!-- <HTML> HTML Document -->
...
... HTML DTD with the
`HTML.Recommended' entity defined as `INCLUDE' rather than IGNORE;
that is, it refers to the more structurally rigid definition of HTML ...
... HTML.Recommended' entity defined as `INCLUDE' rather than IGNORE;
that is, it refers to the more structurally rigid definition of HTML.
<!-- html-s.dtd
...
... HTML DTD with the
`HTML.Forms' entity defined as `IGNORE' rather than `INCLUDE'.
Documents which contain <FORM> elements ...
... Document Type Definition for the HyperText Markup Language
with Level 1 Extensions (HTML Level 1 DTD).
...
... HTML DTD with
the `HTML.Recommended' entity defined as `INCLUDE' rather than
IGNORE; that is, it refers to the more structurally rigid definition
...
... entity defined as `INCLUDE' rather than
IGNORE; that is, it refers to the more structurally rigid definition
of HTML.
<!-- html-1s.dtd
...
... -- catalog: SGML Open style entity catalog for HTML --
-- $Id: rfc1866.xml,v 1.1 2005/06/02 05:35:05 nicmila Exp $ --
...
...
The HTML DTD defines the following entities. They represent
particular graphic characters which have special meanings in places
...
... ISO Standard
8879:1986//ENTITIES Numeric and Special Graphic//EN'. However, HTML
does not include for the entire entity set -- only the entities
...
... %ISOlat1;
-->
<!-- Modified for use in HTML
$Id: rfc1866.xml,v 1.1 2005/06/02 05:35:05 nicmila Exp $ -->
<!ENTITY ...
...
The HTML document type was designed by Tim Berners-Lee at CERN as
part of the 1990 World Wide Web project. In 1992, Dan Connolly wrote
...
... part of the 1990 World Wide Web project. In 1992, Dan Connolly wrote
the HTML Document Type Definition (DTD) and a brief HTML ...
... Since 1993, a wide variety of Internet participants have contributed
to the evolution of HTML, which has included the addition of in-line
images introduced by the NCSA Mosaic software for WWW. Dave Raggett
...
... played an important role in deriving the forms material from the
HTML+ specification.
Dan Connolly and Karen Olson Muldrow rewrote the HTML ...
... HTML+ specification.
Dan Connolly and Karen Olson Muldrow rewrote the HTML Specification
in 1994. The document was then edited by the HTML working group ...
... Dan Connolly and Karen Olson Muldrow rewrote the HTML Specification
in 1994. The document was then edited by the HTML working group as a
whole, with updates being made by Eric Schieler, Mike Knezovich, and
...
...
Special thanks to the many active participants in the HTML working
group, too numerous to list individually, without whom there would be
no standards process and no standard. That this document approaches
...
... no standards process and no standard. That this document approaches
its objective of carefully converging a description of current
practice and formalization of HTML's relationship to SGML is a
tribute to their effort.
...
... The HTML Coded Character Set ...
...
This list details the code positions and characters of the HTML
document character set, specified in 9.5, "SGML Declaration for
...
... character set, specified in 9.5, "SGML Declaration for
HTML". This coded character set is based on [ISO-8859-1].
...
