RFC-Ref is no longer maintained; use the RFC browser at: http://zvon.org/comp/r/ref-RFC.html
RFC 2070: Internationalization of the Hypertext Mar...

HTML



... The Hypertext Markup Language (HTML) is a markup language used to create ...
... create hypertext documents that are platform independent. Initially, the application of HTML on the World Wide Web was seriously restricted by its reliance on the ISO ...
... which is appropriate only for Western European languages. Despite this restriction, HTML has been widely used with other languages, using other coded character sets ...
... address the issue of the internationalization of HTML by extending the specification of HTML and giving additional recommendations for proper internationalization ...
... on multilingualism on the WWW [NICOL]. A foremost consideration is to make sure that HTML remains a valid application of SGML, while ...
... SGML document character set to be used for HTML, the proper treatment of the charset parameter associated with the "text/html" content type ...
... HTML has been in use by the World-Wide Web (WWW) global information initiative since 1990. This specification extends the capabilities of HTML 2.0 (RFC 1866 [historic; obsoleted by RFC 2854]), primarily by removing the restriction to the ...
... ISO-8859]. HTML is an application of ISO Standard 8879:1986, Information Processing Text and Office Systems -- Standard Generalized Markup Language ...
... Markup Language (SGML) [ISO-8879]. The HTML Document Type Definition (DTD) ...
... Type Definition (DTD) is a formal definition of the HTML syntax in terms of SGML. This specification amends the DTD ...
... SGML. This specification amends the DTD of HTML 2.0 in order to make it applicable to documents encompassing a character repertoire much ...
... conformant. Both formal and actual development of HTML are advancing very fast. The features described in this document are designed so that they can (and should) be added to other forms of HTML ...
... HTML are advancing very fast. The features described in this document are designed so that they can (and should) be added to other forms of HTML besides that described in RFC 1866 [historic; obsoleted by RFC 2854]. Where indicated, attributes introduced here should be ...
... This specification changes slightly the conformance requirements of HTML documents and HTML user agents. ...
... All HTML 2.0 conforming documents remain conforming with this specification. However, the extensions introduced here make valid ...
... specification. However, the extensions introduced here make valid certain documents that would not be HTML 2.0 conforming, in particular those containing characters or character references outside of the repertoire of ISO 8859-1 ...
... 1866 [historic; obsoleted by RFC 2854], the following requirements are placed on HTML user agents. ...
... user agents MUST correctly interpret the charset parameter accompanying an HTML document received from the network. ...
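
As a rough illustration of the charset requirement quoted above, the following Python sketch reads the charset parameter from a "text/html" Content-Type value and decodes the received bytes with it. The helper name `decode_html_body` and the ISO-8859-1 fallback are assumptions made for this example; they do not come from the RFC excerpts.

```python
from email.message import Message


def decode_html_body(content_type: str, body: bytes, default: str = "iso-8859-1") -> str:
    """Decode `body` using the charset parameter of a Content-Type value.

    Hypothetical helper: the fallback charset is an assumption of this sketch.
    """
    msg = Message()
    msg["Content-Type"] = content_type
    # get_content_charset() returns the charset parameter in lower case, or None.
    charset = msg.get_content_charset() or default
    return body.decode(charset)


# Bytes as they might arrive from the network, labelled as ISO-8859-5 (Cyrillic).
raw = "Привет".encode("iso-8859-5")
print(decode_html_body("text/html; charset=ISO-8859-5", raw))
```

Ignoring the label and decoding the same bytes as ISO-8859-1 would succeed without error but produce unrelated Latin characters, which is why the parameter has to be honoured.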


... This overview explains a reference processing model used for HTML, and in particular the SGML concept of a document character set ...
... SGML document, and it should be carefully distinguished from the document character set of the abstract HTML document. SGML views the characters as a single set (called a "character repertoire ...
... set. HTML, as an application of SGML, does not directly address the ...
... question of the external character encoding. This is deferred to mechanisms external to HTML, such as MIME as used by the HTTP ...
... 6. Similarly, if HTML documents are transferred by electronic mail, the external character encoding ...
... No mechanisms are currently standardized for indicating the external character encoding of HTML documents transferred by FTP or accessed in distributed file systems ...
... file systems. In the case any other way of transferring and storing HTML documents are defined or become popular, it is advised that similar provisions be made to clearly identify the character encoding ...
... character set specified in Section 2.2 before processing specific to SGML/HTML. The reference processing model can be depicted as follows: ...
... entity manager, the parser, and the application, as far as character semantics are concerned, are using the HTML document character set only. ...
... character set implies a change in the SGML declaration specified in the HTML 2.0 specification (section 9.5 of [RFC1866]). The change amounts to removing ...
... create non-conformance of any expression, construct or document that is conforming to HTML 2.0. It does make conforming certain constructs that are not admissible in HTML 2.0. One consequence is that data ...
... conforming to HTML 2.0. It does make conforming certain constructs that are not admissible in HTML 2.0. One consequence is that data characters outside the repertoire of ISO-8859-1, but within that of ...
... NOTE -- the above SGML declaration, like that of HTML 2.0, specifies the character numbers 128 to 159 (80 to 9F hex) as UNUSED. This means that numeric character references within that ...
... UNUSED. This means that numeric character references within that range (e.g. &#146;) are illegal in HTML. Neither ISO 8859-1 nor ISO 10646 ...
... control characters. Another change was made from the HTML 2.0 SGML declaration, in the belief that the latter did not express its authors' true intent. The ...
... identical with US-ASCII. In principle, this introduces an incompatibility with HTML 2.0, but in practice it should increase interoperability by i) having the SGML ...
... versions of ISO 646.IRV are not actually used to express HTML syntax. ISO 10646 ...
... character set that could take its place as the document character set for HTML. If nevertheless for a specific application there is a need to use characters outside this ...
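
The note quoted above about character numbers 128 to 159 being UNUSED can be checked mechanically. The sketch below is a minimal example, not a validator: it only recognises decimal references terminated by a semicolon, and the function name `unused_range_refs` is hypothetical.

```python
import re

# Decimal numeric character references of the form &#NNN; (an assumption of
# this sketch: other reference forms are not handled).
NUMERIC_REF = re.compile(r"&#([0-9]+);")


def unused_range_refs(markup: str) -> list[str]:
    """Return the numeric character references whose value lies in 128..159."""
    hits = []
    for match in NUMERIC_REF.finditer(markup):
        value = int(match.group(1))
        if 128 <= value <= 159:
            hits.append(match.group(0))
    return hits


print(unused_range_refs("It&#146;s &#233;t&#233; &#160;"))  # ['&#146;']
```

Only the reference to 146 is reported; 233 and 160 fall outside the UNUSED range.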


... Since any text can logically be assigned a language, almost all HTML elements admit the LANG attribute. The DTD ...
... elements in this version of HTML without the LANG attribute are BR, HR, BASE, NEXTID, and META. It is also intended that any new element ...
... element introduced in later versions of HTML will admit the LANG attribute, unless there is a good reason not to do so. ...
... The syntax and registry of HTML language tags is the same as that defined by RFC 1766 [obsoleted by RFC 3282 (draft standard) and RFC 3066, itself obsoleted by RFC 4647 and RFC 4646] ...
... In the context of HTML, a language tag is not to be interpreted as a single token ...


... semantics, where applicable, are identical to [UNICODE], and ii) where functionality is moved to HTML as a higher level protocol, this is done in a way that allows straightforward conversion to the lower-level mechanisms defined in [UNICODE ...
... this. It is also intended that any new element introduced in later versions of HTML will admit the DIR attribute, unless there is a good reason not to do so. ...
... NOTE -- RFC 1866 [historic; obsoleted by RFC 2854] section 4.2.2 specifies that an HTML user agent should treat an end of line as a word space, except in ...
... BIDI markup in the form of special-purpose formatting characters. This is also possible in HTML, which includes the five BIDI-related formatting characters (202A - 202E) of ISO 10646. As an alternative, ...
... formatting characters (202A - 202E) of ISO 10646. As an alternative, HTML provides equivalent SGML markup. ...
... from the parent element. The default directionality of the overall HTML document is left-to-right. On inline elements ...
... integrity, and alleviates some problems when editing bidirectional HTML text with a simple text editor, but some software may be more apt at using the 10646 characters. If both methods ...
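
For orientation, the five formatting characters in the range 202A - 202E mentioned above can be listed with Python's Unicode database. The character names printed here come from that database rather than from the excerpts, and `contains_bidi_formatting` is a hypothetical helper added for illustration.

```python
import unicodedata

# Code points 202A through 202E, the range quoted in the excerpt above.
BIDI_FORMATTING = [chr(cp) for cp in range(0x202A, 0x202E + 1)]

for ch in BIDI_FORMATTING:
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")


def contains_bidi_formatting(text: str) -> bool:
    """Report whether `text` uses any of the five BIDI formatting characters."""
    return any(ch in BIDI_FORMATTING for ch in text)
```

A check like this could help when deciding whether a document relies on the formatting characters or on the equivalent markup.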


... primarily a UI issue, there are some things that should be specified at the HTML level to guide behavior and promote interoperability. ...
... The HTML 2.0 form submission mechanism, based on the "application/x-www-form-urlencoded" media type, is ill-equipped with regard to ...
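
The excerpt above is cut off before it names the shortcoming. One well-known limitation of this media type, offered here only as an illustration, is that the encoded payload does not say which character encoding produced the escaped bytes. The field name and value below are made up for the example.

```python
from urllib.parse import parse_qs, urlencode

field = {"name": "José"}  # hypothetical form field

as_latin1 = urlencode(field, encoding="iso-8859-1")
as_utf8 = urlencode(field, encoding="utf-8")
print(as_latin1)  # name=Jos%E9
print(as_utf8)    # name=Jos%C3%A9

# A receiver that guesses the wrong encoding cannot recover the original text.
wrong_guess = parse_qs(as_latin1, encoding="utf-8", errors="replace")
print(wrong_guess)
```

The same text yields two different payloads, and nothing inside either payload identifies the encoding that was used.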


... In the case where a document is accessed from a hyperlink in an origin HTML document, a CHARSET attribute is added to the attribute list of elements ...
... followed (if any). When HTML text is transmitted directly in UCS-2 or UCS-4 form, the ...


... HTML Public Text ...
... HTML DTD ...
... This section contains a DTD for HTML based on the HTML 2.0 DTD of RFC 1866 [historic; obsoleted by RFC 2854] ...
... Markup Language, extended for internationalisation (HTML DTD) ...
... <!ENTITY % HTML.Version "-//IETF ...
... "-//IETF//DTD HTML i18n//EN" ...
... -- Typical usage: <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML ...
... HTML PUBLIC "-//IETF//DTD HTML i18n//EN"> <html> ...
... <!ENTITY % HTML.Recommended "IGNORE" -- Certain features of the language are necessary for ...
... --> <![ %HTML.Recommended [ <!ENTITY % HTML ...
... HTML.Recommended [ <!ENTITY % HTML.Deprecated "IGNORE"> ]]> ...
... <!ENTITY % HTML.Deprecated "INCLUDE" -- Certain features of the language are necessary for ...
... <!ENTITY % HTML.Highlighting "INCLUDE" -- Use this feature test entity to validate ...
... <!ENTITY % HTML.Forms "INCLUDE" -- Use this feature test entity to validate ...
... "ISO 8879-1986//ENTITIES Added Latin 1//EN//HTML"> %ISOlat1; ...
... SGML Document Access (SDA) Parameter Entities =====--> <!-- HTML contains SGML Document Access (SDA) fixed attributes in support of easy transformation to the International Committee ...
... <!--========== Text Markup =====================--> <![ %HTML.Highlighting [ <!ENTITY ...
... "> <![ %HTML.Recommended [ <!ENTITY % A.content "(%text)*" ...
... Flows ======================--> <![ %HTML.Forms [ <!ENTITY % block.forms "BLOCKQUOTE | FORM | ISINDEX"> ...
... ENTITY % block.forms "BLOCKQUOTE"> <![ %HTML.Deprecated [ <!ENTITY % preformatted "PRE | XMP | LISTING"> ...
... Language of contents --> <![ %HTML.Deprecated [ <!ENTITY ...
... <!--========== Document Body ===================--> <![ %HTML.Recommended [ <!ENTITY % body.content "(%heading|%block|HR|ADDRESS ...
... <!--======= Forms ====================--> <![ %HTML.Forms [ <!ELEMENT ...
... <!--======= Document Head ======================--> <![ %HTML.Recommended [ <!ENTITY % head.extra ""> ...
... <!--======= Document Structure =================--> <![ %HTML.Deprecated [ <!ENTITY % html.content "HEAD, BODY, PLAINTEXT ...
... <!ELEMENT HTML O O (%html.content)> <!ENTITY % version ...
... ENTITY % version.attr "VERSION CDATA #FIXED '%HTML.Version;'"> ...
... Version;'"> <!ATTLIST HTML %attrs; %version ...
... > <!-- <HTML> HTML Document --> ...
... SGML Declaration for HTML ...
... Markup Language version 2.x (HTML 2.x = HTML 2.0 + i18n). ...
... version 2.x (HTML 2.x = HTML 2.0 + i18n). -- ...
... These code values or the corresponding numeric character references must not be included when generating a new HTML document, and they should be ignored if encountered when processing a HTML document. ...
... generating a new HTML document, and they should be ignored if encountered when processing a HTML document. -- ...
... ISO Standard 8879:1986//ENTITIES Added Latin 1//EN. HTML includes the entire entity set, and adds entities for all missing characters in the right ...
... "ISO 8879-1986//ENTITIES Added Latin 1//EN//HTML"> %ISOlat1; --> ...


... Nebel, E., and L. Masinter, "Form-based File Upload in HTML", RFC 1867 [historic; obsoleted by RFC 2854], Xerox Corporation, November 1995. ...
... Raggett, D., "HTML Tables", RFC 1942 [historic; obsoleted by RFC 2854], W3C, May 1996. ...


