RFC-Ref is no longer maintained; use the RFC browser at: http://zvon.org/comp/r/ref-RFC.html
RFC 2070:Internationalization of the Hypertext Mar...
RFC-Ref

ISO



... HTML on the World Wide Web was seriously restricted by its reliance on the ISO-8859-1 coded character set, which is appropriate only for Western European languages ...
... 1866hist(-> 2854)), primarily by removing the restriction to the ISO-8859-1 coded character set [ISO-8859]. ...
... HTML is an application of ISO Standard 8879:1986, Information Processing Text and Office Systems -- Standard Generalized Markup Language (SGML ...
... applicable to documents encompassing a character repertoire much larger than that of ISO-8859-1, while still remaining SGML conformant. ...
... HTML 2.0 conforming, in particular those containing characters or character references outside of the repertoire of ISO 8859-1, and those containing markup introduced herein. ...
... To ensure interoperability and proper support for at least ISO-8859-1 in an environment where character encoding schemes other ...
... 8859-1 in an environment where character encoding schemes other than ISO-8859-1 are present, user agents MUST correctly interpret the charset parameter ...
... user-agents MUST at least parse correctly all numeric character references within the range of ISO 10646-1 [ISO-10646]. ...
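The excerpts above require user agents to interpret the charset parameter correctly. A minimal sketch of reading that parameter from a Content-Type header value follows, using Python's standard `email.message.Message`. The function name `charset_of` and the fallback to ISO-8859-1 when no parameter is present are assumptions made for illustration, not something the excerpts specify.

```python
from email.message import Message


def charset_of(content_type: str, default: str = "iso-8859-1") -> str:
    """Return the charset parameter of a Content-Type value, lowercased.

    `default` is an assumed fallback for headers without a charset
    parameter; the excerpts do not state which value to fall back to.
    """
    msg = Message()
    msg["Content-Type"] = content_type
    # get_content_charset() returns the parameter lowercased, or the
    # fallback object when the parameter is absent.
    return msg.get_content_charset(default)


print(charset_of("text/html; charset=ISO-2022-JP"))
print(charset_of("text/html"))
```

Charset names are case-insensitive, so comparing the lowercased form avoids treating `ISO-2022-JP` and `iso-2022-jp` as different encodings.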


... Content-Type: text/html; charset=ISO-2022-JP The term "charset ...
... Character Set (UCS) of ISO 10646:1993 [ISO-10646], as amended. Currently, this is code-by-code identical with the Unicode ...
... NOTE -- implementers should be aware that ISO 10646 is amended from time to time; 4 amendments have been adopted since the initial 1993 publication, none of which significantly affects this ...
... them with the following declaration: BASESET "ISO Registration Number 177//CHARSET ...
... Registration Number 177//CHARSET ISO/IEC 10646-1:1993 UCS-4 with implementation level 3 //ESC ...
... that are not admissible in HTML 2.0. One consequence is that data characters outside the repertoire of ISO-8859-1, but within that of UCS-4 become valid ...
... range (e.g. &#146;) are illegal in HTML. Neither ISO 8859-1 nor ISO 10646 contain characters in that range ...
... HTML. Neither ISO 8859-1 nor ISO 10646 contain characters in that range, which is reserved for control characters ...
... belief that the latter did not express its authors' true intent. The syntax character set declaration was changed from ISO 646.IRV:1983 to the newer ISO 646.IRV:1991, the latter, but not the former, being ...
... character set declaration was changed from ISO 646.IRV:1983 to the newer ISO 646.IRV:1991, the latter, but not the former, being identical with US-ASCII. In principle, this introduces an ...
... character set. The characters that differ between the two versions of ISO 646.IRV are not actually used to express HTML syntax. ...
... HTML syntax. ISO 10646-1:1993 is the most encompassing character set currently existing, and there is no other character set ...
... or future versions of ISO 10646, i.e. by assigning these characters to a private zone of the UCS-4 coding space [ISO-10646 ...
... With the document character set being the full ISO 10646, the possibility that a character cannot be displayed due to lack of appropriate resources (fonts) cannot be avoided. Because there are ...
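Several excerpts above concern numeric character references: they must be parsed across the ISO 10646 range, yet references into a range reserved for control characters are illegal. The sketch below checks a single decimal reference. The excerpt elides the exact bounds of the illegal range; the code assumes it is 128 to 159, the positions that ISO 8859-1 and ISO 10646 reserve for control characters. The names `C1_CONTROLS` and `check_reference` are hypothetical.

```python
import re

# Matches a decimal numeric character reference such as "&#233;".
_NCR = re.compile(r"&#([0-9]+);")

# Assumption: the illegal range elided in the excerpt is 128..159,
# the control-character positions.
C1_CONTROLS = range(128, 160)


def check_reference(ref: str) -> str:
    """Classify a decimal numeric character reference."""
    match = _NCR.fullmatch(ref)
    if match is None:
        return "not a decimal numeric character reference"
    code = int(match.group(1))
    if code in C1_CONTROLS:
        return "illegal: reserved for control characters"
    return "ok"


for ref in ("&#233;", "&#146;", "&eacute;"):
    print(ref, "->", check_reference(ref))
```

References like `&#146;` usually come from text authored in a vendor-specific 8-bit encoding, which is why a checker of this kind flags them rather than mapping them to a character.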


... According to the suggestion of section 14 of [RFC1866], the set of Latin-1 entities is extended to cover the whole right part of ISO-8859-1 (all code positions with the high-order bit set), including ...
... user-agent implementers. It is present in many character sets (including the whole ISO 8859 series and, of course, ISO 10646), and can always be included by means of the reference &shy;. Its semantics ...
... implementers. It is present in many character sets (including the whole ISO 8859 series and, of course, ISO 10646), and can always be included by means of the reference &shy;. Its semantics are different from the plain HYPHEN: it ...
... This is also possible in HTML, which includes the five BIDI-related formatting characters (202A - 202E) of ISO 10646. As an alternative, HTML provides equivalent SGML ...
... characters LEFT-TO-RIGHT EMBEDDING (202A) and RIGHT-TO-LEFT EMBEDDING (202B) of ISO 10646. The end tag of the element is ...
... directional properties. It is equivalent to using the LEFT-TO-RIGHT OVERRIDE (202D) or RIGHT-TO-LEFT OVERRIDE (202E) characters of ISO 10646, the end tag again being equivalent to the POP DIRECTIONAL ...
... elements (including BDO) concurrently with the use of the corresponding ISO 10646 formatting characters. Preferably one or the other should be used exclusively; the markup ...
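The excerpts above describe markup as equivalent to the five BIDI formatting characters (202A - 202E), with the end tag corresponding to the POP DIRECTIONAL character. The sketch below shows the character-based form of that equivalence. The helper `wrap` and its `"ltr"`/`"rtl"` arguments are hypothetical names chosen for illustration; the character names in the comments are the standard Unicode names for these code positions.

```python
import unicodedata

# The five BIDI formatting characters, code positions 202A through 202E.
LRE, RLE, PDF, LRO, RLO = (chr(code) for code in range(0x202A, 0x202F))


def wrap(text: str, direction: str, override: bool = False) -> str:
    """Wrap text in an embedding (or override) character and a closing pop."""
    if direction not in ("ltr", "rtl"):
        raise ValueError("direction must be 'ltr' or 'rtl'")
    if override:
        opener = LRO if direction == "ltr" else RLO
    else:
        opener = LRE if direction == "ltr" else RLE
    # PDF (202C) plays the role that the end tag plays in the markup form.
    return opener + text + PDF


for char in (LRE, RLE, PDF, LRO, RLO):
    print(f"{ord(char):04X}", unicodedata.name(char))
```

As the last excerpt advises, a document should use either the markup or these characters, not both at once.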


... internationalization. In fact, since URLs are restricted to ASCII characters, the mechanism is awkward even for ISO-8859-1 text. Section 2.2 of [RFC1738] specifies that octets may be encoded using ...
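To illustrate why octet encoding in URLs is awkward for non-ASCII text, the sketch below percent-encodes the same string under two character encodings using Python's `urllib.parse.quote`. The sample word and the choice of UTF-8 as the second encoding are assumptions for illustration only.

```python
from urllib.parse import quote

text = "café"  # sample text; 'é' lies outside ASCII

# The same character becomes different octets, and therefore different
# URL text, depending on which character encoding is assumed.
as_latin1 = quote(text, encoding="iso-8859-1")
as_utf8 = quote(text, encoding="utf-8")

print(as_latin1)
print(as_utf8)
```

Nothing in the encoded URL records which encoding produced the octets, so the receiver cannot tell the two results apart without outside knowledge.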


... Content-Type" CONTENT="text/html; charset=ISO-2022-JP"> This is not foolproof, but will work if the encoding ...
... established network byte order for two- and four-byte quantities, to the ISO 10646 requirement and Unicode recommendation for serialized ...
... UCS. The UTF-1 transformation format of ISO 10646:1993 (registered by IANA as ISO- ...
... transformation format of ISO 10646:1993 (registered by IANA as ISO-10646-UTF-1), has been removed from ISO 10646 ...
... ISO-10646-UTF-1), has been removed from ISO 10646 by amendment 4, and should not be used. ...


... <!ENTITY % ISOlat1 PUBLIC "ISO 8879-1986//ENTITIES Added Latin 1//EN//HTML"> ...
... voice synthesis. For more information on SDA & ICADD: - ISO 12083:1993, Annex A.8, Facilities for Braille, large print and computer voice ...
... <!SGML "ISO 8879:1986" -- SGML ...
... CHARSET BASESET "ISO Registration Number 177//CHARSET ...
... Registration Number 177//CHARSET ISO/IEC 10646-1:1993 UCS-4 with implementation level 3//ESC ...
... 160 2147483486 160 -- In ISO 10646, the positions with hexadecimal values 0000D800 - 0000DFFF, used in the UTF-16 encoding of UCS ...
... 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 127 BASESET "ISO 646IRV:1991//CHARSET International Reference Version ...
... ISO Latin 1 entity set ...
... the Added Latin 1 entity set, along with its name, syntax for use, and description. This list is derived from ISO Standard 8879:1986//ENTITIES Added Latin 1//EN. HTML ...
... entity set, and adds entities for all missing characters in the right part of ISO-8859-1. <!-- (C) International Organization for Standardization ...
... conforming SGML systems and applications as defined in ISO 8879, provided this notice is included in all copies. --> <!-- Character entity ...
... <!ENTITY % ISOlat1 PUBLIC "ISO 8879-1986//ENTITIES Added Latin 1//EN//HTML"> ...


... ISO 639:1988. International standard -- Code for the representation of the names of languages. Technical content in <http://www.sil.org/sgml/iso639a.html> ...
... ISO 8859. International standard -- Information processing -- 8-bit single-byte coded graphic character sets -- Part 1: Latin alphabet No. 1 (1987) -- Part 2: Latin alphabet No. 2 (1987) -- Part 3: Latin alphabet No. 3 (1988) -- Part 4: Latin alphabet No. 4 (1988) -- Part 5: Latin/Cyrillic alphabet (1988) -- Part 6: Latin/Arabic alphabet (1987) -- Part 7: Latin/Greek alphabet (1987) -- Part 8: Latin/Hebrew alphabet (1988) -- Part 9: Latin alphabet No. 5 (1989) -- Part 10: Latin alphabet No. 6 (1992) ...
... ISO 8879:1986. International standard -- Information processing -- Text and office systems -- Standard generalized markup language (SGML). ...
... ISO/IEC 10646-1:1993. International standard -- Information technology -- Universal multiple-octet coded character Set (UCS) -- Part 1: Architecture and basic multilingual plane. ...
... ISO/IEC 10646-1:1993 AMENDMENT 2 (1996). UCS Transformation Format 8 (UTF-8). ...


