ISO
Click on the red underlined text to get to the source
... HTML on the World Wide Web was seriously
restricted by its reliance on the ISO-8859-1 coded character set,
which is appropriate only for Western European languages ...
... 1866hist(-> 2854)), primarily by removing the restriction to the
ISO-8859-1 coded character set [ISO-8859].
...
...
HTML is an application of ISO Standard 8879:1986, Information
Processing Text and Office Systems -- Standard Generalized Markup
Language (SGML ...
... applicable to documents encompassing a character repertoire much
larger than that of ISO-8859-1, while still remaining SGML
conformant.
...
... HTML 2.0 conforming, in
particular those containing characters or character references
outside of the repertoire of ISO 8859-1, and those containing markup
introduced herein.
...
...
To ensure interoperability and proper support for at least ISO-
8859-1 in an environment where character encoding schemes other
...
... 8859-1 in an environment where character encoding schemes other
than ISO-8859-1 are present, user agents MUST correctly interpret
the charset parameter ...
... user-agents MUST at least parse correctly
all numeric character references within the range of ISO 10646-1
[ISO-10646].
...
... Character Set (UCS) of ISO 10646:1993 [ISO-10646], as amended.
Currently, this is code-by-code identical with the Unicode ...
...
NOTE -- implementers should be aware that ISO 10646 is amended
from time to time; 4 amendments have been adopted since the
initial 1993 publication, none of which significantly affects this
...
... Registration Number 177//CHARSET
ISO/IEC 10646-1:1993 UCS-4 with implementation level 3
//ESC ...
... that are not admissible in HTML 2.0. One consequence is that data
characters outside the repertoire of ISO-8859-1, but within that of
UCS-4 become valid ...
... range (e.g. ’) are illegal in HTML. Neither ISO 8859-1 nor
ISO 10646 contain characters in that range ...
... HTML. Neither ISO 8859-1 nor
ISO 10646 contain characters in that range, which is reserved for
control characters ...
... belief that the latter did not express its authors' true intent. The
syntax character set declaration was changed from ISO 646.IRV:1983 to
the newer ISO 646.IRV:1991, the latter, but not the former, being
...
... character set declaration was changed from ISO 646.IRV:1983 to
the newer ISO 646.IRV:1991, the latter, but not the former, being
identical with US-ASCII. In principle, this introduces an
...
... character set. The characters that differ between the two
versions of ISO 646.IRV are not actually used to express HTML syntax.
...
... HTML syntax.
ISO 10646-1:1993 is the most encompassing character set currently
existing, and there is no other character set ...
...
or future versions of ISO 10646, i.e. by assigning these characters
to a private zone of the UCS-4 coding space [ISO-10646 ...
...
With the document character set being the full ISO 10646, the
possibility that a character cannot be displayed due to lack of
appropriate resources (fonts) cannot be avoided. Because there are
...
... According to the suggestion of section 14 of [RFC1866], the set of
Latin-1 entities is extended to cover the whole right part of ISO-
8859-1 (all code positions with the high-order bit set), including
...
... user-agent implementers. It is present in many character
sets (including the whole ISO 8859 series and, of course, ISO
10646), and can always be included by means of the reference
­. Its semantics ...
... implementers. It is present in many character
sets (including the whole ISO 8859 series and, of course, ISO
10646), and can always be included by means of the reference
­. Its semantics are different from the plain HYPHEN: it
...
... This is also possible in HTML, which includes the five BIDI-related
formatting characters (202A - 202E) of ISO 10646. As an alternative,
HTML provides equivalent SGML ...
... characters LEFT-TO-RIGHT EMBEDDING (202A) and RIGHT-TO-LEFT
EMBEDDING (202B) of ISO 10646. The end tag of the element is
...
... directional properties. It is equivalent to using the LEFT-TO-RIGHT
OVERRIDE (202D) or RIGHT-TO-LEFT OVERRIDE (202E) characters of ISO
10646, the end tag again being equivalent to the POP DIRECTIONAL
...
... elements (including BDO) concurrently with the use of the
corresponding ISO 10646 formatting characters.
Preferably one or the other should be used exclusively; the markup
...
... internationalization. In fact, since URLs are restricted to ASCII
characters, the mechanism is akward even for ISO-8859-1 text.
Section 2.2 of [RFC1738] specifies that octets may be encoded using
...
... Content-Type"
CONTENT="text/html; charset=ISO-2022-JP">
This is not foolproof, but will work if the encoding ...
... established network byte order for two- and four-byte quantities, to
the ISO 10646 requirement and Unicode recommendation for serialized
...
... transformation format of ISO 10646:1993 (registered by IANA as ISO-
10646-UTF-1), has been removed from ISO 10646 ...
... voice synthesis. For more information on
SDA & ICADD:
- ISO 12083:1993, Annex A.8, Facilities for Braille,
large print and computer voice
...
... Registration Number 177//CHARSET
ISO/IEC 10646-1:1993 UCS-4 with
implementation level 3//ESC ...
... 160 2147483486 160
--
In ISO 10646, the positions with hexadecimal
values 0000D800 - 0000DFFF, used in the UTF-16
encoding of UCS ...
... 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 127
BASESET "ISO 646IRV:1991//CHARSET
International Reference Version ...
... the Added Latin 1 entity set, along with its name, syntax for use,
and description. This list is derived from ISO Standard
8879:1986//ENTITIES Added Latin 1//EN. HTML ...
... entity set, and adds entities for all missing characters in the right
part of ISO-8859-1.
<!-- (C) International Organization for Standardization ...
... conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
-->
<!-- Character entity ...
... ISO 639:1988. International standard -- Code for the representation of the names of languages. Technical content in <http://www.sil.org/sgml/iso639a.html> ...
... ISO 8859. International standard -- Information pro- cessing -- 8-bit single-byte coded graphic character sets -- Part 1: Latin alphabet No. 1 (1987) -- Part 2: Latin alphabet No. 2 (1987) -- Part 3: Latin alphabet No. 3 (1988) -- Part
4: Latin alphabet No. 4 (1988) -- Part 5: Latin/Cyrillic alphabet (1988) -- Part 6: Latin/Arabic alphabet (1987) -- Part :
Latin/Greek alphabet (1987) -- Part 8: Latin/Hebrew alphabet (1988) -- Part 9: Latin alphabet No. 5 (1989) -- Part 10: Latin
alphabet No. 6 (1992) ...
... ISO 8879:1986. International standard -- Information processing -- Text and office systems -- Standard gen- eralized markup language (SGML). ...
... ISO/IEC 10646-1:1993. International standard -- Infor- mation technology -- Universal multiple-octet coded character Sset (UCS) -- Part 1: Architecture and basic multilingual plane. ...
