language
Click on the red underlined text to get to the source
... The Hypertext Markup Language (HTML) is a markup language used to
create hypertext documents that are platform independent. Initially,
...
... ISO-8859-1 coded character set,
which is appropriate only for Western European languages. Despite
this restriction, HTML has been widely used with other languages ...
... languages. Despite
this restriction, HTML has been widely used with other languages,
using other coded character sets or character encodings ...
... character sets or character encodings, through
various ad hoc extensions to the language [TAKADA].
...
... valid application of SGML, while
enabling its use with all languages of the world.
The specific issues addressed are the SGML ...
... HTML is an application of ISO Standard 8879:1986, Information
Processing Text and Office Systems -- Standard Generalized Markup
Language (SGML) [ISO-8879]. The HTML Document ...
...
Language tags can be used to control rendering of a marked up
document in various ways: glyph disambiguation, in cases where the
character encoding ...
... quotation marks; hyphenation; ligatures; spacing; voice synthesis;
etc. Independently of rendering issues, language markup is useful as
content markup for purposes such as classification and searching.
...
... content markup for purposes such as classification and searching.
Since any text can logically be assigned a language, almost all HTML
elements ...
... unless there is a good reason not to do so.
The language attribute, LANG, takes as its value a language tag that
identifies a natural language ...
...
The language attribute, LANG, takes as its value a language tag that
identifies a natural language spoken, written, or otherwise conveyed
...
... language attribute, LANG, takes as its value a language tag that
identifies a natural language spoken, written, or otherwise conveyed
by human beings for communication of information to other human
beings. Computer languages ...
... language spoken, written, or otherwise conveyed
by human beings for communication of information to other human
beings. Computer languages are explicitly excluded.
The syntax and registry ...
... The syntax and registry of HTML language tags is the same as that
defined by RFC 1766(-> 3282draft | 3066(-> 4647 | 4646)) [RFC1766 ...
... defined by RFC 1766(-> 3282draft | 3066(-> 4647 | 4646)) [RFC1766]. In summary, a language tag is composed
of one or more parts: A primary language tag and a possibly empty
...
... RFC1766]. In summary, a language tag is composed
of one or more parts: A primary language tag and a possibly empty
series of subtags:
...
... tags are case-
insensitive. The namespace of language tags is administered by the
IANA. Example tags ...
... In the context of HTML, a language tag is not to be interpreted as a
single token, as per RFC 1766(-> 3282draft | 3066(-> 4647 | 4646)) ...
... 1766(-> 3282draft | 3066(-> 4647 | 4646)), but as a hierarchy. For example, a
user agent that adjusts rendering according to language should
consider that it has a match when a language tag in a style sheet
...
... user agent that adjusts rendering according to language should
consider that it has a match when a language tag in a style sheet
entry matches the initial portion of the language tag of an element ...
... consider that it has a match when a language tag in a style sheet
entry matches the initial portion of the language tag of an element.
An exact match should be preferred. This interpretation allows an
...
... 'plain' or 'international' English ("en").
NOTE -- using the language tag as a hierarchy does not imply that
all languages with a common prefix ...
... NOTE -- using the language tag as a hierarchy does not imply that
all languages with a common prefix will be understood by those
fluent in one or more of those languages ...
... languages with a common prefix will be understood by those
fluent in one or more of those languages; it simply allows the
user to request this commonality when it is true for that user.
...
... element and the
value (if any) of the HTTP Content-Language header. If none of these
are set, a suitable default, perhaps controlled by user preferences,
...
... Markup for language-dependent presentation ...
...
For the correct presentation of text in certain languages
(irrespective of formatting issues), some support in the form of
additional entities and elements ...
... default behaviour is not appropriate.
- Language-dependent rendering of short (in-line) quotations.
- Better justification control for languages ...
... Language-dependent rendering of short (in-line) quotations.
- Better justification control for languages where this is
important.
...
... important.
- Superscripts and subscripts for languages where they appear as
part of general text.
...
...
The phrase-level element Q is introduced to allow language-dependent
rendering of short quotations depending on language and platform
...
... element Q is introduced to allow language-dependent
rendering of short quotations depending on language and platform
capability. As the following examples show (rather poorly, because of
the character set ...
...
NOTE -- Q elements can be nested. Many languages use different
quotation styles for outer and inner quotations, and this should
be respected by user-agents ...
... this minimal level of support.
Many languages require superscript text for proper rendering: as an
example, the French "Mlle Dupont" should have "lle" in superscript.
The SUP element ...
... are restricted to PCDATA to avoid nesting problems.
Finally, in many languages text justification is much more important
than it is in Western languages, and justifies markup. The ALIGN
...
... Finally, in many languages text justification is much more important
than it is in Western languages, and justifies markup. The ALIGN
attribute, admitting values of LEFT, RIGHT, CENTER and JUSTIFY, is
added to a selection of elements ...
...
Many languages are written in horizontal lines from left to right,
while others are written from right to left. When both writing
directions are present, one talks of bidirectional ...
...
It is natural to expect input in any language in forms, as they
provide one of the only ways of obtaining user input. While this is
primarily a UI ...
...
Document Type Definition for the HyperText Markup Language,
extended for internationalisation (HTML ...
... ENTITY % HTML.Recommended "IGNORE"
-- Certain features of the language are necessary for
compatibility with widespread usage, but they may
...
... ENTITY % HTML.Deprecated "INCLUDE"
-- Certain features of the language are necessary for
compatibility with earlier versions ...
... elements --
"LANG NAME #IMPLIED -- RFC 1766(-> 3282draft | 3066(-> 4647 | 4646)) language tag --
DIR (ltr|rtl) #IMPLIED -- text directionnality --
ID ID #IMPLIED -- element ...
... ENTITY quot CDATA """ -- double quote -->
<!--Entities for language-dependent presentation (BIDI and
contextual analysis) -->
<!ENTITY ...
... <!-- <SPAN> Generic inline container -->
<!-- <SPAN DIR=...> New counterflow embedding -->
<!-- <SPAN LANG="..."> Language of contents -->
<!ELEMENT ...
...
<!-- <Q> Short quotation -->
<!-- <Q LANG=xx> Language of quotation is xx -->
<!-- <Q DIR=...> New conterflow embedding -->
...
...
<!-- <BDO DIR=...> Override directionality of text to value of DIR -->
<!-- <BDO LANG=...> Language of contents -->
<!ELEMENT ...
... Charset of destination (advisory) -->
<!-- <A LANG="..."> Language of contents btw <A> and </A> -->
<!-- <A DIR=...> Contents is a new counterflow embedding -->
...
... <!-- <IMG ALIGN=...> Position relative to text -->
<!-- <IMG LANG=...> Image contains "text" in that language -->
<!-- <IMG DIR=...> Inline image acts as a RTL or LTR
...
...
<!-- <P> Paragraph -->
<!-- <P LANG="..."> Language of paragraph text -->
<!-- <P DIR=...> Base directionality of paragraph -->
<!-- <P ALIGN=...> Paragraph alignment (justification) -->
...
... <!-- <PRE WIDTH=...> Maximum characters per line -->
<!-- <PRE DIR=...> Base direction of preformatted block -->
<!-- <PRE LANG=...> Language of contents -->
<