| dbpprop:abstract
|
- In addition to native character encodings, characters can also be encoded as character references, which are either numeric character references or character entity references. Character entity references are sometimes called named entities, or HTML entities in the context of HTML. HTML's usage of character references derives from SGML. Character entity references have the format &name; where "name" is a case-sensitive alphanumeric string. For example, the character 'λ' can be encoded as &lambda; in an HTML 4 document. The characters <, >, " and & are used to delimit tags, attribute values, and character references. The character entity references &lt;, &gt;, &quot; and &amp;, which are predefined in HTML, XML, and SGML, can be used instead to represent these characters literally. Numeric character references can be in decimal format, &#DD;, where DD is a variable-width string of decimal digits. Similarly there is a hexadecimal format, &#xHHHH;, where HHHH is a variable-width string of hexadecimal digits, though many consider it good practice to never use fewer than four hex digits and never an odd number of hex digits (because two hex digits correspond to one byte). Unlike named entities, hexadecimal character references are case-insensitive in HTML. For example, λ can also be represented as &#955;, &#x3BB; or &#X3bb;. Numeric references always refer to Universal Character Set code points, regardless of the page's encoding. Numeric references to UCS control code ranges are forbidden, with the exception of the linefeed, tab, and carriage return characters. That is, characters in the hexadecimal ranges 00–08, 0B–0C, 0E–1F, 7F, and 80–9F cannot be used in an HTML document, not even by reference, so "&#153;", for example, is not allowed. However, for backward compatibility with early HTML authors and browsers that ignored this restriction, raw characters and numeric character references in the 80–9F range are interpreted by some browsers as the characters mapped to bytes 80–9F in the Windows-1252 encoding. Unnecessary use of HTML character references may significantly reduce the readability of HTML source. If the character encoding for a web page is chosen appropriately, HTML character references are usually required only for a few special characters (or not at all if a native Unicode encoding like UTF-8 is used).
- Character references are special character sequences used to represent characters in an SGML or XML document. They begin with an ampersand and end with a semicolon. Character references are necessary for using the metacharacters of these languages ("<", ">", "&", """ and "'") as literal characters, and they are useful for rarely used characters or for characters that are difficult or impossible to type in an editor, or that cannot be used because of the character set or character encoding in effect. They are divided into numeric and named character references.
- A Character Entity Reference is an encoding of a single character as several characters from a more limited character set. The name Character Entity Reference is used in the document structuring technologies of the World Wide Web Consortium, such as HTML, XML and XHTML. An example: € is encoded as &euro; (the ampersand itself is encoded as &amp;). Editing the text of this page or viewing its source code shows how this works. All web pages in the world are made using one of these technologies, including Wikipedia. An overview of the characters supported by browsers can be found at: http://www.htmlhelp.com/reference/html40/entities/.
- The hypertext markup language HTML has been in use since 1991, but version 4.0 (1997) was the first in which the representation of characters outside ASCII (that is, outside English) was reasonably well standardized.
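- HTML appeared in 1991, but only with the release of version 4.0 in 1997 did it respond reasonably well to the question of internationalization. Before that, to ensure that everyone could read content correctly, a convention was needed for every character used outside the ASCII character set. This served two purposes: preserving the integrity of the content stored in HTML files, and allowing most browsers on the market to display the text correctly.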
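As a rough illustration of the reference formats described in the English abstract, the following Python sketch decodes the named, decimal and hexadecimal forms of 'λ' and escapes the four markup delimiters. It assumes the standard library's html module, which follows HTML5 parsing rules rather than HTML 4; in particular, its handling of &#153; reflects the Windows-1252 compatibility behavior mentioned above, not the HTML 4 prohibition. The sample strings are made up for the illustration.

```python
import html

# Named, decimal, and hexadecimal references for the same character.
# The hexadecimal form is case-insensitive (both "x" and the digits).
forms = ["&lambda;", "&#955;", "&#x3BB;", "&#X3bb;"]
decoded = [html.unescape(form) for form in forms]

# Named references are case-sensitive: &Lambda; is the capital letter.
upper = html.unescape("&Lambda;")

# The predefined references stand in for the markup delimiters.
escaped = html.escape('<a href="x">R&D</a>', quote=True)

# Code point 0x99 lies in the forbidden 80-9F range; html.unescape
# remaps it through Windows-1252, as HTML5 parsers do.
legacy = html.unescape("&#153;")

print(decoded, upper, escaped, legacy)
```

Because numeric references always name Universal Character Set code points, the decoded result does not depend on the encoding in which the page itself is stored.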
|
| rdfs:comment
|
- In addition to native character encodings, characters can also be encoded as character references, which can be numeric character references or character entity references. Character entity references are also sometimes referred to as named entities, or HTML entities for HTML. HTML's usage of character references derives from SGML. Character entity references have the format &name; where "name" is a case-sensitive alphanumeric string.
- Character references are special character sequences used to represent characters in an SGML or XML document. They begin with an ampersand and end with a semicolon.
- A Character Entity Reference is an encoding of a single character as several characters from a more limited character set. The name Character Entity Reference is used in the document structuring technologies of the World Wide Web Consortium, such as HTML, XML and XHTML. An example: € is encoded as &euro; (the ampersand itself is encoded as &amp;). Editing the text of this page or viewing its source code shows how this works.
- The hypertext markup language HTML has been in use since 1991, but version 4.0 (1997) was the first in which the representation of characters outside ASCII (that is, outside English) was reasonably well standardized.
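- HTML appeared in 1991, but only with the release of version 4.0 in 1997 did it respond reasonably well to the question of internationalization. Before that, to ensure that everyone could read content correctly, a convention was needed for every character used outside the ASCII character set. This served two purposes: preserving the integrity of the content stored in HTML files, and allowing most browsers on the market to display the text correctly.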
|