/usr/share/doc/texinfo/html/documentencoding.html is in texinfo-doc-nonfree 4.13a-1.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 | <html lang="en">
<head>
<title>documentencoding - GNU Texinfo 4.13</title>
<meta http-equiv="Content-Type" content="text/html">
<meta name="description" content="GNU Texinfo 4.13">
<meta name="generator" content="makeinfo 4.13">
<link title="Top" rel="start" href="index.html#Top">
<link rel="up" href="Internationalization.html#Internationalization" title="Internationalization">
<link rel="prev" href="documentlanguage.html#documentlanguage" title="documentlanguage">
<link href="http://www.gnu.org/software/texinfo/" rel="generator-home" title="Texinfo Homepage">
<!--
This manual is for GNU Texinfo (version 4.13, 18 September 2008),
a documentation system that can produce both online information and a
printed manual from a single source.
Copyright (C) 1988, 1990, 1991, 1992, 1993, 1995, 1996, 1997,
1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008
Free Software Foundation, Inc.
Permission is granted to copy, distribute and/or modify this
document under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software
Foundation; with no Invariant Sections, with the Front-Cover Texts
being ``A GNU Manual'', and with the Back-Cover Texts as in (a)
below. A copy of the license is included in the section entitled
``GNU Free Documentation License.''
(a) The FSF's Back-Cover Text is: ``You are free to copy and modify
this GNU Manual. Buying copies from GNU Press supports the FSF in
developing GNU and promoting software freedom.''
-->
<meta http-equiv="Content-Style-Type" content="text/css">
<style type="text/css"><!--
pre.display { font-family:inherit }
pre.format { font-family:inherit }
pre.smalldisplay { font-family:inherit; font-size:smaller }
pre.smallformat { font-family:inherit; font-size:smaller }
pre.smallexample { font-size:smaller }
pre.smalllisp { font-size:smaller }
span.sc { font-variant:small-caps }
span.roman { font-family:serif; font-weight:normal; }
span.sansserif { font-family:sans-serif; font-weight:normal; }
--></style>
</head>
<body>
<div class="node">
<a name="documentencoding"></a>
<p>
Previous: <a rel="previous" accesskey="p" href="documentlanguage.html#documentlanguage">documentlanguage</a>,
Up: <a rel="up" accesskey="u" href="Internationalization.html#Internationalization">Internationalization</a>
<hr>
</div>
<h3 class="section">18.2 <code>@documentencoding </code><var>enc</var>: Set Input Encoding</h3>
<p><a name="index-documentencoding-1047"></a><a name="index-Encoding_002c-declaring-1048"></a><a name="index-Input-encoding_002c-declaring-1049"></a><a name="index-Character-set_002c-declaring-1050"></a><a name="index-Document-input-encoding-1051"></a>
The <code>@documentencoding</code> command declares the input document
encoding. Write it on a line by itself, with a valid encoding
specification following, near the beginning of the file but after
<code>@setfilename</code> (see <a href="setfilename.html#setfilename"><code>@setfilename</code></a>):
<pre class="example"> @documentencoding <var>enc</var>
</pre>
<p>At present, Texinfo supports only these encodings:
<dl>
<dt><code>US-ASCII</code><dd>This has no particular effect, but it's included for completeness.
<dt><code>UTF-8</code><dd>The vast global character encoding, expressed in 8-bit bytes.
The Texinfo processors have no deep knowledge of Unicode; for the most
part, they just pass along the input they are given to the output.
<dt><code>ISO-8859-1</code><dt><code>ISO-8859-15</code><br><dt><code>ISO-8859-2</code><dd>These specify the standard encodings for Western European (the first
two) and Eastern European languages (the third), respectively. ISO
8859-15 replaces some little-used characters from 8859-1 (e.g.,
precomposed fractions) with more commonly needed ones, such as the
Euro symbol (€).
<p>A full description of the encodings is beyond our scope here;
one useful reference is <a href="http://czyborra.com/charsets/iso8859.html">http://czyborra.com/charsets/iso8859.html</a>.
<br><dt><code>koi8-r</code><dd>This is the commonly used encoding for the Russian language.
<br><dt><code>koi8-u</code><dd>This is the commonly used encoding for the Ukrainian language.
</dl>
<p>Specifying an encoding <var>enc</var> has the following effects:
<p><a name="index-g_t_002d_002denable_002dencoding-1052"></a><a name="index-Local-Variables_003a-section_002c-for-encoding-1053"></a><a name="index-Info-output_002c-and-encoding-1054"></a>In Info output, unless the option <samp><span class="option">--disable-encoding</span></samp> is given
to <samp><span class="command">makeinfo</span></samp>, a so-called `Local Variables' section
(see <a href="../emacs/File-Variables.html#File-Variables">File Variables</a>) is output
including <var>enc</var>. This allows Info readers to set the encoding
appropriately.
<pre class="example"> Local Variables:
coding: <var>enc</var>
End:
</pre>
<p>Also, in Info and plain text output (barring
<samp><span class="option">--disable-encoding</span></samp>), accent constructs and special
characters, such as <code>@'e</code>, are output as the actual 8-bit
character in the given encoding.
<p><a name="index-HTML-output_002c-and-encodings-1055"></a><a name="index-g_t_0040code_007bhttp_002dequiv_007d_002c-and-charset-specification-1056"></a><a name="index-g_t_0040code_007b_003cmeta_003e_007d-HTML-tag_002c-and-charset-specification-1057"></a>In HTML output, a ‘<samp><span class="samp"><meta></span></samp>’ tag is output, in the ‘<samp><span class="samp"><head></span></samp>’
section of the HTML, that specifies <var>enc</var>. Web servers and
browsers cooperate to use this information so the correct encoding is
used to display the page, if supported by the system.
<pre class="example"> <meta http-equiv="Content-Type" content="text/html;
charset=<var>enc</var>">
</pre>
<p>In split HTML output, if <samp><span class="option">--transliterate-file-names</span></samp> is
given (see <a href="HTML-Xref-8_002dbit-Character-Expansion.html#HTML-Xref-8_002dbit-Character-Expansion">HTML Xref 8-bit Character Expansion</a>), the names of HTML
files are formed by transliteration of the corresponding node names,
using the specified encoding.
<p>In XML and Docbook output, the given document encoding is written in
the output file as usual with those formats.
<p>In TeX output, the characters which are supported in the standard
Computer Modern fonts are output accordingly. (For example, this
means using constructed accents rather than precomposed glyphs.)
Using a missing character generates a warning message, as does
specifying an unimplemented encoding.
</body></html>
|