<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
<!-- This manual is for GNU Texinfo (version 6.1, 6 February 2016),
a documentation system that can produce both online information and a
printed manual from a single source using semantic markup.
Copyright (C) 1988, 1990, 1991, 1992, 1993, 1995, 1996, 1997,
1998, 1999, 2001, 2001, 2003, 2004, 2005, 2006, 2007, 2008, 2009,
2010, 2011, 2012, 2013, 2014, 2015, 2016 Free Software Foundation, Inc.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License, Version 1.3 or
any later version published by the Free Software Foundation; with no
Invariant Sections, with the Front-Cover Texts being "A GNU Manual",
and with the Back-Cover Texts as in (a) below. A copy of the license
is included in the section entitled "GNU Free Documentation
License".
(a) The FSF's Back-Cover Text is: "You have the freedom to copy and
modify this GNU manual. Buying copies from the FSF supports it in
developing GNU and promoting software freedom." -->
<!-- Created by GNU Texinfo 6.1, http://www.gnu.org/software/texinfo/ -->
<head>
<title>GNU Texinfo 6.1: Inserting Unicode</title>
<meta name="description" content="GNU Texinfo 6.1: Inserting Unicode">
<meta name="keywords" content="GNU Texinfo 6.1: Inserting Unicode">
<meta name="resource-type" content="document">
<meta name="distribution" content="global">
<meta name="Generator" content="makeinfo">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<link href="index.html#Top" rel="start" title="Top">
<link href="Command-and-Variable-Index.html#Command-and-Variable-Index" rel="index" title="Command and Variable Index">
<link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
<link href="Insertions.html#Insertions" rel="up" title="Insertions">
<link href="Breaks.html#Breaks" rel="next" title="Breaks">
<link href="Click-Sequences.html#Click-Sequences" rel="prev" title="Click Sequences">
<style type="text/css">
<!--
a.summary-letter {text-decoration: none}
blockquote.indentedblock {margin-right: 0em}
blockquote.smallindentedblock {margin-right: 0em; font-size: smaller}
blockquote.smallquotation {font-size: smaller}
div.display {margin-left: 3.2em}
div.example {margin-left: 3.2em}
div.lisp {margin-left: 3.2em}
div.smalldisplay {margin-left: 3.2em}
div.smallexample {margin-left: 3.2em}
div.smalllisp {margin-left: 3.2em}
kbd {font-style: oblique}
pre.display {font-family: inherit}
pre.format {font-family: inherit}
pre.menu-comment {font-family: serif}
pre.menu-preformatted {font-family: serif}
pre.smalldisplay {font-family: inherit; font-size: smaller}
pre.smallexample {font-size: smaller}
pre.smallformat {font-family: inherit; font-size: smaller}
pre.smalllisp {font-size: smaller}
span.nolinebreak {white-space: nowrap}
span.roman {font-family: initial; font-weight: normal}
span.sansserif {font-family: sans-serif; font-weight: normal}
ul.no-bullet {list-style: none}
-->
</style>
</head>
<body lang="en">
<a name="Inserting-Unicode"></a>
<div class="header">
<p>
Previous: <a href="Glyphs-for-Programming.html#Glyphs-for-Programming" accesskey="p" rel="prev">Glyphs for Programming</a>, Up: <a href="Insertions.html#Insertions" accesskey="u" rel="up">Insertions</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Command-and-Variable-Index.html#Command-and-Variable-Index" title="Index" rel="index">Index</a>]</p>
</div>
<hr>
<a name="Inserting-Unicode_003a-_0040U"></a>
<h3 class="section">12.10 Inserting Unicode: <code>@U</code></h3>
<a name="index-Unicode-character_002c-inserting"></a>
<a name="index-Code-point-of-Unicode-character_002c-inserting-by"></a>
<a name="index-U"></a>
<p>The command <code>@U{<var>hex</var>}</code> inserts a representation of the
Unicode character U+<var>hex</var>. For example, <code>@U{0132}</code>
inserts the Dutch ‘IJ’ ligature (poorly shown here as simply the two
letters ‘I’ and ‘J’).
</p>
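<p>As a minimal sketch of how this might look in a source file (the
sentence itself is invented for illustration):
</p>
<div class="example">
<pre class="example">The Dutch town of @U{0132}sselstein begins with a ligature.
</pre></div>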
<p>The <var>hex</var> value should be at least four hex digits; leading zeros
are <em>not</em> added for you. In general, <var>hex</var> must specify a valid
normal Unicode character; e.g., U+10FFFF (the very last code point) is
a noncharacter by definition, and thus cannot be inserted this way.
</p>
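<p>A minimal sketch of the digit rule (the words are invented for
illustration, and whether a particular character is supported in TeX
output is a separate question):
</p>
<div class="example">
<pre class="example">@c U+00E9 is padded to four digits rather than written as @U{E9}.
caf@U{00E9}
@c Code points above U+FFFF simply use five or six digits.
@U{1D11E}
</pre></div>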
<a name="index-ASCII_002c-source-document-portability-using"></a>
<p><code>@U</code> is useful for inserting occasional glyphs for which Texinfo
has no dedicated command, while allowing the Texinfo source to remain
purely 7-bit ASCII for maximum portability.
</p>
<a name="index-Unicode-and-TeX"></a>
<p>This command has many limitations—the same limitations as inserting
Unicode characters in UTF-8 or another binary form. First and most
importantly, TeX knows nothing about most of Unicode. Supporting
specific additional glyphs upon request is possible, but it’s not
viable for <samp>texinfo.tex</samp> to support whole additional scripts
(Japanese, Urdu, …). The <code>@U</code> command does nothing to
change this. If the specified character is not supported in TeX,
an error is given. (See <a href="_0040documentencoding.html#g_t_0040documentencoding"><code>@documentencoding</code></a>.)
</p>
<a name="index-Entity-reference-in-HTML-et-al_002e"></a>
<a name="index-_0026_0023xhex_003b_002c-output-from-_0040U"></a>
<p>In HTML, XML, and Docbook, the output from <code>@U</code> is always an
entity reference of the form ‘<samp>&amp;#x<var>hex</var>;</samp>’, as in
‘<samp>&amp;#x0132;</samp>’ for the example above. This should work even when an
HTML document uses some other encoding (say, Latin 1) and the
given character is not supported in that encoding.
</p>
<a name="index-UTF_002d8_002c-output-from-_0040U"></a>
<p>In Info and plain text, if the document encoding is specified
explicitly to be UTF-8, the output will be the UTF-8 representation of
the character U+<var>hex</var> (presuming it’s a valid character). In all
other cases, the output is the ASCII sequence ‘<samp>U+<var>hex</var></samp>’, as
in the six ASCII characters ‘<samp>U+0132</samp>’ for the example above.
</p>
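<p>As a sketch of the difference, assume a source file that declares its
encoding with <code>@documentencoding</code> (the sentence is invented for
illustration):
</p>
<div class="example">
<pre class="example">@documentencoding UTF-8

The ligature @U{0132} appears in Dutch.
</pre></div>
<p>With the declaration, the Info and plain text output should contain the
UTF-8 encoded character U+0132; with the declaration removed, the same
line should come out with the ASCII text ‘<samp>U+0132</samp>’ in its place.
</p>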
<p>That’s all. No magic!
</p>
<hr>
<div class="header">
<p>
Previous: <a href="Glyphs-for-Programming.html#Glyphs-for-Programming" accesskey="p" rel="prev">Glyphs for Programming</a>, Up: <a href="Insertions.html#Insertions" accesskey="u" rel="up">Insertions</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Command-and-Variable-Index.html#Command-and-Variable-Index" title="Index" rel="index">Index</a>]</p>
</div>
</body>
</html>