/usr/share/doc/jed-common/html/jed020.html

<!DOCTYPE html>
<html >
<head>
<meta http-equiv="Content-Type" content="text/html; charset=US-ASCII">
<meta name="generator" content="hevea 2.28">
<link rel="stylesheet" type="text/css" href="jed.css">
<title>Eight Bit Clean Issues</title>
</head>
<body >
<a href="jed019.html"><img src="previous_motif.gif" alt="Previous"></a>
<a href="index.html"><img src="contents_motif.gif" alt="Up"></a>
<a href="jed021.html"><img src="next_motif.gif" alt="Next"></a>
<hr>
<h2 id="sec41" class="section">19&#XA0;&#XA0;Eight Bit Clean Issues</h2>
<h3 id="sec42" class="subsection">19.1&#XA0;&#XA0;Displaying Characters with the High Bit Set</h3>
<p>There are several issues to consider here. The most important issue is how
to get <span style="font-weight:bold">jed</span> to display 8 bit characters in a &#X201C;clean&#X201D; way. By
&#X201C;clean&#X201D; I mean any character with the high bit set is sent to the
display device as is. This is achieved by putting the line:</p><pre class="verbatim">      DISPLAY_EIGHT_BIT = 1;
</pre><p>in the <code>jed.rc</code> (<code>.jedrc</code>) startup file. European systems might
want to put this in the file <code>site.sl</code> for all users. The default is
1 so unless its value has been changed, this step may not be necessary.</p><p>There is another issue. Suppose you want to display 8 bit characters with
extended Ascii codes greater than or equal to some value, say 160. This
is done by putting <code>DISPLAY_EIGHT_BIT = 160;</code>. I believe that ISO
Latin character sets assume this. This is the default value for Unix and
VMS systems.</p>
<h3 id="sec43" class="subsection">19.2&#XA0;&#XA0;Inputting Characters with the hight bit Set</h3>
<p>
Inputting characters with the high bit set into <span style="font-weight:bold">jed</span> is another issue.
How <span style="font-weight:bold">jed</span> interprets this bit is controlled by the variable
<code>META_CHAR</code>. What happens is this: When <span style="font-weight:bold">jed</span> reads a character from
the input device with the high bit set, it:
</p><ol class="enumerate" type=1><li class="li-enumerate">
Checks the value of <code>META_CHAR</code>. If this value is -1, <span style="font-weight:bold">jed</span> simply
inserts the character into the buffer.</li><li class="li-enumerate">For any other value of <code>META_CHAR</code> in the range 0 to 255, <span style="font-weight:bold">jed</span>
returns two 7-bit characters. The first character returned is
<code>META_CHAR</code> itself. The next character returned is the original
character but with the high bit stripped.
</li></ol><p>The default value of <code>META_CHAR</code> is -1 which means that when <span style="font-weight:bold">jed</span> sees
a character with the high bit set, <span style="font-weight:bold">jed</span> leaves it as is. Please note that
a character with the high bit set it <em>cannot</em> be the prefix character
of a keymap. It can be a part of the keymap but not the prefix.</p><p>Some systems only handle 7-bit character sequences and as a result, <span style="font-weight:bold">jed</span>
will only see 7-bit characters. <span style="font-weight:bold">jed</span> is still able to insert <em>any</em>
character in the range 0-255 on a 7-bit system. This is done through the
use of the <code>quoted_insert</code> function which, by default, is bound to
the backquote key <span style="font-family:monospace">&#X2018;</span>. If the <code>quoted_insert</code> function is called
with a digit argument (repeat argument), the character with the value of
the argument is inserted into the buffer. Operationally, one hits
<span style="font-variant:small-caps">Esc</span>, enters the extended Ascii code and hits the backquote key. For
example, to insert character 255 into the buffer, simply press the
following five keys: <span style="font-variant:small-caps">Esc 2 5 5 &#X2018;</span>.</p>
<h3 id="sec44" class="subsection">19.3&#XA0;&#XA0;Upper Case - Lower Case Conversions</h3>
<p>The above discussion centers around input and output of characters with
the high bit set. How <span style="font-weight:bold">jed</span> treats them internally is another issue and new
questions arise. For example, what is the uppercase equivalent of a
character with ASCII code 231? This may vary from language to language.
Some languages even have characters whose uppercase equivalent correspond
to multiple characters. For <span style="font-weight:bold">jed</span>, the following assumptions have been
made:
</p><ul class="itemize"><li class="li-itemize">
Each character is only 8 bits.
</li><li class="li-itemize">Each character has a unique uppercase equivalent.
</li><li class="li-itemize">Each character has a unique lowercase equivalent.
</li></ul><p> 
It would be nice if a fourth assumption could be made:
</p><ul class="itemize"><li class="li-itemize"> 
The value of the lowercase of a character is greater than or equal to
its uppercase counterpart.
</li></ul><p> 
However, apparently this is not possible since most IBMPC character sets
violate this assumption. Hence, <span style="font-weight:bold">jed</span> does not assume it. Suppose <span style="font-family:monospace">X</span>
is the upper case value of some character and suppose <span style="font-family:monospace">Y</span> is its lower
case value. Then to make <span style="font-weight:bold">jed</span> aware of this fact and use it case
conversions, it may be necessary to put a statement of the form:
</p><pre class="verbatim">     define_case (X, Y);
</pre><p>in the startup file. For example, suppose 211 is the uppercase of 244.
Then, the line
</p><pre class="verbatim">      define_case (211, 244);
</pre><p>will make <span style="font-weight:bold">jed</span> use this fact in operations involving the case of a character.</p><p>This has already been done for the ISO Latin 1 character set. See the file
<code>iso-latin.sl</code> for details. For MSDOS, this will not work. Instead
use the files <code>dos437.sl</code> and <code>dos850.sl</code>. By default, <span style="font-weight:bold">jed</span>&#X2019;s
internal lookup tables are initialized to the ISO Latin set for Unix and
VMS systems and to the DOS 437 code page for the IBMPC. To change the
defaults, it is only necessary to load the appropriate file. For example,
to load <code>dos850.sl</code> definitions, put
</p><pre class="verbatim">      evalfile ("dos850"); pop ();
</pre><p>in the startup file (e.g., <code>site.sl</code>). In addition to
uppercase/lowercase information, these files also contain word
definitions, i.e., which characters constitute a &#X201C;word&#X201D;.</p>
<hr>
<a href="jed019.html"><img src="previous_motif.gif" alt="Previous"></a>
<a href="index.html"><img src="contents_motif.gif" alt="Up"></a>
<a href="jed021.html"><img src="next_motif.gif" alt="Next"></a>
</body>
</html>
jed-common 1:0.99.19-7 / usr / share / doc / jed-common / html / jed020.html