/usr/share/unicode/auxiliary/WordBreakTest.html is in unicode-data 6.1.0-1.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 | <!DOCTYPE HTML PUBLIC '-//W3C//DTD HTML 4.01 Transitional//EN' 'http://www.w3.org/TR/html4/loose.dtd'>
<html><head><meta http-equiv='Content-Type' content='text/html; charset=utf-8'>
<title>Word Break Chart</title>
<style type='text/css'>
td, th { vertical-align: top }
</style></head>
<body bgcolor='#FFFFFF'>
<h2>Word Break Chart</h2>
<p><b>Unicode Version:</b> 6.1.0</p>
<p><b>Date:</b> 2011-12-11, 18:27:27 GMT</p>
<p>This page illustrates the application of the boundary specification for Words. The material here is informative, not normative.</p> <p>The first chart shows where breaks would appear between different sample characters or strings. The sample characters are chosen mechanically to represent the different properties used by the specification.</p><p>Each cell shows the break-status for the position between the character(s) in its row header and the character(s) in its column header. The × symbol indicates no break, while the ÷ symbol indicated a break. The cells with × are also shaded to make it easier to scan the table. For example, in the cell at the intersection of the row headed by 'CR' and the column headed by 'LF', there is a × symbol, indicating that there is no break between CR and LF.</p>
<p>After the heavy blue line in the table are additional rows, either with different sample characters or for sequences, such as "ALetter MidLetter" in WordBreak. Some column headers may be composed, reflecting 'treat as' or 'ignore' rules.</p>
<p>If your browser handles titles (tool tips), then hovering the mouse over the row header will show a sample character of that type. Hovering over a column header will show the sample character, plus its abbreviated general category and script. Hovering over the intersected cells shows the rule number that produces the break-status. For example, in GraphemeBreakTest, hovering over the cell at the intersection of LVT and T shows ×, with the rule 8.0. Checking below, the rule 8.0 is '( LVT | T) × T', which is the one that applies to that case. Note that a rule is invoked only when no lower-numbered rules have applied.</p>
<table border='1' cellspacing='0' width='100%'><tr><th width='7%'></th><th width='7%' class='lbclass' title='U+0001 <START OF HEADING>, gc=Cc, sc=Zyyy'>Other</th><th width='7%' class='lbclass' title='U+000D <CARRIAGE RETURN (CR)>, gc=Cc, sc=Zyyy'>CR</th><th width='7%' class='lbclass' title='U+000A <LINE FEED (LF)>, gc=Cc, sc=Zyyy'>LF</th><th width='7%' class='lbclass' title='U+000B <LINE TABULATION>, gc=Cc, sc=Zyyy'>Newline</th><th width='7%' class='lbclass' title='U+3031 VERTICAL KANA REPEAT MARK, gc=Lm, sc=Zyyy'>Katakana</th><th width='7%' class='lbclass' title='U+0041 LATIN CAPITAL LETTER A, gc=Lu, sc=Latn'>ALetter</th><th width='7%' class='lbclass' title='U+003A COLON, gc=Po, sc=Zyyy'>MidLetter</th><th width='7%' class='lbclass' title='U+002C COMMA, gc=Po, sc=Zyyy'>MidNum</th><th width='7%' class='lbclass' title='U+0027 APOSTROPHE, gc=Po, sc=Zyyy'>MidNumLet</th><th width='7%' class='lbclass' title='U+0030 DIGIT ZERO, gc=Nd, sc=Zyyy'>Numeric</th><th width='7%' class='lbclass' title='U+005F LOW LINE, gc=Pc, sc=Zyyy'>ExtendNumLet</th><th width='7%' class='lbclass' title='U+00AD SOFT HYPHEN, gc=Cf, sc=Zyyy'>Format_FE</th><th width='7%' class='lbclass' title='U+0300 COMBINING GRAVE ACCENT, gc=Mn, sc=Zinh'>Extend_FE</th></tr>
<tr><th class='lbclass' title='U+0001 <START OF HEADING>'>Other</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+000D <CARRIAGE RETURN (CR)>'>CR</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+000A <LINE FEED (LF)>'>LF</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+000B <LINE TABULATION>'>Newline</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th><th title='3.1' class='pairItem'>÷</th></tr>
<tr><th class='lbclass' title='U+3031 VERTICAL KANA REPEAT MARK'>Katakana</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='13.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='13.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0041 LATIN CAPITAL LETTER A'>ALetter</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='13.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+003A COLON'>MidLetter</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+002C COMMA'>MidNum</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0027 APOSTROPHE'>MidNumLet</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0030 DIGIT ZERO'>Numeric</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='10.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='8.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='13.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+005F LOW LINE'>ExtendNumLet</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='13.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='13.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='13.2' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='13.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+00AD SOFT HYPHEN'>Format_FE</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0300 COMBINING GRAVE ACCENT'>Extend_FE</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><td bgcolor='#0000FF' colSpan='14' style='font-size: 1px'> </td></tr>
<tr><th class='lbclass' title='U+0061 LATIN SMALL LETTER A, U+2060 WORD JOINER'>ALetter Format_FE</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='5.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='9.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='13.1' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0061 LATIN SMALL LETTER A, U+003A COLON'>ALetter MidLetter</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='7.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0061 LATIN SMALL LETTER A, U+0027 APOSTROPHE'>ALetter MidNumLet</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='7.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0061 LATIN SMALL LETTER A, U+0027 APOSTROPHE, U+2060 WORD JOINER'>ALetter MidNumLet Format_FE</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='7.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0061 LATIN SMALL LETTER A, U+002C COMMA'>ALetter MidNum</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0031 DIGIT ONE, U+003A COLON'>Numeric MidLetter</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0031 DIGIT ONE, U+0027 APOSTROPHE'>Numeric MidNumLet</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='11.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0031 DIGIT ONE, U+002C COMMA'>Numeric MidNum</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='11.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
<tr><th class='lbclass' title='U+0031 DIGIT ONE, U+002E FULL STOP, U+2060 WORD JOINER'>Numeric MidNumLet Format_FE</th><th title='999.0' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='3.2' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='999.0' class='pairItem'>÷</th><th title='11.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='999.0' class='pairItem'>÷</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th><th title='4.0' bgcolor='#CCCCFF' class='pairItem'>×</th></tr>
</table>
<h3><a name='rules'>Rules</a></h3>
<p>This section shows the rules. They are mechanically modified for programmatic generation of the tables and test code, and thus do not match the UAX rules precisely. In particular:</p><ol><li>The rules are cast into a form that is more like regular expressions.</li><li>The rules "sot ÷ <i>or</i> ×", "÷ eot", and "÷ Any" are added mechanically, and have artificial numbers.</li><li>The rules are given decimal numbers, so rules such as 11a are given a number using tenths, such as 11.1.</li><li>Any 'treat as' or 'ignore' rules are handled as discussed in the UAX, and thus reflected in a transformation of the rules usually not visible here. Where it does show up, an extra variable like CM* or FE* may appear, and the rule may be recast. In addition, final rules like "Any ÷ Any" may be recast as the equivalent expression "÷ Any".</li><li>Where a rule has multiple parts (lines), each one is numbered using hundredths, such as 21.01) × BA, 21.02) × HY,... In some cases, the numbering and form of a rule is changed due to 'treat as' rules.</li></ol><p>For the original rules, see the UAX.</p>
<ul style='list-style-type: none'>
<li>0.2) sot ÷</li>
<li>0.3) ÷ eot</li>
<li>3.0) CR × LF</li>
<li>3.1) (Newline | CR | LF) ÷</li>
<li>3.2) ÷ (Newline | CR | LF)</li>
<li>4.0) [^ Newline CR LF ] × [Format Extend]</li>
<li>5.0) ALetter × ALetter</li>
<li>6.0) ALetter × (MidLetter | MidNumLet) ALetter</li>
<li>7.0) ALetter (MidLetter | MidNumLet) × ALetter</li>
<li>8.0) Numeric × Numeric</li>
<li>9.0) ALetter × Numeric</li>
<li>10.0) Numeric × ALetter</li>
<li>11.0) Numeric (MidNum | MidNumLet) × Numeric</li>
<li>12.0) Numeric × (MidNum | MidNumLet) Numeric</li>
<li>13.0) Katakana × Katakana</li>
<li>13.1) (ALetter | Numeric | Katakana | ExtendNumLet) × ExtendNumLet</li>
<li>13.2) ExtendNumLet × (ALetter | Numeric | Katakana)</li>
<li>999.0) ÷ Any</li>
</ul>
<h3><a name='samples'>Sample Strings</a></h3>
<p>The following samples illustrate the application of the rules. The blue lines indicate possible break points. If your browser supports titles (tool-tips), then positioning the mouse over each character will show its name, while positioning between characters shows the rule number of the rule responsible for the break-status.</p>
<ol>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0063 LATIN SMALL LETTER C (ALetter)'>c</span><span title='5.0'><span> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='5.0'><span> </span> </span><span title='U+006E LATIN SMALL LETTER N (ALetter)'>n</span><span title='6.0'><span> </span> </span><span title='U+0027 APOSTROPHE (MidNumLet)'>'</span><span title='7.0'><span> </span> </span><span title='U+0074 LATIN SMALL LETTER T (ALetter)'>t</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0063 LATIN SMALL LETTER C (ALetter)'>c</span><span title='5.0'><span> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='5.0'><span> </span> </span><span title='U+006E LATIN SMALL LETTER N (ALetter)'>n</span><span title='6.0'><span> </span> </span><span title='U+2019 RIGHT SINGLE QUOTATION MARK (MidNumLet)'>’</span><span title='7.0'><span> </span> </span><span title='U+0074 LATIN SMALL LETTER T (ALetter)'>t</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='5.0'><span> </span> </span><span title='U+0062 LATIN SMALL LETTER B (ALetter)'>b</span><span title='4.0'><span> </span> </span><span title='U+00AD SOFT HYPHEN (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+0062 LATIN SMALL LETTER B (ALetter)'>b</span><span title='5.0'><span> </span> </span><span title='U+0079 LATIN SMALL LETTER Y (ALetter)'>y</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0024 DOLLAR SIGN (Other)'>$</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+002D HYPHEN-MINUS (Other)'>-</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0033 DIGIT THREE (Numeric)'>3</span><span title='8.0'><span> </span> </span><span title='U+0034 DIGIT FOUR (Numeric)'>4</span><span title='12.0'><span> </span> </span><span title='U+002C COMMA (MidNum)'>,</span><span title='11.0'><span> </span> </span><span title='U+0035 DIGIT FIVE (Numeric)'>5</span><span title='8.0'><span> </span> </span><span title='U+0036 DIGIT SIX (Numeric)'>6</span><span title='8.0'><span> </span> </span><span title='U+0037 DIGIT SEVEN (Numeric)'>7</span><span title='12.0'><span> </span> </span><span title='U+002E FULL STOP (MidNumLet)'>.</span><span title='11.0'><span> </span> </span><span title='U+0031 DIGIT ONE (Numeric)'>1</span><span title='8.0'><span> </span> </span><span title='U+0034 DIGIT FOUR (Numeric)'>4</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0025 PERCENT SIGN (Other)'>%</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (ALetter)'>b</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0033 DIGIT THREE (Numeric)'>3</span><span title='10.0'><span> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0063 LATIN SMALL LETTER C (ALetter)'>c</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+006E LATIN SMALL LETTER N (ALetter)'>n</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='6.0'><span> </span> </span><span title='U+0027 APOSTROPHE (MidNumLet)'>'</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='7.0'><span> </span> </span><span title='U+0074 LATIN SMALL LETTER T (ALetter)'>t</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0063 LATIN SMALL LETTER C (ALetter)'>c</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+006E LATIN SMALL LETTER N (ALetter)'>n</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='6.0'><span> </span> </span><span title='U+2019 RIGHT SINGLE QUOTATION MARK (MidNumLet)'>’</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='7.0'><span> </span> </span><span title='U+0074 LATIN SMALL LETTER T (ALetter)'>t</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+0062 LATIN SMALL LETTER B (ALetter)'>b</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='4.0'><span> </span> </span><span title='U+00AD SOFT HYPHEN (Format_FE)'>□</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+0062 LATIN SMALL LETTER B (ALetter)'>b</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='5.0'><span> </span> </span><span title='U+0079 LATIN SMALL LETTER Y (ALetter)'>y</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0024 DOLLAR SIGN (Other)'>$</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+002D HYPHEN-MINUS (Other)'>-</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0033 DIGIT THREE (Numeric)'>3</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='8.0'><span> </span> </span><span title='U+0034 DIGIT FOUR (Numeric)'>4</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='12.0'><span> </span> </span><span title='U+002C COMMA (MidNum)'>,</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='11.0'><span> </span> </span><span title='U+0035 DIGIT FIVE (Numeric)'>5</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='8.0'><span> </span> </span><span title='U+0036 DIGIT SIX (Numeric)'>6</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='8.0'><span> </span> </span><span title='U+0037 DIGIT SEVEN (Numeric)'>7</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='12.0'><span> </span> </span><span title='U+002E FULL STOP (MidNumLet)'>.</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='11.0'><span> </span> </span><span title='U+0031 DIGIT ONE (Numeric)'>1</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='8.0'><span> </span> </span><span title='U+0034 DIGIT FOUR (Numeric)'>4</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0025 PERCENT SIGN (Other)'>%</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0062 LATIN SMALL LETTER B (ALetter)'>b</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
<li><font size='5'>
<span title='0.2'><span style='border-right: 1px solid blue'> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='999.0'><span style='border-right: 1px solid blue'> </span> </span><span title='U+0033 DIGIT THREE (Numeric)'>3</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='10.0'><span> </span> </span><span title='U+0061 LATIN SMALL LETTER A (ALetter)'>a</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='4.0'><span> </span> </span><span title='U+2060 WORD JOINER (Format_FE)'>□</span><span title='0.3'><span style='border-right: 1px solid blue'> </span> </span>
</font></li>
</ol>
<hr width='50%'>
<div align='center'>
<center>
<table cellspacing='0' cellpadding='0' border='0'>
<tr>
<td><a href='http://www.unicode.org/unicode/copyright.html'>
<img src='http://www.unicode.org/img/hb_notice.gif' border='0' alt='Access to Copyright and terms of use' width='216' height='50'></a></td>
</tr>
</table>
<script language='Javascript' type='text/javascript' src='http://www.unicode.org/webscripts/lastModified.js'>
</script>
</center>
</div>
|