/usr/share/doc/festival-doc/html/Ngrams.html is in festival-doc 1:2.1~release-8.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 | <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
<!-- Created by GNU Texinfo 5.2, http://www.gnu.org/software/texinfo/ -->
<head>
<title>Festival Speech Synthesis System: Ngrams</title>
<meta name="description" content="Festival Speech Synthesis System: Ngrams">
<meta name="keywords" content="Festival Speech Synthesis System: Ngrams">
<meta name="resource-type" content="document">
<meta name="distribution" content="global">
<meta name="Generator" content="makeinfo">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<link href="index.html#Top" rel="start" title="Top">
<link href="Index.html#Index" rel="index" title="Index">
<link href="Index.html#SEC_Contents" rel="contents" title="Table of Contents">
<link href="Tools.html#Tools" rel="up" title="Tools">
<link href="Viterbi-decoder.html#Viterbi-decoder" rel="next" title="Viterbi decoder">
<link href="CART-trees.html#CART-trees" rel="prev" title="CART trees">
<style type="text/css">
<!--
a.summary-letter {text-decoration: none}
blockquote.smallquotation {font-size: smaller}
div.display {margin-left: 3.2em}
div.example {margin-left: 3.2em}
div.indentedblock {margin-left: 3.2em}
div.lisp {margin-left: 3.2em}
div.smalldisplay {margin-left: 3.2em}
div.smallexample {margin-left: 3.2em}
div.smallindentedblock {margin-left: 3.2em; font-size: smaller}
div.smalllisp {margin-left: 3.2em}
kbd {font-style:oblique}
pre.display {font-family: inherit}
pre.format {font-family: inherit}
pre.menu-comment {font-family: serif}
pre.menu-preformatted {font-family: serif}
pre.smalldisplay {font-family: inherit; font-size: smaller}
pre.smallexample {font-size: smaller}
pre.smallformat {font-family: inherit; font-size: smaller}
pre.smalllisp {font-size: smaller}
span.nocodebreak {white-space:nowrap}
span.nolinebreak {white-space:nowrap}
span.roman {font-family:serif; font-weight:normal}
span.sansserif {font-family:sans-serif; font-weight:normal}
ul.no-bullet {list-style: none}
-->
</style>
</head>
<body lang="en" bgcolor="#FFFFFF" text="#000000" link="#0000FF" vlink="#800080" alink="#FF0000">
<a name="Ngrams"></a>
<div class="header">
<p>
Next: <a href="Viterbi-decoder.html#Viterbi-decoder" accesskey="n" rel="next">Viterbi decoder</a>, Previous: <a href="CART-trees.html#CART-trees" accesskey="p" rel="prev">CART trees</a>, Up: <a href="Tools.html#Tools" accesskey="u" rel="up">Tools</a> [<a href="Index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Index.html#Index" title="Index" rel="index">Index</a>]</p>
</div>
<hr>
<a name="Ngrams-1"></a>
<h3 class="section">25.3 Ngrams</h3>
<a name="index-ngrams"></a>
<p>Bigram, trigrams, and general ngrams are used in the part
of speech tagger and the phrase break predicter. An Ngram
C++ Class is defined in the speech tools library and some simple
facilities are added within Festival itself.
</p>
<p>Ngrams may be built from files of tokens using the program
<code>ngram_build</code> which is part of the speech tools. See
the speech tools documentation for details.
</p>
<p>Within Festival ngrams may be named and loaded from files
and used when required. The LISP function <code>load_ngram</code>
takes a name and a filename as argument and loads the Ngram
from that file. For an example of its use once loaded see
<samp>src/modules/base/pos.cc</samp> or
<samp>src/modules/base/phrasify.cc</samp>.
</p>
</body>
</html>
|