This file is indexed.

/usr/lib/x86_64-linux-gnu/perl5/5.24/Lucy/Analysis/StandardTokenizer.pod is in liblucy-perl 0.3.3-7+b1.

This file is owned by root:root, with mode 0o644.

The actual contents of the file can be viewed below.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
# Auto-generated file -- DO NOT EDIT!!!!!

# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements.  See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License.  You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

=head1 NAME

Lucy::Analysis::StandardTokenizer - Split a string into tokens.

=head1 SYNOPSIS

    my $tokenizer = Lucy::Analysis::StandardTokenizer->new;

    # Then... once you have a tokenizer, put it into a PolyAnalyzer:
    my $polyanalyzer = Lucy::Analysis::PolyAnalyzer->new(
        analyzers => [ $case_folder, $tokenizer, $stemmer ], );



=head1 DESCRIPTION

Generically, "tokenizing" is a process of breaking up a string into an
array of "tokens".  For instance, the string "three blind mice" might be
tokenized into "three", "blind", "mice".

Lucy::Analysis::StandardTokenizer breaks up the text at the word
boundaries defined in Unicode Standard Annex #29. It then returns those
words that start with an alphabetic or numeric character.

=head1 CONSTRUCTORS

=head2 new()

    my $tokenizer = Lucy::Analysis::StandardTokenizer->new;

Constructor.  Takes no arguments.







=head1 INHERITANCE

Lucy::Analysis::StandardTokenizer isa L<Lucy::Analysis::Analyzer> isa L<Lucy::Object::Obj>.


=cut