/usr/share/perl/5.26.1/pod/perlopentut.pod is in perl-doc 5.26.1-6.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 | =encoding utf8
=head1 NAME
perlopentut - simple recipes for opening files and pipes in Perl
=head1 DESCRIPTION
Whenever you do I/O on a file in Perl, you do so through what in Perl is
called a B<filehandle>. A filehandle is an internal name for an external
file. It is the job of the C<open> function to make the association
between the internal name and the external name, and it is the job
of the C<close> function to break that association.
For your convenience, Perl sets up a few special filehandles that are
already open when you run. These include C<STDIN>, C<STDOUT>, C<STDERR>,
and C<ARGV>. Since those are pre-opened, you can use them right away
without having to go to the trouble of opening them yourself:
print STDERR "This is a debugging message.\n";
print STDOUT "Please enter something: ";
$response = <STDIN> // die "how come no input?";
print STDOUT "Thank you!\n";
while (<ARGV>) { ... }
As you see from those examples, C<STDOUT> and C<STDERR> are output
handles, and C<STDIN> and C<ARGV> are input handles. They are
in all capital letters because they are reserved to Perl, much
like the C<@ARGV> array and the C<%ENV> hash are. Their external
associations were set up by your shell.
You will need to open every other filehandle on your own. Although there
are many variants, the most common way to call Perl's open() function
is with three arguments and one return value:
C< I<OK> = open(I<HANDLE>, I<MODE>, I<PATHNAME>)>
Where:
=over
=item I<OK>
will be some defined value if the open succeeds, but
C<undef> if it fails;
=item I<HANDLE>
should be an undefined scalar variable to be filled in by the
C<open> function if it succeeds;
=item I<MODE>
is the access mode and the encoding format to open the file with;
=item I<PATHNAME>
is the external name of the file you want opened.
=back
Most of the complexity of the C<open> function lies in the many
possible values that the I<MODE> parameter can take on.
One last thing before we show you how to open files: opening
files does not (usually) automatically lock them in Perl. See
L<perlfaq5> for how to lock.
=head1 Opening Text Files
=head2 Opening Text Files for Reading
If you want to read from a text file, first open it in
read-only mode like this:
my $filename = "/some/path/to/a/textfile/goes/here";
my $encoding = ":encoding(UTF-8)";
my $handle = undef; # this will be filled in on success
open($handle, "< $encoding", $filename)
|| die "$0: can't open $filename for reading: $!";
As with the shell, in Perl the C<< "<" >> is used to open the file in
read-only mode. If it succeeds, Perl allocates a brand new filehandle for
you and fills in your previously undefined C<$handle> argument with a
reference to that handle.
Now you may use functions like C<readline>, C<read>, C<getc>, and
C<sysread> on that handle. Probably the most common input function
is the one that looks like an operator:
$line = readline($handle);
$line = <$handle>; # same thing
Because the C<readline> function returns C<undef> at end of file or
upon error, you will sometimes see it used this way:
$line = <$handle>;
if (defined $line) {
# do something with $line
}
else {
# $line is not valid, so skip it
}
You can also just quickly C<die> on an undefined value this way:
$line = <$handle> // die "no input found";
However, if hitting EOF is an expected and normal event, you do not want to
exit simply because you have run out of input. Instead, you probably just want
to exit an input loop. You can then test to see if an actual error has caused
the loop to terminate, and act accordingly:
while (<$handle>) {
# do something with data in $_
}
if ($!) {
die "unexpected error while reading from $filename: $!";
}
B<A Note on Encodings>: Having to specify the text encoding every time
might seem a bit of a bother. To set up a default encoding for C<open> so
that you don't have to supply it each time, you can use the C<open> pragma:
use open qw< :encoding(UTF-8) >;
Once you've done that, you can safely omit the encoding part of the
open mode:
open($handle, "<", $filename)
|| die "$0: can't open $filename for reading: $!";
But never use the bare C<< "<" >> without having set up a default encoding
first. Otherwise, Perl cannot know which of the many, many, many possible
flavors of text file you have, and Perl will have no idea how to correctly
map the data in your file into actual characters it can work with. Other
common encoding formats including C<"ASCII">, C<"ISO-8859-1">,
C<"ISO-8859-15">, C<"Windows-1252">, C<"MacRoman">, and even C<"UTF-16LE">.
See L<perlunitut> for more about encodings.
=head2 Opening Text Files for Writing
When you want to write to a file, you first have to decide what to do about
any existing contents of that file. You have two basic choices here: to
preserve or to clobber.
If you want to preserve any existing contents, then you want to open the file
in append mode. As in the shell, in Perl you use C<<< ">>" >>> to open an
existing file in append mode. C<<< ">>" >>> creates the file if it does not
already exist.
my $handle = undef;
my $filename = "/some/path/to/a/textfile/goes/here";
my $encoding = ":encoding(UTF-8)";
open($handle, ">> $encoding", $filename)
|| die "$0: can't open $filename for appending: $!";
Now you can write to that filehandle using any of C<print>, C<printf>,
C<say>, C<write>, or C<syswrite>.
As noted above, if the file does not already exist, then the append-mode open
will create it for you. But if the file does already exist, its contents are
safe from harm because you will be adding your new text past the end of the
old text.
On the other hand, sometimes you want to clobber whatever might already be
there. To empty out a file before you start writing to it, you can open it
in write-only mode:
my $handle = undef;
my $filename = "/some/path/to/a/textfile/goes/here";
my $encoding = ":encoding(UTF-8)";
open($handle, "> $encoding", $filename)
|| die "$0: can't open $filename in write-open mode: $!";
Here again Perl works just like the shell in that the C<< ">" >> clobbers
an existing file.
As with the append mode, when you open a file in write-only mode,
you can now write to that filehandle using any of C<print>, C<printf>,
C<say>, C<write>, or C<syswrite>.
What about read-write mode? You should probably pretend it doesn't exist,
because opening text files in read-write mode is unlikely to do what you
would like. See L<perlfaq5> for details.
=head1 Opening Binary Files
If the file to be opened contains binary data instead of text characters,
then the C<MODE> argument to C<open> is a little different. Instead of
specifying the encoding, you tell Perl that your data are in raw bytes.
my $filename = "/some/path/to/a/binary/file/goes/here";
my $encoding = ":raw :bytes"
my $handle = undef; # this will be filled in on success
And then open as before, choosing C<<< "<" >>>, C<<< ">>" >>>, or
C<<< ">" >>> as needed:
open($handle, "< $encoding", $filename)
|| die "$0: can't open $filename for reading: $!";
open($handle, ">> $encoding", $filename)
|| die "$0: can't open $filename for appending: $!";
open($handle, "> $encoding", $filename)
|| die "$0: can't open $filename in write-open mode: $!";
Alternately, you can change to binary mode on an existing handle this way:
binmode($handle) || die "cannot binmode handle";
This is especially handy for the handles that Perl has already opened for you.
binmode(STDIN) || die "cannot binmode STDIN";
binmode(STDOUT) || die "cannot binmode STDOUT";
You can also pass C<binmode> an explicit encoding to change it on the fly.
This isn't exactly "binary" mode, but we still use C<binmode> to do it:
binmode(STDIN, ":encoding(MacRoman)") || die "cannot binmode STDIN";
binmode(STDOUT, ":encoding(UTF-8)") || die "cannot binmode STDOUT";
Once you have your binary file properly opened in the right mode, you can
use all the same Perl I/O functions as you used on text files. However,
you may wish to use the fixed-size C<read> instead of the variable-sized
C<readline> for your input.
Here's an example of how to copy a binary file:
my $BUFSIZ = 64 * (2 ** 10);
my $name_in = "/some/input/file";
my $name_out = "/some/output/flie";
my($in_fh, $out_fh, $buffer);
open($in_fh, "<", $name_in)
|| die "$0: cannot open $name_in for reading: $!";
open($out_fh, ">", $name_out)
|| die "$0: cannot open $name_out for writing: $!";
for my $fh ($in_fh, $out_fh) {
binmode($fh) || die "binmode failed";
}
while (read($in_fh, $buffer, $BUFSIZ)) {
unless (print $out_fh $buffer) {
die "couldn't write to $name_out: $!";
}
}
close($in_fh) || die "couldn't close $name_in: $!";
close($out_fh) || die "couldn't close $name_out: $!";
=head1 Opening Pipes
To be announced.
=head1 Low-level File Opens via sysopen
To be announced. Or deleted.
=head1 SEE ALSO
To be announced.
=head1 AUTHOR and COPYRIGHT
Copyright 2013 Tom Christiansen.
This documentation is free; you can redistribute it and/or modify it under
the same terms as Perl itself.
|