5.16.1

Other versions:
5.42.0
5.40.0
5.38.0
5.36.0
5.34.0
5.32.0
5.30.0
5.26.1
5.24.1
5.22.1
5.20.1
5.18.1
5.14.1
5.12.1
5.10.1
5.10.0

編集

変更履歴

誤訳の報告

原文を表示/隠す

名前¶

perlopentut - tutorial on opening things in Perl

perlopentut - Perl でいろんなものを開くためのチュートリアル

説明¶

Perl has two simple, built-in ways to open files: the shell way for convenience, and the C way for precision. The shell way also has 2- and 3-argument forms, which have different semantics for handling the filename. The choice is yours.

Perl には、ファイルを開くための 2 つの単純な組み込みの手段があります: 利便性のためのシェル風の方法と、正確性のための C 風の方法です。シェル風の方法には 2 引数と 3 引数があり、ファイル名の扱いに関して異なった動作をします。選択はあなた次第です。

シェル風に開く¶

Perl's open function was designed to mimic the way command-line redirection in the shell works. Here are some basic examples from the shell:

Perl の open 関数は、シェルでのコマンドラインのリダイレクトをまねて設計されています。以下はシェルでの基本的な例です:

    $ myprogram file1 file2 file3
    $ myprogram    <  inputfile
    $ myprogram    >  outputfile
    $ myprogram    >> outputfile
    $ myprogram    |  otherprogram 
    $ otherprogram |  myprogram

And here are some more advanced examples:

そして以下はもう少し高度な例です:

    $ otherprogram      | myprogram f1 - f2
    $ otherprogram 2>&1 | myprogram -
    $ myprogram     <&3
    $ myprogram     >&4

Programmers accustomed to constructs like those above can take comfort in learning that Perl directly supports these familiar constructs using virtually the same syntax as the shell.

上述のような方法に慣れているプログラマにとっては、Perl がシェルと事実上同じ文法を使った親しんでいる構造に直接対応していることは学ぶのが容易になります。

単純に開く¶

The open function takes two arguments: the first is a filehandle, and the second is a single string comprising both what to open and how to open it. open returns true when it works, and when it fails, returns a false value and sets the special variable $! to reflect the system error. If the filehandle was previously opened, it will be implicitly closed first.

open 関数は 2 つの引数を取ります: 1 つめはファイルハンドルで、 2 つめは何を開くかとどう開くかで構成される単一の文字列です。 open は成功すると真を返し、失敗すると偽を返して特殊変数 $! にシステムエラーを反映します。指定されたファイルハンドルが以前に開かれていた場合は、暗黙の内にまず閉じられます。

For example:

例えば:

    open(INFO,      "datafile") || die("can't open datafile: $!");
    open(INFO,   "<  datafile") || die("can't open datafile: $!");
    open(RESULTS,">  runstats") || die("can't open runstats: $!");
    open(LOG,    ">> logfile ") || die("can't open logfile:  $!");

If you prefer the low-punctuation version, you could write that this way:

句読点が少ない方が好みなら、以下のようにも書けます:

    open INFO,   "<  datafile"  or die "can't open datafile: $!";
    open RESULTS,">  runstats"  or die "can't open runstats: $!";
    open LOG,    ">> logfile "  or die "can't open logfile:  $!";

A few things to notice. First, the leading < is optional. If omitted, Perl assumes that you want to open the file for reading.

いくつか気がつくことがあります。まず、先頭の < は省略可能です。省略されると、Perl はファイルを読み込みのために開きたいと仮定します。

Note also that the first example uses the || logical operator, and the second uses or, which has lower precedence. Using || in the latter examples would effectively mean

最初の例は || 論理演算子を使っていて、二つめの例はより優先順位の低い or を使っていることにも注意してください。後者の例で || を使うと、実際には以下のような意味になり

    open INFO, ( "<  datafile"  || die "can't open datafile: $!" );

which is definitely not what you want.

あなたが望んでいるのと全く違うことになります。

The other important thing to notice is that, just as in the shell, any whitespace before or after the filename is ignored. This is good, because you wouldn't want these to do different things:

他の注意するべき重要なこととしては、シェルと同様、ファイル名の前後の空白は無視されることです。これはよいことです; なぜなら、以下のものが違うことをすることは望まないだろうからです:

    open INFO,   "<datafile"   
    open INFO,   "< datafile" 
    open INFO,   "<  datafile"

Ignoring surrounding whitespace also helps for when you read a filename in from a different file, and forget to trim it before opening:

周りの空白を無視することは、ファイル名を別のファイルから読み込んで、開く前に空白を取り除くのを忘れたときにも助けになります:

    $filename = <INFO>;         # oops, \n still there
    open(EXTRA, "< $filename") || die "can't open $filename: $!";

This is not a bug, but a feature. Because open mimics the shell in its style of using redirection arrows to specify how to open the file, it also does so with respect to extra whitespace around the filename itself as well. For accessing files with naughty names, see "Dispelling the Dweomer".

これはバグではありません、仕様です。 open はどのようにファイルを開くかを指定するのにリダイレクトの矢印を使うことでシェルを真似ているので、ファイル名の周りの空白についても同じように扱います。行儀の悪い名前のファイルにアクセスするためには、 "Dispelling the Dweomer" を参照してください。

There is also a 3-argument version of open, which lets you put the special redirection characters into their own argument:

また、3 引数版の open もあって、これは特殊なリダイレクト文字を独立した引数にしたものです:

    open( INFO, ">", $datafile ) || die "Can't create $datafile: $!";

In this case, the filename to open is the actual string in $datafile, so you don't have to worry about $datafile containing characters that might influence the open mode, or whitespace at the beginning of the filename that would be absorbed in the 2-argument version. Also, any reduction of unnecessary string interpolation is a good thing.

この場合、開くファイル名は $datafile の実際の文字列なので、 $datafile に開くモードに影響を与える文字や、 2 引数版では吸収されるファイル名の先頭の空白が含まれているかどうかを心配する必要はありません。また、不必要な文字列変換が削減されるのもよいことです。

間接ファイルハンドル¶

open's first argument can be a reference to a filehandle. As of perl 5.6.0, if the argument is uninitialized, Perl will automatically create a filehandle and put a reference to it in the first argument, like so:

open の最初の引数は、ファイルハンドルへのリファレンスにすることも出来ます。 perl 5.6.0 以降、引数が初期化されていない場合、Perl は以下のように、自動的にファイルハンドルを作成して、それへのリファレンスを最初の引数に設定します:

    open( my $in, $infile )   or die "Couldn't read $infile: $!";
    while ( <$in> ) {
        # do something with $_
    }
    close $in;

Indirect filehandles make namespace management easier. Since filehandles are global to the current package, two subroutines trying to open INFILE will clash. With two functions opening indirect filehandles like my $infile, there's no clash and no need to worry about future conflicts.

間接ファイルハンドルは、名前空間管理をより容易にします。ファイルハンドルは現在のパッケージに対してグローバルなので、二つのサブルーチンが INFILE を開こうとすると衝突します。二つの関数が my $infil のように間接ファイルハンドルで開いていると、衝突は発生せず、将来の衝突を気にする必要もありません。

Another convenient behavior is that an indirect filehandle automatically closes when there are no more references to it:

もう一つの便利は振る舞いとして、間接ファイルハンドルは、それに対する参照がなくなったとき、自動的に閉じます:

    sub firstline {
        open( my $in, shift ) && return scalar <$in>;
        # no close() required
    }

Indirect filehandles also make it easy to pass filehandles to and return filehandles from subroutines:

間接ファイルハンドルは、サブルーチンとのファイルハンドルの受け渡しも容易にします:

    for my $file ( qw(this.conf that.conf) ) {
        my $fin = open_or_throw('<', $file);
        process_conf( $fin );
        # no close() needed
    }

    use Carp;
    sub open_or_throw {
        my ($mode, $filename) = @_;
        open my $h, $mode, $filename
            or croak "Could not open '$filename': $!";
        return $h;
    }

パイプを開く¶

In C, when you want to open a file using the standard I/O library, you use the fopen function, but when opening a pipe, you use the popen function. But in the shell, you just use a different redirection character. That's also the case for Perl. The open call remains the same--just its argument differs.

C では、標準 I/O ライブラリを使ってファイルを開きたいときは fopen を使いますが、パイプを開くときには popen 関数を使います。しかし、シェルでは、単に違うリダイレクト文字を使います。これは Perl の場合にも当てはまります。 open 呼び出しは同じままです -- 単にその引数が変わります。

If the leading character is a pipe symbol, open starts up a new command and opens a write-only filehandle leading into that command. This lets you write into that handle and have what you write show up on that command's standard input. For example:

先頭の文字がパイプ記号の場合、open は新しいコマンドを準備して、そのコマンドへと導かれる書き込み専用のファイルハンドルを開きます。これによって、あなたがこのハンドルに書き込んだものがコマンドの標準入力に渡されるようになります。例えば:

    open(PRINTER, "| lpr -Plp1")    || die "can't run lpr: $!";
    print PRINTER "stuff\n";
    close(PRINTER)                  || die "can't close lpr: $!";

If the trailing character is a pipe, you start up a new command and open a read-only filehandle leading out of that command. This lets whatever that command writes to its standard output show up on your handle for reading. For example:

末尾の文字がパイプの場合、新しいコマンドを準備して、そのコマンドから導かれる読み込み専用のファイルハンドルを開きます。これにより、そのコマンドが標準出力にしたものはなんでも読み込み用のファイルハンドルに現れます。例えば:

    open(NET, "netstat -i -n |")    || die "can't fork netstat: $!";
    while (<NET>) { }               # do something with input
    close(NET)                      || die "can't close netstat: $!";

What happens if you try to open a pipe to or from a non-existent command? If possible, Perl will detect the failure and set $! as usual. But if the command contains special shell characters, such as > or *, called 'metacharacters', Perl does not execute the command directly. Instead, Perl runs the shell, which then tries to run the command. This means that it's the shell that gets the error indication. In such a case, the open call will only indicate failure if Perl can't even run the shell. See "How can I capture STDERR from an external command?" in perlfaq8 to see how to cope with this. There's also an explanation in perlipc.

存在しないコマンドに対してパイプを開こうとすると何が起こるでしょうか? 可能なら、Perl は失敗を検出していつも通り $! をセットします。しかし、もしコマンドに「メタ文字」と呼ばれる > や * のような特殊シェル文字が含まれていると、Perl はコマンドを直接実行しません。その代わりに、Perl はシェルを実行し、それからコマンドを実行しようとします。これは、エラーを受け取るのはシェルであることを意味します。このような場合、open 呼び出しは、たとえ Perl がシェルを実行できなかった場合でも、失敗を示すだけです。これを扱う方法については、 "How can I capture STDERR from an external command?" in perlfaq8 を参照してください。 perlipc にも説明があります。

If you would like to open a bidirectional pipe, the IPC::Open2 library will handle this for you. Check out "Bidirectional Communication with Another Process" in perlipc

双方向パイプを開きたい場合は、IPC::Open2 ライブラリが使えます。 "Bidirectional Communication with Another Process" in perlipc を参照してください。

perl-5.6.x introduced a version of piped open that executes a process based on its command line arguments without relying on the shell. (Similar to the system(@LIST) notation.) This is safer and faster than executing a single argument pipe-command, but does not allow special shell constructs. (It is also not supported on Microsoft Windows, Mac OS Classic or RISC OS.)

perl-5.6.x から、シェルに頼らずにコマンドライン引数を基にしてプロセスを実行するパイプオープンが導入されました。 (system(@LIST) 記法と同様です。) これは 1 引数のパイプコマンドを実行するより安全で高速ですが、特殊シェル構文は使えません。 (また、Microsoft Windows, Mac OS Classic, RISC OS でも対応していません。)

Here's an example of open '-|', which prints a random Unix fortune cookie as uppercase:

以下は open '-|' の例で、ランダムな Unix おみくじを大文字で表示します:

    my $collection = shift(@ARGV);
    open my $fortune, '-|', 'fortune', $collection
        or die "Could not find fortune - $!";
    while (<$fortune>)
    {
        print uc($_);
    }
    close($fortune);

And this open '|-' pipes into lpr:

そしてこれは open '|-' パイプを lpr に送ります:

    open my $printer, '|-', 'lpr', '-Plp1'
        or die "can't run lpr: $!";
    print {$printer} "stuff\n";
    close($printer)
        or die "can't close lpr: $!";

"-" ファイル¶

Again following the lead of the standard shell utilities, Perl's open function treats a file whose name is a single minus, "-", in a special way. If you open minus for reading, it really means to access the standard input. If you open minus for writing, it really means to access the standard output.

再び標準シェルの機能に合わせるように、Perl の open 関数は、名前がマイナス一つ "-" だけのファイルを特別に扱います。読み込み用にマイナスを開くと、実際には標準入力にアクセスします。書き込み用にマイナスを開くと、実際には標準出力にアクセスします。

If minus can be used as the default input or default output, what happens if you open a pipe into or out of minus? What's the default command it would run? The same script as you're currently running! This is actually a stealth fork hidden inside an open call. See "Safe Pipe Opens" in perlipc for details.

マイナスがデフォルトの入力やデフォルトの出力として使えるとすると、パイプに対してマイナスを使うとどうなるでしょう? デフォルトのコマンドとして何が実行されるのでしょう? 今実行している同じスクリプトです! これは実際には open 呼び出し内で隠れた fork が行われます。詳しくは "Safe Pipe Opens" in perlipc を参照してください。

読み書きを混ぜる¶

It is possible to specify both read and write access. All you do is add a "+" symbol in front of the redirection. But as in the shell, using a less-than on a file never creates a new file; it only opens an existing one. On the other hand, using a greater-than always clobbers (truncates to zero length) an existing file, or creates a brand-new one if there isn't an old one. Adding a "+" for read-write doesn't affect whether it only works on existing files or always clobbers existing ones.

読み書きアクセス双方を指定することは可能です。必要なことはリダイレクトの前に "+" の文字を加えるだけです。しかしシェルの場合と同様、ファイルに小なり記号を使っても新しいファイルが作成されることはありません; すでにあるファイルを開くだけです。一方、大なり記号を使うと、ファイルがある場合には常に上書き (長さ 0 に切り詰め)られ、ファイルがない場合は新しいファイルが作成されます。読み書き用に "+" を追加しても、既にあるファイルにだけ動作するか既にあるファイルを上書きするかということには影響を与えません。

    open(WTMP, "+< /usr/adm/wtmp") 
        || die "can't open /usr/adm/wtmp: $!";

    open(SCREEN, "+> lkscreen")
        || die "can't open lkscreen: $!";

    open(LOGFILE, "+>> /var/log/applog")
        || die "can't open /var/log/applog: $!";

The first one won't create a new file, and the second one will always clobber an old one. The third one will create a new file if necessary and not clobber an old one, and it will allow you to read at any point in the file, but all writes will always go to the end. In short, the first case is substantially more common than the second and third cases, which are almost always wrong. (If you know C, the plus in Perl's open is historically derived from the one in C's fopen(3S), which it ultimately calls.)

一つ目のものは新しいファイルを作ることはなく、二つ目のものは常に古いファイルを上書きします。三つ目のものは必要があれば新しいファイルを作りますが、古いファイルを上書きせず、ファイルのどの地点でも読み込むことができますが、書き込みは常に末尾に行われます。要するに、一つ目のものは(ほとんど常に間違っている)二つ目や三つ目のものよりもかなり一般的です。 (もし C を知っているなら、Perl の open で使われるプラス記号が歴史的には (最終的に呼ばれることになる) C の fopen(3S) に由来しています。)

In fact, when it comes to updating a file, unless you're working on a binary file as in the WTMP case above, you probably don't want to use this approach for updating. Instead, Perl's -i flag comes to the rescue. The following command takes all the C, C++, or yacc source or header files and changes all their foo's to bar's, leaving the old version in the original filename with a ".orig" tacked on the end:

実際、ファイルを更新するとき、上述の WTMP の場合のようなバイナリファイルに対して作業をするのでない限り、おそらく更新のためにこの手法を使いたくないでしょう。代わりに、Perl の -i フラグが助けになります。以下のコマンドは C, C++, yacc 全てののソースファイルとヘッダファイルを取って、その中の全ての foo を bar に変更し、原版は元のファイル名の末尾に ".orig" を付けたファイルに保持します:

    $ perl -i.orig -pe 's/\bfoo\b/bar/g' *.[Cchy]

This is a short cut for some renaming games that are really the best way to update textfiles. See the second question in perlfaq5 for more details.

これは実際にはテキストファイルを更新するための最良の方法であるリネーム手法へのショートカットです。さらなる詳細については perlfaq5 の 2 番目の質問を参照してください。

フィルタ¶

One of the most common uses for open is one you never even notice. When you process the ARGV filehandle using <ARGV>, Perl actually does an implicit open on each file in @ARGV. Thus a program called like this:

open のもっとも一般的な使い方の一つは、使っていることを気づきすらしないものです。 ARGV ファイルハンドルを <ARGV> を使って処理するとき、Perl は実際は @ARGV の各ファイルを暗黙の内に開いています。従って、以下のようなプログラムは:

    $ myprogram file1 file2 file3

can have all its files opened and processed one at a time using a construct no more complex than:

以下のようなものより複雑な構文を使わなくても、それぞれのファイルを開いて一度に処理できます:

    while (<>) {
        # do something with $_
    }

If @ARGV is empty when the loop first begins, Perl pretends you've opened up minus, that is, the standard input. In fact, $ARGV, the currently open file during <ARGV> processing, is even set to "-" in these circumstances.

ループが最初に開始したときに @ARGV が空なら、Perl はマイナス記号 (つまり標準入力) を開いたかのように振る舞います。実際、<ARGV> で現在開いているファイルを示す $ARGV には、この慣習によって "-" がセットされます。

You are welcome to pre-process your @ARGV before starting the loop to make sure it's to your liking. One reason to do this might be to remove command options beginning with a minus. While you can always roll the simple ones by hand, the Getopts modules are good for this:

好みの形にするために、ループの開始前に @ARGV を前処理しても問題ありません。こうするための理由の一つは、マイナスから始まるコマンドオプションを削除するためです。いつでも自分で単純なものを作ることができる一方、 Getopts モジュールはこれを行うのによいものです:

    use Getopt::Std;

    # -v, -D, -o ARG, sets $opt_v, $opt_D, $opt_o
    getopts("vDo:");            

    # -v, -D, -o ARG, sets $args{v}, $args{D}, $args{o}
    getopts("vDo:", \%args);

Or the standard Getopt::Long module to permit named arguments:

あるいは、名前付きの引数を使えるようにするための標準の Getopt::Long モジュールもあります:

    use Getopt::Long;
    GetOptions( "verbose"  => \$verbose,        # --verbose
                "Debug"    => \$debug,          # --Debug
                "output=s" => \$output );       
            # --output=somestring or --output somestring

Another reason for preprocessing arguments is to make an empty argument list default to all files:

引数を前処理するためのもう一つの理由は、空引数リストの時はデフォルトで全てのファイルとする場合です:

    @ARGV = glob("*") unless @ARGV;

You could even filter out all but plain, text files. This is a bit silent, of course, and you might prefer to mention them on the way.

プレーンなテキストファイル以外をフィルタリングすることもできます。これはもちろん少し静かなので、途中でそれに言及したいかもしれません。

    @ARGV = grep { -f && -T } @ARGV;

If you're using the -n or -p command-line options, you should put changes to @ARGV in a BEGIN{} block.

もし -n や -p のコマンドラインオプションを使っているなら、 @ARGV への変更は BEGIN{} ブロックで行うべきです。

Remember that a normal open has special properties, in that it might call fopen(3S) or it might called popen(3S), depending on what its argument looks like; that's why it's sometimes called "magic open". Here's an example:

通常の open は特別な特性を持っていて、引数が何に見えるかによって、 fopen(3S) を呼ぶかもしれませんし、popen(3S) を呼ぶかもしれません; これが時々「マジカルに開く」と呼ばれる理由です。以下は例です:

    $pwdinfo = `domainname` =~ /^(\(none\))?$/
                    ? '< /etc/passwd'
                    : 'ypcat passwd |';

    open(PWD, $pwdinfo)                 
                or die "can't open $pwdinfo: $!";

This sort of thing also comes into play in filter processing. Because <ARGV> processing employs the normal, shell-style Perl open, it respects all the special things we've already seen:

このようなことはフィルタ処理でも起こります。 <ARGV> 処理は通常のシェル風の Perl open を用いるので、今までに見てきた全ての特別なことが反映されます:

    $ myprogram f1 "cmd1|" - f2 "cmd2|" f3 < tmpfile

That program will read from the file f1, the process cmd1, standard input (tmpfile in this case), the f2 file, the cmd2 command, and finally the f3 file.

このプログラムはファイル f1、プロセス cmd1、標準入力 (この場合は tmpfile)、ファイル f2、コマンド cmd2、ファイル f3 から読み込みます。

Yes, this also means that if you have files named "-" (and so on) in your directory, they won't be processed as literal files by open. You'll need to pass them as "./-", much as you would for the rm program, or you could use sysopen as described below.

はい、これは、"-" (あるいは同じような) 名前を持つファイルがある場合、 open によってそのまま処理することができないことも意味します。 rm プログラムに対して行うのと同様に "./-" という形で渡すか、後述する sysopen を使う必要があります。

One of the more interesting applications is to change files of a certain name into pipes. For example, to autoprocess gzipped or compressed files by decompressing them with gzip:

もっと興味深いアプリケーションの一つは、ある名前を持ったファイルをパイプに変更するものです。例えば、gzip や compress されたファイルを、gzip を使って自動的に展開するには:

    @ARGV = map { /\.(gz|Z)$/ ? "gzip -dc $_ |" : $_  } @ARGV;

Or, if you have the GET program installed from LWP, you can fetch URLs before processing them:

あるいは、LWP からインストールされる GET プログラムがあるなら、処理する前に URL をフェッチできます:

    @ARGV = map { m#^\w+://# ? "GET $_ |" : $_ } @ARGV;

It's not for nothing that this is called magic <ARGV>. Pretty nifty, eh?

これがマジカルな <ARGV> と呼ばれるのは理由のないことではありません。かなりしゃれてるでしょ?

C 風に開く¶

If you want the convenience of the shell, then Perl's open is definitely the way to go. On the other hand, if you want finer precision than C's simplistic fopen(3S) provides you should look to Perl's sysopen, which is a direct hook into the open(2) system call. That does mean it's a bit more involved, but that's the price of precision.

シェルの便利さを求めているなら、Perl の open はまさにぴったりです。一方、C の単純な fopen(3S) が提供しているものより高い精度を求めているなら、 open(2) システムコールへの直接的なフックである、Perl の sysopen を見るべきです。これはもう少し深く関わることを意味しますが、これは精度のコストです。

sysopen takes 3 (or 4) arguments.

sysopen は 3 (または 4) 引数を取ります。

    sysopen HANDLE, PATH, FLAGS, [MASK]

The HANDLE argument is a filehandle just as with open. The PATH is a literal path, one that doesn't pay attention to any greater-thans or less-thans or pipes or minuses, nor ignore whitespace. If it's there, it's part of the path. The FLAGS argument contains one or more values derived from the Fcntl module that have been or'd together using the bitwise "|" operator. The final argument, the MASK, is optional; if present, it is combined with the user's current umask for the creation mode of the file. You should usually omit this.

HANDLE 引数は open と同様のファイルハンドルです。 PATH はリテラルなパスで、大なりや小なりやパイプやマイナスや空白の無視といったことに一切注意を払いません。もしこれらの文字があれば、それはパスの一部です。 FLAGS 引数は、ビット単位 "|" 演算子で結合できる、Fcntl モジュールに由来する一つ以上の値を指定します。最後の引数である MASK はオプションです; もしあれば、これはファイルの作成モードのためのユーザーの現在の umask と組み合わされます。普通はこれは省略するべきです。

Although the traditional values of read-only, write-only, and read-write are 0, 1, and 2 respectively, this is known not to hold true on some systems. Instead, it's best to load in the appropriate constants first from the Fcntl module, which supplies the following standard flags:

読み込み専用、書き込み専用、読み書きを示す伝統的な値はそれぞれ 0, 1, 2 ですが、これが正しくないシステムもあることが知られています。代わりに、以下の標準フラグを提供している Fcntl モジュールから最初に適切な定数を読み込むのが最善です:

    O_RDONLY            Read only
    O_WRONLY            Write only
    O_RDWR              Read and write
    O_CREAT             Create the file if it doesn't exist
    O_EXCL              Fail if the file already exists
    O_APPEND            Append to the file
    O_TRUNC             Truncate the file
    O_NONBLOCK          Non-blocking access

Less common flags that are sometimes available on some operating systems include O_BINARY, O_TEXT, O_SHLOCK, O_EXLOCK, O_DEFER, O_SYNC, O_ASYNC, O_DSYNC, O_RSYNC, O_NOCTTY, O_NDELAY and O_LARGEFILE. Consult your open(2) manpage or its local equivalent for details. (Note: starting from Perl release 5.6 the O_LARGEFILE flag, if available, is automatically added to the sysopen() flags because large files are the default.)

オペレーティングシステムによっては、 O_BINARY, O_TEXT, O_SHLOCK, O_EXLOCK, O_DEFER, O_SYNC, O_ASYNC, O_DSYNC, O_RSYNC, O_NOCTTY, O_NDELAY, O_LARGEFILE のような、それほど有名ではないフラグも利用可能です。詳しくは open(2) man ページその等価物を参照してください。 (注意: Perl リリース 5.6 から、もし利用可能なら、sysopen() のフラグに自動的に O_LARGEFILE フラグが付きます; 大きなファイルがデフォルトになったからです。)

Here's how to use sysopen to emulate the simple open calls we had before. We'll omit the || die $! checks for clarity, but make sure you always check the return values in real code. These aren't quite the same, since open will trim leading and trailing whitespace, but you'll get the idea.

これは、前述した単純な open をエミュレートするために sysopen を使う方法です。明確化のために || die $! のチェックは省略しましたが、実際のコードでは常に返り値をチェックするようにしてください。 open は前後の空白を削除するのでこれは全く同じというわけではありませんが、想像はできるでしょう。

To open a file for reading:

ファイルを読み込み用に開くには:

    open(FH, "< $path");
    sysopen(FH, $path, O_RDONLY);

To open a file for writing, creating a new file if needed or else truncating an old file:

ファイルを書き込み用に開いて、必要なら新しいファイルを作り、そうでなければ古いファイルを切り詰めるには:

    open(FH, "> $path");
    sysopen(FH, $path, O_WRONLY | O_TRUNC | O_CREAT);

To open a file for appending, creating one if necessary:

ファイルを追加用に開いて、もし必要なら新しいファイルを作るには:

    open(FH, ">> $path");
    sysopen(FH, $path, O_WRONLY | O_APPEND | O_CREAT);

To open a file for update, where the file must already exist:

既に存在しているファイルを更新用に開くには:

    open(FH, "+< $path");
    sysopen(FH, $path, O_RDWR);

And here are things you can do with sysopen that you cannot do with a regular open. As you'll see, it's just a matter of controlling the flags in the third argument.

そしてここでは普通の open では出来ないことを sysopen でしています。見てきたように、これは単に 3 番目の引数のフラグの制御の問題です。

To open a file for writing, creating a new file which must not previously exist:

既に存在していたりはしない新しいファイルを作成して、ファイルを書き込み用に開くには:

    sysopen(FH, $path, O_WRONLY | O_EXCL | O_CREAT);

To open a file for appending, where that file must already exist:

既に存在している必要があるファイルを追加用に開くには:

    sysopen(FH, $path, O_WRONLY | O_APPEND);

To open a file for update, creating a new file if necessary:

必要なら新しいファイルを作成して、ファイルを更新用に開くには:

    sysopen(FH, $path, O_RDWR | O_CREAT);

To open a file for update, where that file must not already exist:

予め存在していてはならないファイルを交信用に開くには:

    sysopen(FH, $path, O_RDWR | O_EXCL | O_CREAT);

To open a file without blocking, creating one if necessary:

必要ならファイルを作成して、ファイルをブロックせずに開くには:

    sysopen(FH, $path, O_WRONLY | O_NONBLOCK | O_CREAT);

権限モード¶

If you omit the MASK argument to sysopen, Perl uses the octal value 0666. The normal MASK to use for executables and directories should be 0777, and for anything else, 0666.

sysopen の MASK 引数を省略すると、Perl は 8 進数の 0666 を使います。実行ファイルとディレクトリに対する通常の MASK は 0777で、それ以外のファイルでは 0666 です。

Why so permissive? Well, it isn't really. The MASK will be modified by your process's current umask. A umask is a number representing disabled permissions bits; that is, bits that will not be turned on in the created file's permissions field.

なぜそんなに権限を与えるのでしょう? えっと、実際にはそうではありません。 MASK はプロセスの現在の umask で修正されます。 umask は 無効にする 許可ビットを表現する数値です; つまり、作成したファイルの許可フィールドを有効にすることはないということです。

For example, if your umask were 027, then the 020 part would disable the group from writing, and the 007 part would disable others from reading, writing, or executing. Under these conditions, passing sysopen 0666 would create a file with mode 0640, since 0666 & ~027 is 0640.

例えば、umask が 027 の場合、020 の部分はグループによる書き込みと実行を無効にし、007 の部分は他のユーザーによる読み込み、書き込み、実行を無効にします。この条件では、sysopen に 0666 を渡すとモード 0640 でファイルを作ります; 0666 & ~027 は 0640 だからです。

You should seldom use the MASK argument to sysopen(). That takes away the user's freedom to choose what permission new files will have. Denying choice is almost always a bad thing. One exception would be for cases where sensitive or private data is being stored, such as with mail folders, cookie files, and internal temporary files.

sysopen() に MASK 引数を使うことはほとんどないでしょう。これは、新しいファイルにどのパーミッションを与えるかというユーザーの自由を奪います。選択を拒むということは、ほとんど常に悪いことです。一つの例外は、メールフォルダ、クッキーファイル、内部用一時ファイルのような、微妙な、あるいはプライベートなデータを保管する場合でしょう。

わかりにくい開くときの小技¶

ファイルを再び開く(dup)¶

Sometimes you already have a filehandle open, and want to make another handle that's a duplicate of the first one. In the shell, we place an ampersand in front of a file descriptor number when doing redirections. For example, 2>&1 makes descriptor 2 (that's STDERR in Perl) be redirected into descriptor 1 (which is usually Perl's STDOUT). The same is essentially true in Perl: a filename that begins with an ampersand is treated instead as a file descriptor if a number, or as a filehandle if a string.

既に開いているファイルハンドルを持っている時に、これを複製してもう一つのハンドルがほしくなる場合がときどきあります。シェルでは、リダイレクトをするときにファイル記述子番号の前にアンパサンドを置きます。例えば 2>&1 は、記述子 2 (これは Perl では STDERR) を記述子 1 (これは Perl では普通は STDOUT) にリダイレクトします。同じことは Perl でも基本的には真です: アンパサンドで始まるファイル名は、それが数値ならファイル記述子、文字列ならファイルハンドルとして扱われます。

    open(SAVEOUT, ">&SAVEERR") || die "couldn't dup SAVEERR: $!";
    open(MHCONTEXT, "<&4")     || die "couldn't dup fd4: $!";

That means that if a function is expecting a filename, but you don't want to give it a filename because you already have the file open, you can just pass the filehandle with a leading ampersand. It's best to use a fully qualified handle though, just in case the function happens to be in a different package:

これは、もし関数がファイル名を想定しているけれども、既にファイルは開いているのでファイル名を渡したくない場合、単に先頭にアンパサンドを付けたファイルハンドルを渡せるということを意味します。しかし、万が一関数がたまたま違うパッケージだったときのために、完全修飾したハンドルを渡すのが最善です:

    somefunction("&main::LOGFILE");

This way if somefunction() is planning on opening its argument, it can just use the already opened handle. This differs from passing a handle, because with a handle, you don't open the file. Here you have something you can pass to open.

この方法により、somefunction() が引数の値を開いた場合、単に既に開いているハンドルを使えます。これはハンドルを渡すのとは違います; なぜならハンドルではファイルを開かないからです。こちらでは開くときに指定できるものが指定できます。

If you have one of those tricky, newfangled I/O objects that the C++ folks are raving about, then this doesn't work because those aren't a proper filehandle in the native Perl sense. You'll have to use fileno() to pull out the proper descriptor number, assuming you can:

もし、C++ 民が夢中になっているような巧妙で目新しい I/O オブジェクトの一つを使っているなら、これらはネイティブな Perl 的に適切なファイルハンドルではないので、上述のような方法は動作しません。適切な記述子番号を得るために fileno() を使う必要があります; それが出来ると仮定すれば:

    use IO::Socket;
    $handle = IO::Socket::INET->new("www.perl.com:80");
    $fd = $handle->fileno;
    somefunction("&$fd");  # not an indirect function call

It can be easier (and certainly will be faster) just to use real filehandles though:

しかし、単に普通のファイルハンドルを使う方が簡単でしょう (そして確実に高速です):

    use IO::Socket;
    local *REMOTE = IO::Socket::INET->new("www.perl.com:80");
    die "can't connect" unless defined(fileno(REMOTE));
    somefunction("&main::REMOTE");

If the filehandle or descriptor number is preceded not just with a simple "&" but rather with a "&=" combination, then Perl will not create a completely new descriptor opened to the same place using the dup(2) system call. Instead, it will just make something of an alias to the existing one using the fdopen(3S) library call. This is slightly more parsimonious of systems resources, although this is less a concern these days. Here's an example of that:

もしファイルハンドルや記述子番号の前にあるのが単なる "&" ではなく "&=" の組み合わせの場合、Perl は dup(2) システムコールを使って同じ場所で開いた完全に新しい記述子は作りません。代わりに、fdopen(3S) ライブラリコールを使ってすでにある記述子の別名的なものを作ります。これはシステムのリソースを少しケチることが出来ますが、最近ではこれはあまり関心を持たれなくなりました。以下はこの例です:

    $fd = $ENV{"MHCONTEXTFD"};
    open(MHCONTEXT, "<&=$fd")   or die "couldn't fdopen $fd: $!";

If you're using magic <ARGV>, you could even pass in as a command line argument in @ARGV something like "<&=$MHCONTEXTFD", but we've never seen anyone actually do this.

もしマジカルな <ARGV> を使っているなら、"<&=$MHCONTEXTFD" のような感じで @ARGV 内のコマンドライン引数として渡すことすら可能ですが、実際にこれをしている人を見たことはありません。

魔法を解く¶

Perl is more of a DWIMmer language than something like Java--where DWIM is an acronym for "do what I mean". But this principle sometimes leads to more hidden magic than one knows what to do with. In this way, Perl is also filled with dweomer, an obscure word meaning an enchantment. Sometimes, Perl's DWIMmer is just too much like dweomer for comfort.

Perl は、Java のような言語よりも「空気を読む」(DWIM)言語です -- DWIM とは "do what I mean" の略です。しかし、この原則は時々利用者が知っている以上の隠れた動作をすることがあります。こんな風に、Perl は (魔法を意味する不明確な単語である) dweomer にも満ちています。時々、Perl の空気の読み方は快適さのために魔法のようになります。

If magic open is a bit too magical for you, you don't have to turn to sysopen. To open a file with arbitrary weird characters in it, it's necessary to protect any leading and trailing whitespace. Leading whitespace is protected by inserting a "./" in front of a filename that starts with whitespace. Trailing whitespace is protected by appending an ASCII NUL byte ("\0") at the end of the string.

もしマジカルな open があなたにとってちょっとマジカルすぎるとしても、 sysopen にまで戻る必要はありません。ファイル名にどんな変な文字が含まれているファイルでも開くためには、先頭と末尾の空白を保護する必要があります。先頭の空白は、空白で始まるファイル名の前に "./" を挿入することで保護します。末尾の空白は、文字列の末尾に ASCII NUL バイト ("\0") を追加することで保護します。

    $file =~ s#^(\s)#./$1#;
    open(FH, "< $file\0")   || die "can't open $file: $!";

This assumes, of course, that your system considers dot the current working directory, slash the directory separator, and disallows ASCII NULs within a valid filename. Most systems follow these conventions, including all POSIX systems as well as proprietary Microsoft systems. The only vaguely popular system that doesn't work this way is the "Classic" Macintosh system, which uses a colon where the rest of us use a slash. Maybe sysopen isn't such a bad idea after all.

これはもちろん、あなたのシステムが "." をカレントディレクトリ、 "/" をディレクトリの区切りとして扱い、ASCII NUL をファイル名として認めていないということを仮定しています。全ての POSIX システムとプロプリエタリの Microsoft システムを含む、ほとんどのシステムはこの慣例に従っています。これに従わない、一般的に有名な唯一のシステムは "Classic" Macintosh システムです; これは他のシステムが "/" を使っているところで ":" を使います。おそらく、とにかく sysopen を使うということはそれほど悪い考えではありません。

If you want to use <ARGV> processing in a totally boring and non-magical way, you could do this first:

もし、<ARGV> の処理を、本当に退屈かつマジカルでない方法で行いたいなら、まず以下のようにできます:

    #   "Sam sat on the ground and put his head in his hands.  
    #   'I wish I had never come here, and I don't want to see 
    #   no more magic,' he said, and fell silent."
    for (@ARGV) { 
        s#^([^./])#./$1#;
        $_ .= "\0";
    } 
    while (<>) {  
        # now process $_
    }

But be warned that users will not appreciate being unable to use "-" to mean standard input, per the standard convention.

但し、ユーザーは、標準入力を意味するために "-" を使うという一般的な慣習が使えないということを喜ばないだろうということは警告しておきます。

open にパスを¶

You've probably noticed how Perl's warn and die functions can produce messages like:

どうやって Perl の warn 関数と die 関数が以下のようなメッセージを生成するかに気付いたでしょう:

    Some warning at scriptname line 29, <FH> line 7.

That's because you opened a filehandle FH, and had read in seven records from it. But what was the name of the file, rather than the handle?

これは、あなたがファイルハンドル FH を開いて、そこから 7 レコードを読み込んだからです。しかし、ハンドルではなく、ファイル名はどうでしょう?

If you aren't running with strict refs, or if you've turned them off temporarily, then all you have to do is this:

もし strict refs を有効にしていないか、一時的に無効にしているなら、する必要があるのは以下のことだけです:

    open($path, "< $path") || die "can't open $path: $!";
    while (<$path>) {
        # whatever
    }

Since you're using the pathname of the file as its handle, you'll get warnings more like

ファイルのパス名をハンドルとして使っているので、以下のような警告が出ます

    Some warning at scriptname line 29, </etc/motd> line 7.

1 引数の open¶

Remember how we said that Perl's open took two arguments? That was a passive prevarication. You see, it can also take just one argument. If and only if the variable is a global variable, not a lexical, you can pass open just one argument, the filehandle, and it will get the path from the global scalar variable of the same name.

Perl の open は 2 引数を取ると言ったことを覚えていますか? これは消極的なごまかしです。ほら、単に 1 引数を取ることもできます。変数がレキシカルではなくグローバルな変数の場合にのみ、open に 1 引数だけ(ファイルハンドル)を渡すことができます; こうすると、同じ名前を持つグローバルなスカラ変数からパスを取ります。

    $FILE = "/etc/motd";
    open FILE or die "can't open $FILE: $!";
    while (<FILE>) {
        # whatever
    }

Why is this here? Someone has to cater to the hysterical porpoises. It's something that's been in Perl since the very beginning, if not before.

どうしてこれはここなんでしょう? 誰かがヒステリックなネズミイルカの要求を満たす必要があります。これは(遅くとも)非常に初期から Perl にあります。

STDIN と STDOUT を扱う¶

One clever move with STDOUT is to explicitly close it when you're done with the program.

STDOUT に関する一つの利口な行動は、プログラムの終了時に明示的に閉じることです。

    END { close(STDOUT) || die "can't close stdout: $!" }

If you don't do this, and your program fills up the disk partition due to a command line redirection, it won't report the error exit with a failure status.

これをしないままで、このプログラムがコマンドラインリダイレクトによってディスクをいっぱいにしてしまっても、失敗状態でエラー終了しません。

You don't have to accept the STDIN and STDOUT you were given. You are welcome to reopen them if you'd like.

与えられた STDIN と STDOUT を受け入れる必要はありません。もし望むなら、これらを開き直せます。

    open(STDIN, "< datafile")
        || die "can't open datafile: $!";

    open(STDOUT, "> output")
        || die "can't open output: $!";

And then these can be accessed directly or passed on to subprocesses. This makes it look as though the program were initially invoked with those redirections from the command line.

それからこれらは直接アクセスしたり子プロセスに渡したりできます。これらは、プログラムの起動時にコマンドラインからリダイレクトが与えられたかのように動作します。

It's probably more interesting to connect these to pipes. For example:

これらをパイプにつなぐ方がより興味深いでしょう。例えば:

    $pager = $ENV{PAGER} || "(less || more)";
    open(STDOUT, "| $pager")
        || die "can't fork a pager: $!";

This makes it appear as though your program were called with its stdout already piped into your pager. You can also use this kind of thing in conjunction with an implicit fork to yourself. You might do this if you would rather handle the post processing in your own program, just in a different process:

これによって、プログラムの標準出力がが既にページャとパイプでつながれているかのように見えます。このようなことはまた、自分自身を暗黙に fork したものと結合するためにも使えます。自分自身のプログラムの別のプロセスでで後処理を扱いたい場合、以下のようにできます:

    head(100);
    while (<>) {
        print;
    } 

    sub head {
        my $lines = shift || 20;
        return if $pid = open(STDOUT, "|-");       # return if parent
        die "cannot fork: $!" unless defined $pid;
        while (<STDIN>) {
            last if --$lines < 0;
            print;
        } 
        exit;
    }

This technique can be applied to repeatedly push as many filters on your output stream as you wish.

このテクニックは、繰り返しプッシュすることで、出力ストリームに好きなだけ多くのフィルタを適用できます。

その他の I/O 関連の話題¶

These topics aren't really arguments related to open or sysopen, but they do affect what you do with your open files.

これらの話題は実際には open や sysopen に関連したものではありませんが、ファイルを開くときに行うことに影響を与えます。

ファイルでないファイルを開く¶

When is a file not a file? Well, you could say when it exists but isn't a plain file. We'll check whether it's a symbolic link first, just in case.

ファイルがファイルでないときは? えっと、プレーンファイルでないもののとき、と言いたいんですよね。まず、念のために、それがシンボリックリンクかどうかを調べます。

    if (-l $file || ! -f _) {
        print "$file is not a plain file\n";
    }

What other kinds of files are there than, well, files? Directories, symbolic links, named pipes, Unix-domain sockets, and block and character devices. Those are all files, too--just not plain files. This isn't the same issue as being a text file. Not all text files are plain files. Not all plain files are text files. That's why there are separate -f and -T file tests.

えーと、ファイルの他にどんな種類のファイルがあるのでしょう? ディレクトリ、シンボリックリンク、名前付きパイプ、Unix ドメインソケット、キャラクタデバイス、ブロックデバイスです。これらも全てファイルです -- 単に プレーン ファイルではないというだけです。これはテキストファイルと同じ問題ではありません。全てのテキストファイルがプレーンファイルではありません。全てのプレーンファイルがテキストファイルではありません。これが、-f と -T のファイルテストが分離している理由です。

To open a directory, you should use the opendir function, then process it with readdir, carefully restoring the directory name if necessary:

ディレクトリを開くには、opendir 関数を使って、それから readdir で処理します; もし必要なら注意深くディレクトリ名を復元します:

    opendir(DIR, $dirname) or die "can't opendir $dirname: $!";
    while (defined($file = readdir(DIR))) {
        # do something with "$dirname/$file"
    }
    closedir(DIR);

If you want to process directories recursively, it's better to use the File::Find module. For example, this prints out all files recursively and adds a slash to their names if the file is a directory.

ディレクトリを再帰的に処理したい場合は、File::Find モジュールを使った方がいいでしょう。例えば、これは全てのファイルを再帰的に表示して、もしファイルがディレクトリの場合は末尾にスラッシュを追加します。

    @ARGV = qw(.) unless @ARGV;
    use File::Find;
    find sub { print $File::Find::name, -d && '/', "\n" }, @ARGV;

This finds all bogus symbolic links beneath a particular directory:

以下は、特定のディレクトリ以下から偽のシンボリックリンクを全て探します:

    find sub { print "$File::Find::name\n" if -l && !-e }, $dir;

As you see, with symbolic links, you can just pretend that it is what it points to. Or, if you want to know what it points to, then readlink is called for:

上述したように、シンボリックリンクの場合、単にそれが指しているもの振りをすることができます。あるいは、もしそれが何を指しているのかを知りたい場合は、 readlink を呼び出します:

    if (-l $file) {
        if (defined($whither = readlink($file))) {
            print "$file points to $whither\n";
        } else {
            print "$file points nowhere: $!\n";
        } 
    }

名前付きパイプを開く¶

Named pipes are a different matter. You pretend they're regular files, but their opens will normally block until there is both a reader and a writer. You can read more about them in "Named Pipes" in perlipc. Unix-domain sockets are rather different beasts as well; they're described in "Unix-Domain TCP Clients and Servers" in perlipc.

名前付きパイプは別の問題です。これらは普通のファイルのように振る舞いますが、この open は普通読み込み側と書き込み側の両方ができるまでブロックされます。これらについては "Named Pipes" in perlipc でより多くのことを読むことができます。 Unix ドメインソケットは同様にやや違うものです; これらは "Unix-Domain TCP Clients and Servers" in perlipc に記述されています。

When it comes to opening devices, it can be easy and it can be tricky. We'll assume that if you're opening up a block device, you know what you're doing. The character devices are more interesting. These are typically used for modems, mice, and some kinds of printers. This is described in "How do I read and write the serial port?" in perlfaq8 It's often enough to open them carefully:

デバイスを開くときは、簡単にもなりますしトリッキーにもなります。ブロックデバイスを開こうとしているなら、何をしようとしているのか分かっていることを仮定します。キャラクタデバイスはもっと興味深いです。これらは典型的にはモデム、マウス、ある種のプリンタのために使われます。これは "How do I read and write the serial port?" in perlfaq8 に記述されています。しばしば慎重に開くだけで充分です:

    sysopen(TTYIN, "/dev/ttyS1", O_RDWR | O_NDELAY | O_NOCTTY)
                # (O_NOCTTY no longer needed on POSIX systems)
        or die "can't open /dev/ttyS1: $!";
    open(TTYOUT, "+>&TTYIN")
        or die "can't dup TTYIN: $!";

    $ofh = select(TTYOUT); $| = 1; select($ofh);

    print TTYOUT "+++at\015";
    $answer = <TTYIN>;

With descriptors that you haven't opened using sysopen, such as sockets, you can set them to be non-blocking using fcntl:

ソケットのように、sysopen を使わずに開いた記述子の場合は、 fcntl を使って非ブロックモードに設定できます:

    use Fcntl;
    my $old_flags = fcntl($handle, F_GETFL, 0) 
        or die "can't get flags: $!";
    fcntl($handle, F_SETFL, $old_flags | O_NONBLOCK) 
        or die "can't set non blocking: $!";

Rather than losing yourself in a morass of twisting, turning ioctls, all dissimilar, if you're going to manipulate ttys, it's best to make calls out to the stty(1) program if you have it, or else use the portable POSIX interface. To figure this all out, you'll need to read the termios(3) manpage, which describes the POSIX interface to tty devices, and then POSIX, which describes Perl's interface to POSIX. There are also some high-level modules on CPAN that can help you with these games. Check out Term::ReadKey and Term::ReadLine.

もし tty を操作しようとしているなら、全く異なる ioctl の泥沼に迷い込むのではなく、もし stty(1) プログラムがあるならこれを呼び出して、さもなければ移植性のある POSIX インターフェースを使うのが最善です。これらのこと全てを理解するには、まず tty デバイスへの POSIX インターフェースについて記述している termios(3) man ページを読んで、次に POSIX への Perl のインターフェースについて記述している POSIX を読む必要があります。これらのものを扱う助けになるような高レベルモジュールも CPAN にあります。 Term::ReadKey と Term::ReadLine を調べてください。

ソケットを開く¶

What else can you open? To open a connection using sockets, you won't use one of Perl's two open functions. See "Sockets: Client/Server Communication" in perlipc for that. Here's an example. Once you have it, you can use FH as a bidirectional filehandle.

他の何を開けるの? ソケットを使った接続を開くには、Perl の 2 つの open 関数のどちらも使いません。そのためには "Sockets: Client/Server Communication" in perlipc を参照してください。以下は例です。これを実行すると、FH を双方向ファイルハンドルとして使えます。

    use IO::Socket;
    local *FH = IO::Socket::INET->new("www.perl.com:80");

For opening up a URL, the LWP modules from CPAN are just what the doctor ordered. There's no filehandle interface, but it's still easy to get the contents of a document:

URL を開くには、CPAN にある LWP モジュールがぴったりです。ファイルハンドルのインターフェースはないですが、それでも簡単に文書の中身を得られます:

    use LWP::Simple;
    $doc = get('http://www.cpan.org/');

バイナリファイル¶

On certain legacy systems with what could charitably be called terminally convoluted (some would say broken) I/O models, a file isn't a file--at least, not with respect to the C standard I/O library. On these old systems whose libraries (but not kernels) distinguish between text and binary streams, to get files to behave properly you'll have to bend over backwards to avoid nasty problems. On such infelicitous systems, sockets and pipes are already opened in binary mode, and there is currently no way to turn that off. With files, you have more options.

最終的に (壊れていると言われる) I/O モデルに巻き込まれると寛大にも呼ばれるある種の古いシステムでは、ファイルはファイルではありません -- 少なくとも C 標準 I/O ライブラリという観点では。 (カーネルではなく)ライブラリがテキストストリームとバイナリストリームを区別するような古いシステムでは、適切に振る舞うようにファイルを取得するためには、不愉快な問題を避けるために懸命な努力が必要です。このような不幸なシステムでは、ソケットとパイプは既にバイナリモードで開いていて、今のところこれをオフにする方法はありません。ファイルに対しては、もっと選択肢があります。

Another option is to use the binmode function on the appropriate handles before doing regular I/O on them:

もう一つの選択肢は、通常の I/O を行う前に、適切なファイルハンドルに binmode 関数を使うことです:

    binmode(STDIN);
    binmode(STDOUT);
    while (<STDIN>) { print }

Passing sysopen a non-standard flag option will also open the file in binary mode on those systems that support it. This is the equivalent of opening the file normally, then calling binmode on the handle.

sysopen に非標準フラグオプションを渡すことでも、そのようなシステムでバイナリモードでファイルを開けます。これは、ファイルを普通に開いてから、ハンドルに対して binmode を呼び出すのと等価です。

    sysopen(BINDAT, "records.data", O_RDWR | O_BINARY)
        || die "can't open records.data: $!";

Now you can use read and print on that handle without worrying about the non-standard system I/O library breaking your data. It's not a pretty picture, but then, legacy systems seldom are. CP/M will be with us until the end of days, and after.

これで、非標準システム I/O ライブラリがデータを壊す心配なしにハンドルに対して read と print を使えるようになりました。これは美しい形ではありませんが、レガシーシステムとは大抵そういうものです。 CP/M は世界が終わるまで(そしてその後も)我々と共にあるでしょう。

On systems with exotic I/O systems, it turns out that, astonishingly enough, even unbuffered I/O using sysread and syswrite might do sneaky data mutilation behind your back.

風変わりな I/O システムを持つシステムでは、驚いたことに、 sysread や syswrite を使ったバッファリングしない I/O でさえも背後でこっそりとデータ操作をすることがあります。

    while (sysread(WHENCE, $buf, 1024)) {
        syswrite(WHITHER, $buf, length($buf));
    }

Depending on the vicissitudes of your runtime system, even these calls may need binmode or O_BINARY first. Systems known to be free of such difficulties include Unix, the Mac OS, Plan 9, and Inferno.

実行させるシステムの紆余曲折具合によっては、これらのシステムコールですら最初に binmode や O_BINARY が必要かもしれません。このような問題がないと分かっているシステムには Unix, Mac OS, Plan 9, Inferno などがあります。

ファイルのロック¶

In a multitasking environment, you may need to be careful not to collide with other processes who want to do I/O on the same files as you are working on. You'll often need shared or exclusive locks on files for reading and writing respectively. You might just pretend that only exclusive locks exist.

マルチタスク環境では、あなたが触ろうとしているファイルと同じファイルを他のプロセスが衝突しないように気をつける必要があります。しばしば、ファイルを読み込みまたは書き込みするために、それぞれ共有ロックと排他ロックが必要になります。あるいは、単に排他ロックしかないような振りをするかもしれません。

Never use the existence of a file -e $file as a locking indication, because there is a race condition between the test for the existence of the file and its creation. It's possible for another process to create a file in the slice of time between your existence check and your attempt to create the file. Atomicity is critical.

決して、ファイルの存在 -e $file をロック指示に使わないでください; なぜならファイルの存在のテストとその作成の間に競合条件があるからです。存在チェックとファイル作成のわずかな間に、他のプロセスがファイルを作る可能性があります。原子性は危機的です。

Perl's most portable locking interface is via the flock function, whose simplicity is emulated on systems that don't directly support it such as SysV or Windows. The underlying semantics may affect how it all works, so you should learn how flock is implemented on your system's port of Perl.

Perl でのもっとも移植性のあるロックインターフェースは、 flock 関数によるものです; この単純さは、SysV や Windows のような、これに直接対応していないシステムでもエミュレートされています。基礎となる動作はこれがどのように働くかに影響を与えるので、あなたが使うシステムの Perl で flock がどのように実装されているかを学ぶべきです。

File locking does not lock out another process that would like to do I/O. A file lock only locks out others trying to get a lock, not processes trying to do I/O. Because locks are advisory, if one process uses locking and another doesn't, all bets are off.

ファイルロックは、他のプロセスが I/O 操作を行うことからロックするもの ではありません。ファイルロックは、他のプロセスの I/O 操作をロックするのではなく、他のプロセスがロックを得ようとすることをロックします。ロックは勧告的なので、あるプロセスがロックを使っていても、他のプロセスがロックを使っていなければ、全ては台無しになります。

By default, the flock call will block until a lock is granted. A request for a shared lock will be granted as soon as there is no exclusive locker. A request for an exclusive lock will be granted as soon as there is no locker of any kind. Locks are on file descriptors, not file names. You can't lock a file until you open it, and you can't hold on to a lock once the file has been closed.

デフォルトでは、flock 呼び出しは、ロックが得られるまでブロックします。共有ロック要求は、誰も排他ロックを持っていない状態になれば直ちに受け入れられます。排他ロック要求は、誰もあらゆる種類のロックを守っていない状態になれば与えられます。ロックはファイル名に対してではなく、ファイル記述子について与えられます。ファイルを開かずにファイルをロックすることはできませんし、ファイルを閉じた後もロックを持ったままにすることもできません。

Here's how to get a blocking shared lock on a file, typically used for reading:

以下はファイルに対してブロックする共有ロックを得る方法で、典型的には読み込み時に使われます:

    use 5.004;
    use Fcntl qw(:DEFAULT :flock);
    open(FH, "< filename")  or die "can't open filename: $!";
    flock(FH, LOCK_SH)      or die "can't lock filename: $!";
    # now read from FH

You can get a non-blocking lock by using LOCK_NB.

LOCK_NB を使うことでブロックしないロックも得られます。

    flock(FH, LOCK_SH | LOCK_NB)
        or die "can't lock filename: $!";

This can be useful for producing more user-friendly behaviour by warning if you're going to be blocking:

ブロックするときに警告することで、よりユーザーにやさしい振る舞いをすることは有用です:

    use 5.004;
    use Fcntl qw(:DEFAULT :flock);
    open(FH, "< filename")  or die "can't open filename: $!";
    unless (flock(FH, LOCK_SH | LOCK_NB)) {
        $| = 1;
        print "Waiting for lock...";
        flock(FH, LOCK_SH)  or die "can't lock filename: $!";
        print "got it.\n"
    } 
    # now read from FH

To get an exclusive lock, typically used for writing, you have to be careful. We sysopen the file so it can be locked before it gets emptied. You can get a nonblocking version using LOCK_EX | LOCK_NB.

(典型的には書き込みのために) 排他ロックを得るためには、慎重になる必要があります。空なる前にロックするために、ファイルを sysopen で開きます。 LOCK_EX | LOCK_NB を使った非ブロック版も得られます。

    use 5.004;
    use Fcntl qw(:DEFAULT :flock);
    sysopen(FH, "filename", O_WRONLY | O_CREAT)
        or die "can't open filename: $!";
    flock(FH, LOCK_EX)
        or die "can't lock filename: $!";
    truncate(FH, 0)
        or die "can't truncate filename: $!";
    # now write to FH

Finally, due to the uncounted millions who cannot be dissuaded from wasting cycles on useless vanity devices called hit counters, here's how to increment a number in a file safely:

最後に、アクセスカウンタと呼ばれる無駄で空虚な装置のために CPU パワーを無駄遣いすることから逃れられない無慮数百万のために、あるファイルの数値を安全に増加させる方法を以下に示します:

    use Fcntl qw(:DEFAULT :flock);

    sysopen(FH, "numfile", O_RDWR | O_CREAT)
        or die "can't open numfile: $!";
    # autoflush FH
    $ofh = select(FH); $| = 1; select ($ofh);
    flock(FH, LOCK_EX)
        or die "can't write-lock numfile: $!";

    $num = <FH> || 0;
    seek(FH, 0, 0)
        or die "can't rewind numfile : $!";
    print FH $num+1, "\n"
        or die "can't write numfile: $!";

    truncate(FH, tell(FH))
        or die "can't truncate numfile: $!";
    close(FH)
        or die "can't close numfile: $!";

IO 層¶

In Perl 5.8.0 a new I/O framework called "PerlIO" was introduced. This is a new "plumbing" for all the I/O happening in Perl; for the most part everything will work just as it did, but PerlIO also brought in some new features such as the ability to think of I/O as "layers". One I/O layer may in addition to just moving the data also do transformations on the data. Such transformations may include compression and decompression, encryption and decryption, and transforming between various character encodings.

Perl 5.8.0 で、"PerlIO" と呼ばれる新しい I/O フレームワークが導入されました。これは Perl で発生する全ての I/O のための新しい「配管」です; ほとんど全ての部分では単に今まで通りに動作しますが、 I/O を「層」として考えるための機能のような、新しい要素も導入されています。ある I/O 層は単にデータを移動させるだけでなく、データを変換するかも知れません。このような変換には、圧縮と展開、暗号化と復号化、様々な文字エンコーディング間の変換を含むかも知れません。

Full discussion about the features of PerlIO is out of scope for this tutorial, but here is how to recognize the layers being used:

PerlIO の機能に関する完全な議論はこのチュートリアルの対象外ですが、層が使われていることをどうやって認識するかを以下に示します:

The three-(or more)-argument form of open is being used and the second argument contains something else in addition to the usual '<', '>', '>>', '|' and their variants, for example:

3 (以上) 引数形式の open が使われ、2 番目の引数に通常の '<', '>', '>>', '|' およびそのバリエーション以外の何かが含まれている場合; 例えば:
```
    open(my $fh, "<:crlf", $fn);
```
The two-argument form of binmode is being used, for example

2 引数形式の binmode が使われている場合; 例えば
```
    binmode($fh, ":encoding(utf16)");
```

For more detailed discussion about PerlIO see PerlIO; for more detailed discussion about Unicode and I/O see perluniintro.

PerlIO に関するより詳細な議論については PerlIO を参照してください; Unicode と I/O に関するより詳細な議論については perluniintro を参照してください。

AUTHOR and COPYRIGHT¶

This documentation is free; you can redistribute it and/or modify it under the same terms as Perl itself.

Irrespective of its distribution, all code examples in these files are hereby placed into the public domain. You are permitted and encouraged to use this code in your own programs for fun or for profit as you see fit. A simple comment in the code giving credit would be courteous but is not required.

HISTORY¶

First release: Sat Jan 9 08:09:11 MST 1999

名前¶

説明¶