Legend:
Library
Module
Module type
Parameter
Class
Class type
Reader is Async's main API for buffered input from a file descriptor. It is the analog of Stdio.In_channel.
Each reader has an internal buffer, which is filled via read() system calls when data is needed to satisfy a Reader.read* call.
Each of the read functions returns a deferred that will become determined when the read completes. It is an error to have two simultaneous reads. That is, if you call a read function, you should not call another read function until the first one completes.
If the file descriptor underlying a reader is closed, the reader will return EOF (after all the buffered bytes have been read).
Any Reader.read* call could, rather than determine its result, send an exception to the monitor in effect when read was called. Such exceptions can be handled in the usual way by using try_with, e.g.:
open_file file opens file for reading and returns a reader reading from it.
This may raise an exception for the typical reasons that an open(2) system call may fail. If it does raise, it's guaranteed to be a Unix_error variant.
transfer t pipe_w transfers data from t into pipe_w one chunk at a time (whatever is read from the underlying file descriptor without post-processing). The result becomes determined after reaching EOF on t and the final bytes have been transferred, or if pipe_w is closed.
This function will normally not be needed (see pipe).
pipe t returns the reader end of a pipe that will continually be filled with chunks of data from the underlying Reader.t. When the reader reaches EOF or the pipe is closed, pipe closes the reader, and then after the reader close is finished, closes the pipe.
of_pipe info pipe_r returns a reader t that receives all the data from pipe_r. If pipe_r is closed, t will see an EOF (but will not be automatically closed). If t is closed, then pipe_r will stop being drained.
of_pipe is implemented by shuttling bytes from pipe_r to the write-end of a Unix pipe, with t being attached to the read end of the Unix pipe.
with_file file f opens file, creates a reader with it, and passes the reader to f. It closes the reader when the result of f becomes determined, and returns f's result.
This may raise an exception for the typical reasons that an open(2) system call may fail. If it does raise before f is called, it's guaranteed to be a Unix_error variant.
Note: You need to be careful that all your IO is done when the deferred you return becomes determined. If for example you use with_file and call lines, make sure you return a deferred that becomes determined when the EOF is reached on the pipe, not when you get the pipe (because you get it straight away).
exclusive = true uses a filesystem lock to try and make sure that the file is not read while it's being modified. This is an advisory lock, which means that the writer must be cooperating by taking a relevant lock when writing (see Writer.with_file). This is unrelated and should not be confused with the O_EXCL flag in open systemcall. Note that the implementation uses Unix.lockf, which has known pitfalls. It's recommended that you avoid the exclusive flag in favor of using a library dedicated to dealing with file locks where the pitfalls can be documented in detail.
close t prevents further use of t and closes t's underlying file descriptor. The result of close becomes determined once the underlying file descriptor has been closed. It is an error to call other operations on t after close t has been called, except that calls of close subsequent to the original call to close will return the same deferred as the original call.
close_finished t becomes determined after t's underlying file descriptor has been closed, i.e., it is the same as the result of close. close_finished differs from close in that it does not have the side effect of initiating a close.
is_closed t returns true iff close t has been called.
with_close t ~f runs f (), and closes t after f finishes or raises.
drain t reads and ignores all data from t until it hits EOF, and then closes t.
type'a read_one_chunk_at_a_time_result = [
| `Eof
| `Stopped of'a
(*
`Eof_with_unconsumed_data s means that handle_chunk returned `Consumed (c, _) and left data in the reader's buffer (i.e., c < len), and that the reader reached EOF without reading any more data into the buffer; hence the data in the buffer was never consumed (and never will be, since the reader is at EOF).
`Stop a means that handle_chunk consumed all len bytes, and that read_one_chunk_at_a_time should stop reading and return `Stopped a.
*)
| `Stop_consumed of'a * int
(*
`Stop_consumed (a, n) means that handle_chunk consumed n bytes, and that read_one_chunk_at_a_time should stop reading and return `Stopped a.
*)
| `Continue
(*
`Continue means that handle_chunk has consumed all len bytes.
*)
| `Consumed of int * [ `Need of int| `Need_unknown ]
(*
`Consumed (c, need) means that c bytes were consumed and need says how many bytes are needed (including the data remaining in the buffer after the c were already consumed). It is an error if c < 0 || c > len. For `Need n, it is an error if n < 0 || c + n <= len.
read_one_chunk_at_a_time t ~handle_chunk reads into t's internal buffer, and whenever bytes are available, applies handle_chunk to them. It waits to read again until the deferred returned by handle_chunk becomes determined. If handle_chunk returns `Consumed, then read_one_chunk_at_a_time will wait for additional data to arrive before calling handle_chunk again. Thus, handle_chunk should consume as much as possible.
read_one_chunk_at_a_time continues reading until it reaches `Eof or handle_chunk returns `Stop or `Stop_consumed. In the case of `Stop and `Stop_consumed, one may read from t after read_one_chunk_at_a_time returns.
type'a handle_iobuf_result = [
| `Stop of'a
| `Continue
]
`Stop a or `Continue respects the usual Iobuf semantics where data up to the Iobuf.Lo_bound is considered consumed.
read_one_iobuf_at_a_time is like read_one_chunk_at_a_time, except that the user-supplied handle_chunk function receives its data in an Iobuf.t, and uses the Iobuf position to communicate how much data was consumed. read_one_iobuf_at_a_time is implemented as a wrapper around read_one_chunk_at_a_time.
read_substring t ss reads up to Substring.length ss bytes into ss, blocking until some data is available or EOF is reached. The resulting i satisfies 0 < i <=
Substring.length ss.
really_read t buf ?pos ?len reads until it fills len bytes of buf starting at pos, or runs out of input. In the former case it returns `Ok. In the latter, it returns `Eof n where n is the number of bytes that were read before end of input, and 0 <= n < String.length ss.
val read_until :
t->[ `Pred of char -> bool| `Char of char ]->keep_delim:bool ->[ `Ok of string| `Eof_without_delim of string| `Eof ]Async_kernel.Deferred.t
read_until t pred ~keep_delim reads until it hits a delimiter c such that:
if pred = `Char c' then c = c'
if pred = `Pred p then p c
`Char c' is equivalent to `Pred (fun c -> c = c') but the underlying implementation is more efficient, in particular it will not call a function on every input character.
read_until returns a freshly-allocated string consisting of all the characters read and optionally including the delimiter as per keep_delim.
val read_until_bounded :
t->[ `Pred of char -> bool| `Char of char ]->keep_delim:bool ->max:int ->[ `Ok of string| `Eof_without_delim of string| `Eof| `Max_exceeded of string ]Async_kernel.Deferred.t
read_until_bounded is just like read_until, except you have the option of specifying a maximum number of chars to read (not including the separator).
When `Max_exceeded str is returned, the length of str will be equal to max + 1.
read_line t reads up to and including the next newline (\n) character (or \r\n) and returns a freshly-allocated string containing everything up to but not including the newline character. If read_line encounters EOF before the newline char then everything read up to but not including EOF will be returned as a line.
really_read_line ~wait_time t reads up to and including the next newline (\n) character and returns an optional, freshly-allocated string containing everything up to but not including the newline character. If really_read_line encounters EOF before the newline char, then a time span of wait_time will be used before the input operation is retried. If the descriptor is closed, None will be returned.
read_sexps t reads all the sexps and returns them as a pipe. When the reader reaches EOF or the pipe is closed, read_sexps closes the reader, and then after the reader close is finished, closes the pipe.
read_bin_prot ?max_len t bp_reader reads the next binary protocol message using binary protocol reader bp_reader. The format is the "size-prefixed binary protocol", in which the length of the data is prefixed as a 64-bit integer to the data. This is the format that Writer.write_bin_prot writes.
For higher performance, consider Unpack_sequence.unpack_bin_prot_from_reader.
read_marshal_raw reads and returns a buffer containing one marshaled value, but doesn't unmarshal it. You can just call Marshal.from_string on the string, and cast it to the desired type (preferably the actual type). Similar to Marshal.from_channel, but suffers from the String-length limitation (16MB) on 32-bit platforms.
read_all t read_one returns a pipe that receives all values read from t by repeatedly using read_one t. When the reader reaches EOF, it closes the reader, and then after the reader close is finished, closes the pipe.
lseek t offset ~mode clears t's buffer and calls Unix.lseek on t's file descriptor. The `Cur mode is not exposed because seeking relative to the current position of the file descriptor is not the same as seeking relative to the current position of the reader.
ltell t returns the file position of t from the perspective of a consumer of t. It uses Unix.lseek to find the file position of t's underlying file descriptor, and then subtracts the number of bytes in t's buffer that have been read from the OS but not from t.
lines t reads all the lines from t and puts them in the pipe, one line per pipe element. The lines do not contain the trailing newline. When the reader reaches EOF or the pipe is closed, lines closes the reader, and then after the reader close is finished, closes the pipe.
file_lines file returns a list of the lines in the file. The lines do not contain the trailing newline.
load_sexp file conv loads a sexp from file and converts it to a value using conv. This function provides an accurate error location if convert raises Of_sexp_error.
load_sexps is similar, but converts a sequence of sexps.