A User Agent Configuration Mechanism For Multimedia Mail Format Information
A User Agent Configuration Mechanism For Multimedia Mail Format Information
Borenstein, Bellcore
Request for Comments: 1343 June 1992
This is an informational memo for the Internet community, and requests discussion and
suggestions for improvements. This memo does not specify an Internet standard.
Distribution of this memo is unlimited.
Abstract
This memo suggests a file format to be used to inform multiple mail reading user agent
programs about the locally-installed facilities for handling mail in various formats. The
mechanism is explicitly designed to work with mail systems based Internet mail as
defined by RFC’s 821, 822, 934, 1049, 1113, and the Multipurpose Internet Mail
Extensions, known as MIME. However, with some extensions it could probably be made
to work for X.400-based mail systems as well. The format and mechanism are proposed
in a manner that is generally operating-system independent. However, certain
implementation details will inevitably reflect operating system differences, some of
which will have to be handled in a uniform manner for each operating system. This
memo makes such situations explicit, and, in an appendix, suggests a standard behavior
under the UNIX operating system.
Introduction
The electronic mail world is in the midst of a transition from single-part text-only mail to
multi-part, multi-media mail. In support of this transition, various extensions to RFC 821
and RFC 822 have been proposed and/or adopted, notably including MIME [RFC-1341].
Various parties have demonstrated extremely high-functionality multimedia mail, but the
problem of mail interchange between different user agents has been severe. In general,
only text messages have been shared between user agents that were not explicitly
designed to work together. This limitation is not compatible with a smooth transition to a
multi-media mail world.
One approach to this transition is to modify diverse sets of mail reading user agents so
that, when they need to display mail of an unfamiliar (non-text) type, they consult an
external file for information on how to display that file. That file might say, for example,
that if the content-type of a message is "foo" it can be displayed to the user via the
"displayfoo" program.
Borenstein [Page 1]
RFC 1343 Multimedia Mail Configuration June 1992
This approach means that, with a one-time modification, a wide variety of mail reading
programs can be given the ability to display a wide variety of types of message.
Moreover, extending the set of media types supported at a site becomes a simple matter
of installing a binary and adding a single line to a configuration file. Crucial to this
scheme, however, is that all of the user agents agree on a common representation and
source for the configuration file. This memo proposes such a common representation.
Each user agent must clearly obtain the configuration information from a common
location, if the same information is to be used to configure all user agents. However,
individual users should be able to override or augment a site’s configuration. The
configuration information should therefore be obtained from a designated set of
locations. The overall configuration will be obtained through the virtual concatenation of
several individual configuration files known as mailcap files. The configuration
information will be obtained from the FIRST matching entry in a mailcap file, where
"matching" depends on both a matching content-type specification, an entry containing
sufficient information for the purposes of the application doing the searching, and the
success of any test in the "test=" field, if present.
Each mailcap file consists of a set of entries that describe the proper handling of one
media type at the local site. For example, one line might tell how to display a message in
Group III fax format. A mailcap file consists of a sequence of such individual entries,
separated by newlines (according to the operating system’s newline conventions). Blank
lines and lines that start with the "#" character (ASCII 35) are considered comments, and
are ignored. Long entries may be continued on multiple lines if each non-terminal line
ends with a backslash character (’\’, ASCII 92), in which case the multiple lines are to be
treated as a single mailcap entry. Note that for such "continued" lines, the backslash
must be the last character on the line to be continued.
Thus the overall format of a mailcap file is given, in the modified BNF of RFC 822, as:
Mailcap-File = *Mailcap-Line
Borenstein [Page 2]
RFC 1343 Multimedia Mail Configuration June 1992
Note that the above specification implies that comments must appear on lines all to
themselves, with a "#" character as the first character on each comment line.
Each mailcap entry consists of a number of fields, separated by semi-colons. The first
two fields are required, and must occur in the specified order. The remaining fields are
optional, and may appear in any order.
The first field is the content-type, which indicates the type of data this mailcap entry
describes how to handle. It is to be matched against the type/subtype specification in the
"Content-Type" header field of an Internet mail message. If the subtype is specified as
"*", it is intended to match all subtypes of the named content-type.
The second field, view-command, is a specification of how the message or body part can
be viewed at the local site. Although the syntax of this field is fully specified, the
semantics of program execution are necessarily somewhat operating system dependent.
UNIX semantics are given in Appendix A.
The optional fields, which may be given in any order, are as follows:
-- The "compose" field may be used to specify a program that can be used to compose a
new body or body part in the given format. Its intended use is to support mail composing
agents that support the composition of multiple types of mail using external composing
agents. As with the view-command, the semantics of program execution are operating
system dependent, with UNIX semantics specified in Appendix A. The result of the
composing program may be data that is not yet suitable for mail transport -- that is, a
Content-Transfer-Encoding may need to be applied to the data.
-- The "composetyped" field is similar to the "compose" field, but is to be used when the
composing program needs to specify the Content-type header field to be applied to the
composed data. The "compose" field is simpler, and is preferred for use with existing
(non-mail-oriented) programs for composing data in a given format. The
"composetyped" field is necessary when the Content-type information must include
auxilliary parameters, and the composition program must then know enough about mail
formats to produce output that includes the mail type information.
-- The "edit" field may be used to specify a program that can be used to edit a body or
body part in the given format. In many cases, it may be identical in content to the
"compose" field, and shares the operating-system dependent semantics for program
execution.
-- The "print" field may be used to specify a program that can be used to print a message
or body part in the given format. As with the view-command, the semantics of program
execution are operating system dependent, with UNIX semantics specified in Appendix
A.
Borenstein [Page 3]
RFC 1343 Multimedia Mail Configuration June 1992
-- The "test" field may be used to test some external condition (e.g. the machine
architecture, or the window system in use) to determine whether or not the mailcap line
applies. It specifies a program to be run to test some condition. The semantics of
execution and of the value returned by the test program are operating system dependent,
with UNIX semantics specified in Appendix A. If the test fails, a subsequent mailcap
entry should be sought. Multiple test fields are not permitted -- since a test can call a
program, it can already be arbitrarily complex.
-- The "copiousoutput" field indicates that the output from the view-command will be an
extended stream of output, and is to be interpreted as advice to the UA (User Agent
mail-reading program) that the output should be either paged or made scrollable. Note
that it is probably a mistake if needsterminal and copiousoutput are both specified.
-- The "description" field simply provides a textual description, optionally quoted, that
describes the type of data, to be used optionally by mail readers that wish to describe the
data before offering to display it.
-- The "x11-bitmap" field names a file, in X11 bitmap (xbm) format, which points to an
appropriate icon to be used to visually denote the presence of this kind of data.
-- Any other fields beginning with "x-" may be included for local or mailer-specific
extensions of this format. Implementations should simply ignore all such unrecognized
fields to permit such extensions, some of which might be standardized in a future version
of this document.
Some of the fields above, such as "needsterminal", apply to the actions of the view-
command, edit-command, and compose-command, alike. In some unusual cases, this
may not be desirable, but differentiation can be accomplished via separate mailcap
entries, taking advantage of the fact that subsequent mailcap entries are searched if an
earlier mailcap entry does not provide enough information:
Borenstein [Page 4]
RFC 1343 Multimedia Mail Configuration June 1992
compose=idraw %s
In RFC 822 modified BNF, the following grammar describes a mailcap entry:
implicitwild = type
view-command = mtext
mtext = *mchar
Note that "type", "subtype", and "x-token" are defined in MIME. Note also that while
the definition of "schar" includes the percent sign, "%", this character has a special
meaning in at least the UNIX semantics, and will therefore need to be quoted as a qchar
to be used literally.
Borenstein [Page 5]
RFC 1343 Multimedia Mail Configuration June 1992
Although this memo fully specifies a syntax for "mailcap" files, the semantics of the
mailcap file are of necessity operating-system dependent in four respects. In order to
clarify the intent, and to promote a standard usage, this appendix proposes a UNIX
semantics for these four cases. If a mailcap mechanism is implemented on non-UNIX
systems, similar semantic decisions should be made and published.
For UNIX, a path search of mailcap files is specified. The default path search is specified
as including at least the following:
$HOME/.mailcap:/etc/mailcap:/usr/etc/mailcap:/usr/local/etc/mailcap
However, this path may itself be overridden by a path specified by the MAILCAPS
environment variable.
On a UNIX system, such commands will each be a full shell command line, including the
path name for a program and its arguments. (Because of differences in shells and the
implementation and behavior of the same shell from one system to another, it is specified
that the command line be intended as input to the Bourne shell, i.e. that it is implicitly
preceded by "/bin/sh -c " on the command line.)
The two characters "%s", if used, will be replaced by the name of a file for the actual
mail body data. In the case of the edit adn view-command, the body part will be passed
to this command as standard input unless one or more instances of "%s" appear in the
view-command, in which case %s will be replaced by the name of a file containing the
body part, a file which may have to be created before the view-command program is
executed. (Such files cannot be presumed to continue to exist after the view-command
program exits. Thus a view-command that wishes to exit and continue processing in the
background should take care to save the data first.) In the case of the compose and
composetyped commands, %s should be replaced by the name of a file to which the
composed data should be written by the programs named in the compose or
composedtyped commands. Thus, the calling program will look in that file later in order
to retrieve the composed data. If %s does not appear in the compose or composetyped
commands, then the composed data will be assumed to be written by the composing
programs to standard output.
Borenstein [Page 6]
RFC 1343 Multimedia Mail Configuration June 1992
Furthermore, any occurrence of "%t" will be replaced by the content-type and subtype
specification. (That is, if the content-type is "text/plain", then %t will be replaced by
"text/plain".) A literal % character may be quoted as \%. Finally, named parameters
from the Content-type field may be placed in the command execution line using "%{"
followed by the parameter name and a closing "}" character. The entire parameter
should appear as a single command line argument, regardless of embedded spaces. Thus,
if the message has a Content-type line of:
multipart/*; /usr/local/bin/showmulti \
%t %{boundary}
/usr/local/bin/showmulti multipart/mixed 42
The "test" field specifies a program to be used to test whether or not the current mailcap
line applies. This can be used, for example, to have a mailcap line that only applies if the
X window system is running, or if the user is running on a SPARCstation with a
/dev/audio. The value of the "test" field is a program to run to test such a condition. The
precise program to run and arguments to give it are determined as specified in the
previous section. The test program should return an exit code of zero if the condition is
true, and a non-zero code otherwise.
On UNIX, the composing program is expected to produce a data stream for such a body
part as its standard output. The program will be executed with the command line
arguments determined as specified above. The data returned via its standard output will
be given a Content-Type field that has no supplementary parameters. For example, the
following mailcap entry:
audio/basic; /usr/local/bin/showaudio %t
compose = /usr/local/bin/recordaudio
would result in tagging the data composed by the "recordaudio" program as:
Content-Type: audio/basic
Borenstein [Page 7]
RFC 1343 Multimedia Mail Configuration June 1992
The "composetyped" filed is much like the "compose" field, except that it names a
composition program that produces, not raw data, but data that includes a MIME-
conformant type specification. The program will be executed with the command line
arguments determined as specified above. The data returned via its standard output must
begin with a Content-Type header, followed optionally by other Content-* headers, and
then by a blank line and the data. For example, the following mailcap entry:
multipart/mixed; /usr/local/bin/showmulti %t \
%{boundary}; \
composetyped = /usr/local/bin/makemulti
would result in executing the "makemulti" program, which would be expected to begin
its output with a line of the form:
Note that a composition program need not encode binary data in base64 or quoted-
printable. It remains the responsibility of the software calling the composition program to
encode such data as necessary. However, if a composing program does encode data,
which is not encouraged, it should announce that fact using a Content-Transfer-Encoding
header in the standard manner defined by MIME. Because such encodings must be
announced by such a header, they are an option only for composetyped programs, not for
compose programs.
The following is an example of a mailcap file for UNIX that demonstrates most of the
syntax above. It contains explanatory comments where necessary.
Borenstein [Page 8]
RFC 1343 Multimedia Mail Configuration June 1992
References
[RFC 822] Crocker, D., "Standard for the format of ARPA Internet text messages",
RFC 822, UDEL, August, 1982.
[RFC 1341] Borenstein, N., and N. Freed, "MIME (Multipurpose Internet Mail
Extensions): Mechanisms for Specifying and Describing the Format of Internet Message
Bodies", RFC 1341, Bellcore, June, 1992.
Acknowledgements
The author wishes to thank Malcolm Bjorn Gillies, Dan Heller, Olle Jaernefors, Keith
Moore, Luc Rooijakkers, and the other members of the IETF task force on mail
extensions for their comments on earlier versions of this draft. If other
acknowledgements were neglected, please let me know, as it was surely accidental.
Borenstein [Page 9]
RFC 1343 Multimedia Mail Configuration June 1992
Security Considerations
Security issues are not discussed in this memo. However, the use of the mechanisms
described in this memo can make it easier for implementations to slip into the kind of
security problems discussed in the MIME document. Implementors and mailcap
administrators should be aware of these security considerations, and in particular should
exercise caution in the choice of programs to be listed in a mailcap file for automatic
execution.
Author’s Address
Nathaniel S. Borenstein
MRE 2D-296, Bellcore
445 South St.
Morristown, NJ 07962-1910
Email: [email protected]
Phone: +1 201 829 4270
Fax: +1 201 829 7019