A proposal for a new top-level media type: archive
Sean Leonard <dev+ietf <at> seantek.com>
2014-09-24 23:23:21 GMT
Colleagues on media-types and apps-discuss:
I would like to propose that the IETF create a new top-level media type:
Basically, archive would be a top-level type for all types of archive
I think it's important to register archive formats as a distinct type
from application, because there are common semantics that apply. In
fact, these semantics are very similar to multipart and message
The archive data types are all storage formats for *files*, as opposed
to *content*. Each file has its own security implications, along with
metadata that also has security implications (user and group
permissions, access bits, executable bits, ACLs). At the highest level,
an Internet-connected application ought to be able to identify that a
particular piece of content is of this type (as opposed to the opaque
application type), so it can make decisions about the content that are
unique to archives, namely dealing with the security issues and
presenting a uniform user interface for handling such archives. Content
bundling types like message (RFC 5322), multipart, and application/cms
(CMS) are conceptually distinct. All those types can contain content
that can get split off into files, but their purpose is not to replicate
file system data.
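To make the metadata point concrete, here is a minimal sketch (in Python, using only the standard-library tarfile module) of the kind of check an application could run once it knows it is holding an archive. The function names risky_members and build_demo_archive are made up for illustration, and the list of "risky" conditions is an assumption, not a complete security policy.

```python
import io
import stat
import tarfile


def risky_members(archive_bytes):
    """Return (name, reason) pairs for tar entries that deserve a closer look."""
    findings = []
    with tarfile.open(fileobj=io.BytesIO(archive_bytes), mode="r:*") as tar:
        for member in tar.getmembers():
            # Absolute paths or ".." components could write outside the target directory.
            if member.name.startswith("/") or ".." in member.name.split("/"):
                findings.append((member.name, "path escapes extraction directory"))
            # File-system metadata carried by the archive, not by the content itself.
            if member.mode & (stat.S_ISUID | stat.S_ISGID):
                findings.append((member.name, "setuid/setgid bit set"))
            elif member.isfile() and member.mode & 0o111:
                findings.append((member.name, "executable bit set"))
            if member.issym() or member.islnk():
                findings.append((member.name, "link entry"))
    return findings


def build_demo_archive():
    """Build a small in-memory tar with one harmless and two suspicious entries."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name, mode in (("readme.txt", 0o644), ("run.sh", 0o755), ("../evil", 0o4755)):
            data = b"demo"
            info = tarfile.TarInfo(name)
            info.size = len(data)
            info.mode = mode
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


if __name__ == "__main__":
    for name, reason in risky_members(build_demo_archive()):
        print(f"{name}: {reason}")
```

Nothing here depends on the bytes being labeled as an archive, but a receiver can only know to run such a check if the label tells it the content carries file system data in the first place.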
Archives are ubiquitous on the Internet. Even if archives are used
"infrequently" across the Internet architecture, they are obviously used
at the endpoints. Improper transmission of archives has become a major
source of labeling and security issues.
Remarkably, most archive formats have not been registered as media types
(except for application/zip, which is an oldie). Therefore, it's pretty
much a "clean field". Furthermore, many widely available tools now
support multiple formats, so the probability is good that if you pass
some archive/*-labeled content to an archive application, it will be
able to do something intelligent with it.
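As a rough illustration of what dispatching on the top-level type could look like, here is a small Python sketch. Note the assumptions: archive/* is the type proposed in this message and is not registered, and the handler names ("archive-viewer", "mime-parser", "opaque-download") and the functions split_media_type and choose_handler are hypothetical.

```python
def split_media_type(value):
    """Split 'type/subtype; params' into a lowercase (type, subtype) pair."""
    essence = value.split(";", 1)[0].strip().lower()
    top, sep, sub = essence.partition("/")
    if not sep or not top or not sub:
        raise ValueError(f"not a media type: {value!r}")
    return top, sub


def choose_handler(content_type):
    """Pick a (hypothetical) handler name from the media type label alone."""
    top, sub = split_media_type(content_type)
    if top == "archive":
        # Proposed top-level type; any subtype goes to the archive tool.
        return "archive-viewer"
    if top in ("multipart", "message"):
        return "mime-parser"
    if top == "application" and sub == "zip":
        # Under application/*, each archive subtype has to be listed by name.
        return "archive-viewer"
    return "opaque-download"


if __name__ == "__main__":
    for label in ("archive/tar", "application/zip", "application/octet-stream"):
        print(label, "->", choose_handler(label))
```

The point of the sketch is the first branch: with a common top-level type, the receiver needs no per-format table to decide that the content should go to an archive tool.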
The following major sub-types of archives all belong in a common
top-level media type: [from Wikipedia]
* archiving only (concatenate files): tar
* multi-function (concatenate, compress, encrypt, etc.): zip, rar, 7z,
arc, arj, the list goes on and on...
* software packaging: cab, msi, pup, pet, apk, rpm...
* disk image: ISO-9660 (CD/DVD/Blu-ray), Apple Disk Image, virtual
floppy disks, formerly-known-as-TrueCrypt, etc.
* backup: (a large quantity of proprietary formats)
I know that the top-level media type (TLMT) matter has been brought up
before with fonts.
Where do we start? Maybe we should talk about it? I don't think it's as
simple as drafting an Internet-Draft. Maybe there should be a BOF or
working group. Experts with file system and archival experience should