List of archive formats

Last updated

This is a list of file formats used by archivers and compressors used to create archive files.

Contents

Archiving only

File extension(s) [1] MIME type [2] Official name [3] Platform [4] Description
.a, .arapplication/x-archive Unix Archiver Unix-likeThe traditional archive format on Unix-like systems, now used mainly for the creation of static libraries.
.cpioapplication/x-cpio cpio Unix-like RPM files consist of metadata concatenated with (usually) a cpio archive. Newer RPM systems also support other archives, as cpio is becoming obsolete. cpio is also used with initramfs.
.sharapplication/x-shar Shell archive Unix-likeA self-extracting archive that uses the Bourne shell (sh).
.LBR .LBR CP/M

DOS

A system for storing multiple files. LBR archives typically contained files processed by SQ, or the archive itself was compressed with SQ. LBR archives that were compressed with SQ ended with the extension .LQR
.isoapplication/x-iso9660-image ISO-9660 image (Various; cross platform)An archive format originally used mainly for archiving and distribution of the exact, nearly-exact, or custom-modified contents of an optical storage medium such as a CD-ROM or DVD-ROM. However, it can be used to archive the contents of other storage media, selected partitions, folders, and/or files. The resulting archive is typically optimized for convenient rendering to (re-)writable CD or DVD media.
.lbrCommodore 64/128A library format used primarily on the Commodore 64 and 128 lines of computers. This bears no resemblance to the DOS LBR format. While library files were quick to implement (a number of programs exist to work with them) they are crippled in that they cannot grow with use: once a file has been created it cannot be amended (files added, changed or deleted) without recreating the entire file.
.mar Mozilla ARchive (Various; cross platform)An archive format used by Mozilla for storing binary diffs. Used in conjunction with bzip2.
.sbxapplication/x-sbx SeqBox (Various; cross platform)A single file container/archive that can be reconstructed even after total loss of file system structures.
.tarapplication/x-tar Tape archive Unix-likeA common archive format used on Unix-like systems. Generally used in conjunction with compressors such as gzip, bzip2, compress or xz to create .tar.gz, .tar.bz2, .tar.Z or tar.xz files.

Compression only

File extension(s) [1] MIME type [2] Official name [3] Platform [4] Description
.brapplication/x-brotli Brotli allBrotli is a compression algorithm developed by Google for textual web content, and typically achieves higher compression ratios than other algorithms for this use case.
.bz2application/x-bzip2 bzip2 Unix-likeAn open source, patent- and royalty-free compression format. The compression algorithm is a Burrows–Wheeler transform followed by a move-to-front transform and finally Huffman coding.
.F, .?XF [5] Freeze/melt QNX4, Unix-like and DOS Old compressor for QNX4 OS. The compression algorithm is a modified LZSS, with an adaptive Huffman coding.
.genozipapplication/vnd.genozip genozip Linux, macOS, WindowsGenozip, a compressor for genomic file formats such as FASTQ, BAM, VCF and others. [6]
.gzapplication/gzip [7] gzip Unix-like GNU Zip, the primary compression format used by Unix-like systems. The compression algorithm is Deflate, which combines LZSS with Huffman coding.
.lzapplication/x-lzip lzip Unix-likeAn alternate LZMA algorithm implementation, with support for checksums and ident bytes.
.lz4 LZ4 Unix-likeAlgorithm developed by Yann Collet, designed for very high (de)compression speeds. It is an LZ77 derivative, without entropy encoding.
.lzmaapplication/x-lzma lzma Unix-likeThe LZMA compression algorithm as used by 7-Zip.
.lzoapplication/x-lzop lzop Unix-likeAn implementation of the LZO data compression algorithm.
.rz rzip Unix-likeA compression program designed to do particularly well on very large files containing long distance redundancy.
.sfark sfArk Windows compress/decompress- Linux and macOS decompress onlyA compression program designed to do high compression on SF2 files (SoundFont).
.szapplication/x-snappy-framed Snappy Unix-likeA compression format developed by Google, and open-sourced in 2011. Snappy aims for very high speeds, reasonable compression, and maximum stability rather than maximum compression or compatibility with any other compression library. It is an LZ77 derivative, without entropy encoding.
.?Q? SQ CP/M and DOS Squeeze: A program which compressed files using Huffman coding. A file which was "squeezed" had the middle initial of the name changed to "Q", so that a squeezed text file would end with .TQT, a squeezed executable would end with .CQM or .EQE. Typically used with .LBR archives, either by storing the squeezed files in the archive, or by storing the files decompressed and then compressing the archive, which would have a name ending in ".LQR".
.?Z? CRUNCH CP/M and DOS A compression program written by Steven Greenberg implementing the LZW algorithm. For several years in the CP/M world when no implementation was available of ARC, CRUNCHed files stored in .LBR archives were very popular. CRUNCH's implementation of LZW had a somewhat unusual feature of modifying and occasionally clearing the code table in memory when it became full, resulting in a few percent better compression on many files.
.xzapplication/x-xz xz Unix-likeA compression format using LZMA2 to yield high compression ratios. The LZMA algorithm is an LZ77 derivative, with entropy encoding in the form of range encoding.
.zapplication/x-compress pack Unix-likeThe traditional Huffman coding compression format.
.Zapplication/x-compress compress Unix-likeThe traditional LZW compression format.
.zstapplication/zstd Zstandard Cross-platform Algorithm developed by Yann Collet at Facebook, combining high speed and high compression. It is an LZ77 derivative, with entropy encoding in the form of finite-state entropy and Huffman coding.
.??_MS-DOS/WindowsCompression format(s) used by some DOS and Windows install programs. MS-DOS includes expand.exe to decompress its install files. The compressed files are created with a matching compress.exe command. The compression algorithm is LZSS.

Archiving and compression

File extension(s) [1] MIME type [2] Official name [3] Creation platform [4] Restoration platform [8] Restorable with free software [9] Description
.7zapplication/x-7z-compressed 7z MultipleMultipleYesOpen source file format. Used by 7-Zip.
.s7zapplication/x-7z-compressed7zX macOS macOS, restoration on different platforms is possible although not immediateYesBased on 7z. Preserves Spotlight metadata, resource forks, owner/group information, dates and other data which would be otherwise lost with compression.

Made obsolete by the introduction of AppleDouble-encoded 7z archives (Macintosh only).

.aarapplication/octet-streamApple Archive macOS macOS NoCreated by Apple introduced with iOS 14 in 2020.
.aceapplication/x-ace-compressed ACE Windows MultipleYesProprietary format
.afaapplication/x-astrotite-afaAFAUNIX-likeUNIX-likeNoA format that compresses and doubly encrypt the data (AES256 and CAS256) avoiding brute force attacks, also hide files in an AFA file. It has two ways to safeguard data integrity and subsequent repair of the file if has an error (repair with AstroA2P (online) or Astrotite (offline)).
.alzapplication/x-alz-compressed ALZip Windows MultipleYesA mainly Korean format designed for very large archives.
.apkapplication/vnd.android.package-archive APK MultipleMultipleYes Android application package (variant of JAR file format).
.arc, .arkapplication/octet-stream ARC MultipleMultipleYesVery popular in the early days of BBSes, one of the first to offer compression and archiving in a single program. Largely replaced by PKZIP.
.arc, .cdxapplication/x-freearc FreeArc Windows, Linux Windows, Linux YesOpen source file format developed by Bulat Ziganshin. A "FreeArc Next" version is under development which includes Zstandard support.
.arjapplication/x-arj ARJ Originally DOS, now multipleMultipleYesCompetitor to PKZIP in the 1990s, offered better multi-part archive handling.
.b1application/x-b1 B1 MultipleMultipleYesOpen archive format, used by B1 Free Archiver (http://dev.b1.org/standard/archive-format.html)
.b6zB6Z macOS MultipleYesCompressed archive format B6Zip
.baSciferMultipleMultipleYesBinary Archive with external header
.bhBlakHoleYesProprietary format from the ZipTV Compression Components
.cabapplication/vnd.ms-cab-compressed Cabinet Windows MultipleYesThe Microsoft Windows native archive format, which is also used by many commercial installers such as WISE.
.carCompressia archiveOriginally DOS, now DOS and Windows Originally DOS, now DOS and Windows YesCreated by Yaakov Gringeler; released last in 2003 (Compressia 1.0.0.1 beta), now apparently defunct. Free trial of 30 days lets user create and extract archives; after that it is possible to extract, but not to create.
.cfsapplication/x-cfs-compressed Compact File Set Windows, Unix-like including macOS MultipleYesOpen source file format.
.cpt Compact Pro Classic Mac OS MultipleYesCompact Pro archive, a common archiver used on Mac platforms until about Mac OS 7.5.x. Competed with StuffIt; now obsolete.
.darapplication/x-dar Disk Archiver Unix-like including macOS Unix-like including macOS, Windows YesOpen source file format. Files are compressed individually with either gzip, bzip2 or lzo.
.dd DiskDoubler Classic Mac OS Obsolete
.dgcapplication/x-dgc-compressed DGCA Windows Windows
.dmg application/x-apple-diskimage Apple Disk Image macOS macOS, Windows, Linux YesSupports "Internet-enabled" disk images, which, once downloaded, are automatically decompressed, mounted, have the contents extracted, and thrown away. Currently, Safari is the only browser that supports this form of extraction; however, the images can be manually extracted as well. This format can also be password-protected or encrypted with 128-bit or 256-bit AES encryption.
.ear EAR MultipleMultipleYesEnterprise Java Archive archive
.gcaapplication/x-gca-compressedGCAThe predecessor of DGCA.
.genozip Genozip Linux, macOS, Windows Linux, macOS, Windows NoOptimized for (but not limited to) genomic file types such as FASTQ, BAM, VCF
.ha Originally DOS Originally DOS Yes, but may be covered by patents DOS era format; uses arithmetic/Markov coding
.hki WinHKI Archived 2006-12-05 at the Wayback Machine MS WindowsMS WindowsNoHKI
.iceICE Windows Windows YesProduced by ICEOWS program. Excels at text file compression.
.jarapplication/java-archive JAR MultipleMultipleYesJava archive, compatible with ZIP files
.kgb KGB Archiver MultipleMultipleYesOpen sourced archiver with compression using the PAQ family of algorithms and optional encryption.
.lzh, .lhaapplication/x-lzh LHA Originally DOS, now multipleMultipleYesThe standard format on Amiga.
.lzxapplication/x-lzx LZX Amiga Archiver originally used on The Amiga. Now copied by Microsoft to use in their .cab and .chm files.
.pak PAK HP NonStop HP NonStop Yes [10] file format from NoGate Consultings, a rival from ARC-Compressor.

.pak was also briefly used by the short lived MSDOS PKPAK program.

.partimg PartImage MultipleMultipleYesA disk image archive format that supports several compression methods as well as splitting the archive into smaller pieces.
.paq6, .paq7, .paq8 and variants PAQ Unix-like and Windows Unix-like and Windows YesAn experimental open source packager (http://mattmahoney.net/dc)
.pea PeaZip Linux and Windows Linux and Windows YesOpen source archiver supporting authenticated encryption, cascaded encryption, volume spanning, customizable object level and volume level integrity checks (form CRCs to SHA-512 and Whirlpool hashes), fast deflate based compression
.phar PHAR MultipleMultipleYesA package format to enable distribution of applications and libraries by bundling many PHP code files and other resources (e.g. images, stylesheets, etc.) into a single archive file
.pimPIM Windows Windows YesThe format from the PIM - a freeware compression tool by Ilia Muraviev. It uses an LZP-based compression algorithm with set of filters for executable, image and audio files.
.pit PackIt Classic Mac OS obsolete
.qdaQuadruple D Windows Windows Used for data in games written using the Quadruple D library for Delphi. Uses byte pair compression.
.rarapplication/x-rar-compressed RAR Originally DOS, now multipleMultipleYesA popular proprietary archive format.
.rkRK and WinRK MultipleMultipleNoThe format from a commercial archiving package. Odd among commercial packages in that they focus on incorporating experimental algorithms with the highest possible compression (at the expense of speed and memory), such as PAQ, PPMD and PPMZ (PPMD with unlimited-length strings), as well as a proprietary algorithms.
.sdaSelf Dissolving ARChiveCommodore 64, Commodore 128Commodore 64, Commodore 128YesSDAs refer to Self Dissolving ARC files, and are based on the Commodore 64 and Commodore 128 versions of ARC, originally written by Chris Smeets. While the files share the same extension, they are not compatible between platforms. That is, an SDA created on a Commodore 64 but run on a Commodore 128 in Commodore 128 mode will crash the machine, and vice versa. The intended successor to SDA is SFX.
.seaSelf Extracting Archive Classic Mac OS Classic Mac OS (implicitly)A pre-Mac OS X Self-Extracting Archive format. StuffIt, Compact Pro, Disk Doubler and others could create .sea files, though the StuffIt versions were the most common.
.senSciferMultipleMultipleYesScifer Archive with internal header
.sfxSelf Extracting ArchiveCommodore 64, Commodore 128Commodore 64, Commodore 128YesSFX is a Self Extracting Archive which uses the LHArc compression algorithm. It was originally developed by Chris Smeets on the Commodore platform, and runs primarily using the CS-DOS extension for the Commodore 128. Unlike its predecessor SDA, SFX files will run on both the Commodore 64 and Commodore 128 regardless of which machine they were created on.
.shk NuFX Originally Apple II, now multipleMultipleYesAn archive format designed for the Apple II series of computers. The canonical implementation is ShrinkIt, which can operate on disk images as well as files. Preferred compression algorithm is a combination of RLE and 12-bit LZW. Archives can be manipulated with the command-line NuLib tool, or the Windows-based CiderPress.
.sitapplication/x-stuffit StuffIt Originally Classic Mac OS, now multipleOriginally Classic Mac OS, now multipleYesA compression format common on Apple Macintosh computers. The free StuffIt Expander is available for Windows and macOS.
.sitxapplication/x-stuffitx StuffIt X MultipleMultipleYesThe replacement for the .sit format that supports more compression methods, UNIX file permissions, long file names, very large files, more encryption options, data specific compressors (JPEG, Zip, PDF, 24-bit image, MP3). The free StuffIt Expander is available for Windows and OS X.
.sqx SQX Windows Windows YesA royalty-free compressing format
.tar.gz, .tgz, .tar.Z, .tar.bz2,
.tbz2, .tar.lz, .tlz, .tar.xz, .txz, .tar.zst
application/x-gtartar with gzip, compress, bzip2, lzip, xz, or zstdMultipleMultipleYesThe "tarball" format combines tar archives with a file-based compression scheme (usually gzip). Commonly used for source and binary distribution on Unix-like platforms, widely available elsewhere. Xarchiver supports the .tar.zst Archive/Compression format on Unix-like platforms.
.uc .uc0 .uc2 .ucn .ur2 .ue2UltraCompressor II DOS DOS UltraCompressor 2.3 was developed to act as an alternative to the then popular PKZIP application. The main feature of the application is its ability to create large archives. This means that compressed archives with the UC2 file extension can hold almost 1 million files.
.ucaPerfectCompress [11] Windows Windows NoBased on PAQ, RZM, CSC, CCM, and 7zip. The format consists of a PAQ, RZM, CSC, or CCM compressed file and a manifest with compression settings stored in a 7z archive.
.uhaUHarc DOS/Windows DOS/Windows YesA high compression rate archive format originally for DOS.
.war WAR MultipleMultipleYesWeb Application archive (Java-based web app)
.wimapplication/x-ms-wim Windows Image Windows Windows YesFile-based disk image format developed to deploy Microsoft Windows.
.xarapplication/x-xar XAR MultipleMultipleYes
.xp3 KiriKiri Windows Windows YesNative format of the Open Source KiriKiri Visual Novel engine. Uses combination of block splitting and zlib compression. The filenames and pathes are stored in UTF-16 format. For integrity check, the Adler-32 hashsum is used. For many commercial games, the files are encrypted (and decoded on runtime) via so-called "cxdec" module, which implements xor-based encryption.
.yz1 YZ1 Windows, DOS, Linux Windows, DOS, Linux YesYamazaki zipper archive. Compression format used in DeepFreezer archiver utility created by Yamazaki Satoshi. Read and write support exists in TUGZip, IZArc and ZipZag
.zip, .zipxapplication/zip ZIP Originally DOS, now multipleMultipleYesThe most widely used compression format on Microsoft Windows. Commonly used on Macintosh and Unix systems as well.
.zooapplication/x-zoo zoo MultipleMultipleYes
.zpaq ZPAQ MultipleMultipleYesJournaling (append-only) archive format with rollback capability. Supports deduplication and incremental update based on last-modified dates. Multi-threaded. Compresses in LZ77, BWT, and context mixing formats. Open source.
.zz Zzip MultipleMultipleYesArchiver with a compression algorithm based on the Burrows–Wheeler transform method.

Data recovery

File extension(s) [1] MIME type [2] Official name [3] Platform [4] Description
.ecc dvdisaster error-correction fileMultipleFile format used by dvdisaster to be used for data recovery when discs become damaged or partially unreadable.
.ecsbx Error-correcting SeqBox MultipleArchive with forward error correction and sector level recoverability. Error-correcting version of SeqBox.
.par
.par2
application/x-par2 Parchive fileMultipleFile format used in conjunction with any archive format to provide error correction and file recovery, most often in newsgroup distribution of binary files.
.rev WinRAR recovery volumeMultipleFile format used with WinRAR rar volumes. The data recovery is error correction data which is provided in the form of open recovery records and/or recovery volumes, allowing reconstruction of good archives (including reconstruction of entirely volumes)

Comparison

Containers and compression

FormatFilename
extension
Created
by
Introduced inBased onCompressionIntegrity checkRecovery recordEncryption supported Unicode filenamesModification date resolutionPre-processingLicense
Archive (ar).a, .ar CSRG 1971OriginalNoNoNoNoNo1 s??
cpio .cpio Bell Labs 1983 Unix System V ?NoPartial, select formats onlyNoNoNo1 sNo?
Shell Archive (shar and makeself).shar, .run James Gosling, Alan Hewett1994 4.4BSD OriginalNoYes, commonly MD5PartialPartialPartialarbitrary (typically 1 s)No Public domain (shar 1.x - 3.x),
GPL (shar 4.x)
Tape Archive (tar).tar Bell Labs 1975 Version 6 Unix ?NoPartial, metadata only. Full integrity providable by filters such as gzip.NoNoOptional 1 1 sNo?
Extended TAR format (pax).tar OpenGroup 2001Sun proposal + TARNometadataNoNoYesarbitrary (typically 1 ns)No?
BagIt The Library of Congress 2007file systemNoYesNoNoYesNo??
7z .7zIgor Pavlov2000 LZMA, LZMA2, Bzip2, PPMd, Deflate YesYes,
CRC32
NoYes,
AES-256
Yes100 ns [12] Yes LGPL, Public domain
ACE .aceMarcel Lemke1998 [13] LZ77 YesYesYesYes, BlowfishYes2s? Proprietary software
AFA.afaVicente Sánchez-Alarcos2009OriginalYesYesYesYes, AES and CASTYes???
ARC .arcThom Henderson (SEA)1985?YesCRC16Noweak XOR onlyNo2sNo?
ARJ .arjRobert Jung1991 AR001 and AR002 YesYesYesweak XOR with initial constantNo?? Proprietary software
B1 .b1Catalina Group Ltd2011 LZMA YesYesNoYes, AESYes???
Cabinet .cab Microsoft 1992 Windows 3.1 Deflate YesOptional PKCS7 Authenticode signatureNoOptional (with SDK)Yes2 s??
Compact File Set .cfsJoe Lowe (Pismo Technic Inc.)2008ZIP/LZMAYesYes?YesYes?? Free software
Compact Pro .cptBill Goodman1990 (as "Compactor")OriginalYesYesNoYes??? Proprietary software
Disk Archive (DAR).darDenis Corbin2002OriginalYesYesYes 2 YesYes1 µsYes GPLv2
DGCA .dgcShin-ichi Tsuruta2001GCAYesYesYesYesYes???
FreeArc .arcBulat Ziganshin2006 LZMA, PPMd, TTAYesYesYesYes, AES, Blowfish, Twofish and SerpentYes?? GPLv2
LHA (also LZH).lzh, .lhaHaruyasu Yoshizaki1988 Frozen YesOnly on recent LHA releasesNoNoNo1–2 s??
LZX .lzx Jonathan Forbes and Tomi Poutanen1995 LZ77 YesOnly on recent LZX releases??????
ISO image .iso, .img, .ima ISO 9660 1988 High Sierra Format No???????
Spark.arcDavid Pilling1989 ARC YesCRC16Noweak XOR onlyNo1 csNo CDDL 1.0 [14]
WinMount format .mou ?2007 ?YesYesYesYesYes?? Proprietary software
Macintosh Disk Image .dmg Apple Computer 2001 Mac OS X OriginalYesYes?Yes????
Partition Image (PartImage).partimgFrançois Dupoux and Franck Ladurelle2000?Yes?????? GPLv2
PAQ Family (Several formats) 4 .paq#*, .lpaq#*Matt Mahoney2002–2006OriginalYes???????
PEA .peaGiorgio Tani2006Original, Deflate based compressionYesYes Adler32, CRC32, CRC64, MD5, SHA1, RIPEMD-160, SHA256, SHA512, WhirlpoolNoYes Authenticated Encryption, AES128 and AES256 in EAX modeYes system dependentYes arbitrary? Public domain
PIM.pimIlia Muraviev2004–2008OriginalYesYesNoNoYesNo??
Quadruple D.qdaTaku Hayase (aka sandman)1997?Yes???????
RAR .rar Eugene Roshal 1993OriginalYesYes,
CRC32,
BLAKE2
Yes,
Reed–Solomon
Yes,
AES-256 for RAR 5.0 archives
Yes,
UTF-8
2 s, 1 s, 6.5536 ms, 25.6 µs or 100 ns 3 Dropped Proprietary software
RK.rkM Software, Ltd.2004OriginalYesYesNoYes, AES, Square, TwofishYes1 s??
NuFX.shkAndy Nicholas1989OriginalYesCRC16NoNoNo1 s??
StuffIt (also SIT).sitRaymond Lau1987?Yes??Yes??? Proprietary software
StuffIt X (also SITx).sitxAladdin/Allume Systems 2002?Yes?OptionalYes, RC4,Blowfish,
AES,DES
Yes?? Proprietary software
UltraCompressor II.uc .uc0 .uc2
.ucn .ur2 .ue2
Nico de Vries1992–1996 LZ77 and Huffman coding YesYesYesYes, triple DES? ???
Windows Image .wim, .swm, .esd Microsoft 2006OriginalYesYesNoPartial 5 Yes100 ns??
ZIP .zip Phil Katz 1989 Deflate YesYesNoYes, AESYes1–2 s, depending on version??
ZPAQ .zpaq Matt Mahoney 2009 PAQ YesYes, SHA-1NoYes, AES-256Yes?? MIT, Public domain

Notes

^1 While the original tar format uses the ASCII character encoding, current implementations use the UTF-8 (Unicode) encoding, which is backwards compatible with ASCII.
^2 Supports the external Parchive program (par2).
^3 From 3.20 release RAR can store modification, creation and last access time with the precision up to 0.0000001 second (= 0.1 µs). WinRAR download and support: Knowledge Base
^4 The PAQ family (with its lighter weight derivative LPAQ) went through many revisions, each revision suggested its own extension. For example: ".paq9a".
^5 WIM can store the ciphertext of encrypted files on an NTFS volume, but such files can only by decrypted if an administrator extracts the file to an NTFS volume, and the decryption key is available (typically from the file's original owner on the same Windows installation). Microsoft has also distributed some download versions of the Windows operating system as encrypted WIM files, but via an external encryption process and not a feature of WIM.

  • Purpose: Archive formats are used for backups, mobility, and archiving. Many archive formats compress the data to consume less storage space and result in quicker transfer times as the same data is represented by fewer bytes. Another benefit is that files are combined into one archive file which has less overhead for managing or transferring. There are numerous compression algorithms available to losslessly compress archived data; some algorithms are designed to work better (smaller archive or faster compression) with particular data types. Archive formats are used by most operating systems to package software for easier distribution and installation than binary executables.
  • Filename extension: The DOS and Windows operating systems required filenames to include an extension (of at least one, and typically 3 characters) to identify the file type. Such extensions must be unique for each type of file. Many operating systems identify a file's type from its contents without the need for an extension in its name. However, the use of three-character extensions has been embraced as a useful and efficient shorthand for identifying file types.
  • Integrity check: Archive files are often stored on magnetic or other media subject to storage errors. Many archive formats contain extra error detection or correction information which can be used by the software used to read the archive files to detect and possibly correct errors.
  • Recovery record: Many archive formats contain redundant data embedded in the files in order to detect data storage or transmission errors, and the software used to read the archive files contains logic to detect and correct errors.
  • Encryption: Many archive formats include the capability to encrypt contents to prevent unauthorised access, using one of many available encryption methods.

Software packaging and distribution

FormatFilename
extension
Created
by
Introduced inBased onIntegrity checkRecovery recordEncryption supported Unicode filenamesModification date resolution
Debian package (deb).deb Debian 1994 Debian 0.91 ar, tar, and gzip, xz YesNoNoYes1 s
Macintosh Installer .pkg, .mpkg (metapackage) NeXT 1989 NeXTSTEP 1.0 pax and gzip Yes??Yes?
RPM Package Manager (RPM).rpm Red Hat 1995 Red Hat Linux 1.0 cpio and gzip Yes???1 s
Slackware Package.tgz Patrick Volkerding 1993 Slackware 1.0 tar and gzip YesNoNo??
Windows Installer (also MSI).msi Microsoft 2000 Windows 2000 OLE Structured Storage, Cabinet and SQL Optional PKCS7 Authenticode SignatureNoNoNo2 s
MSIX .appx, .appxbundle, .emsix, .emsixbundle, .msix, .msixbundle Microsoft 2012 Windows 8 Zip YesNoYesYes1 s
Java Archive (JAR 1 ).jar Sun Microsystems 1997 JDK 1.1 PKZIP YesNo?Yes?
Google Chrome extension package.crx Google 2009 (Chrome 4.0) Zip Yes?No??
Pacman .pkg.tar.zst, .pkg.tar (no compression) Judd Vinet 2001 (before ArchLinux 0.1) tar and zstd [15] (formerly xz)YesNoNoYes1 s

Notes

^1 Not to be confused with the archiver JAR written by Robert K. Jung, which produces ".j" files.

Features

Archive formatBuilt-in compressionSelf-extractingDirectory structure POSIX attributes ACLs Alternate data streams
cpio No 1 NoYesYesNo?
tar No 1 NoYesYesSome(in Solaris implementation)
dar Yes 3 NoYesYesYesYes
ar NoNoNoYesNo?
pax NoNoYesYesYes?
dump No 1 NoYesYesYes?
shar NoYesYesYesNo?
makeselfYesYesYesYesYes?
zip YesYes 2 YesNo??
rar YesYes 2 YesNo?Yes
ace Yes?YesNo??
arj YesYes 2 YesNoNo?
zoo Yes?YesNo??
ISO 9660 (CD-ROM)No 1 NoYes(with Rock Ridge extension)No?
cab YesYes 2 ?No??
rpm YesNoYesYes??
deb YesNoYesYes??
7z YesYesYesNo??
Archive formatBuilt-in compressionSelf-extractingDirectory Structure POSIX attributes ACLs Alternate data streams

Notes

^1 Compression is not a built-in feature of the formats, however, the resulting archive can be compressed with any algorithm of choice. Several implementations include functionality to do this automatically
^2 Most implementations can optionally produce a self-extracting executable
^3 Per-file compression with gzip, bzip2, lzo, xz, lzma (as opposed to compressing the whole archive). An individual can choose not to compress already compressed filenames based on their suffix as well.

See also

Footnotes

  1. 1 2 3 4 File extensions may differ across platforms. The case of these extensions may differ on case-insensitive platforms.
  2. 1 2 3 4 MIME media types may be conjectural. Very few have been officially registered with the IANA. Compression-only formats should often be denoted by the media type of the decompressed data, with a content coding indicating the compression format.
  3. 1 2 3 4 Official names may be disputed.
  4. 1 2 3 4 Creation platform indicates the platform(s) under which a format can be created.
  5. If attaching .F to the file name is not possible with the DOS operating system, the second and third character of the filename extension are replaced by XF.
  6. Genozip - A Universal Extensible Genomic Data Compressor https://academic.oup.com/bioinformatics/article/37/16/2225/6135077
  7. The 'application/zlib' and 'application/gzip' Media Types. doi: 10.17487/RFC6713 . RFC 6713 . Retrieved 2016-05-03.
  8. Restoration platform indicates the platform(s) under which a format can be restored/extracted. Most file formats can be understood by more than one platform.
  9. "Restorable with free software" indicates whether the format can be restored using an extraction tool that is free software.
  10. Tandem Unpak GUI http://sybond.web.id/project/?p=8
  11. "PerfectCompress". Archived from the original on 2012-07-23. Retrieved 2009-11-08.
  12. "7-Zip / Discussion / Open Discussion: File times: creation, access - precision". sourceforge.net.
  13. Lemke, Marcel (1998). "Technical Information on the archiver ACE V1.2". hugi.scene.org. Archived from the original on 4 April 2023. Retrieved 22 May 2023.
  14. "LICENCE · master · RiscOS / Sources / FileSys / ImageFS / SparkFS / Codecs / SparkSpark · GitLab". 28 January 2023. Retrieved 2023-03-26.
  15. "Arch Linux - News: Now using Zstandard instead of xz for package compression". archlinux.org.

Related Research Articles

A file archiver is a computer program that combines a number of files together into one archive file, or a series of archive files, for easier transportation or storage. File archivers may employ lossless data compression in their archive formats to reduce the size of the archive.

gzip GNU file compression/decompression tool

gzip is a file format and a software application used for file compression and decompression. The program was created by Jean-loup Gailly and Mark Adler as a free software replacement for the compress program used in early Unix systems, and intended for use by GNU. Version 0.1 was first publicly released on 31 October 1992, and version 1.0 followed in February 1993.

zlib DEFLATE codec library

zlib is a software library used for data compression as well as a data format. zlib was written by Jean-loup Gailly and Mark Adler and is an abstraction of the DEFLATE compression algorithm used in their gzip file compression program. zlib is also a crucial component of many software platforms, including Linux, macOS, and iOS. It has also been used in gaming consoles such as the PlayStation 4, PlayStation 3, Wii U, Wii, Xbox One and Xbox 360.

New Technology File System (NTFS) is a proprietary journaling file system developed by Microsoft. Starting with Windows NT 3.1, it is the default file system of the Windows NT family. It superseded File Allocation Table (FAT) as the preferred filesystem on Windows and is supported in Linux and BSD as well. NTFS reading and writing support is provided using a free and open-source kernel implementation known as NTFS3 in Linux and the NTFS-3G driver in BSD. By using the convert command, Windows can convert FAT32/16/12 into NTFS without the need to rewrite all files. NTFS uses several files typically hidden from the user to store metadata about other files stored on the drive which can help improve speed and performance when reading data. Unlike FAT and High Performance File System (HPFS), NTFS supports access control lists (ACLs), filesystem encryption, transparent compression, sparse files and file system journaling. NTFS also supports shadow copy to allow backups of a system while it is running, but the functionality of the shadow copies varies between different versions of Windows.

In computing, Deflate is a lossless data compression file format that uses a combination of LZ77 and Huffman coding. It was designed by Phil Katz, for version 2 of his PKZIP archiving tool. Deflate was later specified in RFC 1951 (1996).

In computing, tar is a computer software utility for collecting many files into one archive file, often referred to as a tarball, for distribution or backup purposes. The name is derived from "tape archive", as it was originally developed to write data to sequential I/O devices with no file system of their own, such as devices that use magnetic tape. The archive data sets created by tar contain various file system parameters, such as name, timestamps, ownership, file-access permissions, and directory organization. POSIX abandoned tar in favor of pax, yet tar sees continued widespread use.

ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed. The ZIP file format permits a number of compression algorithms, though DEFLATE is the most common. This format was originally created in 1989 and was first implemented in PKWARE, Inc.'s PKZIP utility, as a replacement for the previous ARC compression format by Thom Henderson. The ZIP format was then quickly supported by many software utilities other than PKZIP. Microsoft has included built-in ZIP support in versions of Microsoft Windows since 1998 via the "Plus! 98" addon for Windows 98. Native support was added as of the year 2000 in Windows ME. Apple has included built-in ZIP support in Mac OS X 10.3 and later. Most free operating systems have built in support for ZIP in similar manners to Windows and macOS.

RAR is a proprietary archive file format that supports data compression, error correction and file spanning. It was developed in 1993 by Russian software engineer Eugene Roshal and the software is licensed by win.rar GmbH. The name RAR stands for Roshal Archive.

<span class="mw-page-title-main">7-Zip</span> Open-source file archiver

7-Zip is a free and open-source file archiver, a utility used to place groups of files within compressed containers known as "archives". It is developed by Igor Pavlov and was first released in 1999. 7-Zip has its own archive format called 7z, but can read and write several others.

7z is a compressed archive file format that supports several different data compression, encryption and pre-processing algorithms. The 7z format initially appeared as implemented by the 7-Zip archiver. The 7-Zip program is publicly available under the terms of the GNU Lesser General Public License. The LZMA SDK 4.62 was placed in the public domain in December 2008. The latest stable version of 7-Zip and LZMA SDK is version 22.01.

StuffIt is a discontinued family of computer software utilities for archiving and compressing files. Originally produced for the Macintosh, versions for Microsoft Windows, Linux (x86), and Sun Solaris were later created. The proprietary compression format used by the StuffIt utilities is also termed StuffIt.

<span class="mw-page-title-main">WinRAR</span> File archiver

WinRAR is a trialware file archiver utility for Windows, developed by Eugene Roshal of win.rar GmbH. It can create and view archives in RAR or ZIP file formats, and unpack numerous archive file formats. To enable the user to test the integrity of archives, WinRAR embeds CRC32 or BLAKE2 checksums for each file in each archive. WinRAR supports creating encrypted, multi-part and self-extracting archives.

The following tables compare general and technical information for a number of file archivers. Please see the individual products' articles for further information. They are neither all-inclusive nor are some entries necessarily up to date. Unless otherwise specified in the footnotes section, comparisons are based on the stable versions—without add-ons, extensions or external programs.

In computing, an archive file is a computer file that is composed of one or more files along with metadata. Many archive formats also support compression of member files. Archive files are used to collect multiple data files together into a single file for easier portability and storage, or simply to compress files to use less storage space. Archive files often store directory structures, error detection and correction information, comments, and some use built-in encryption.

<span class="mw-page-title-main">Self-extracting archive</span>

A self-extracting archive is a computer executable program which combines compressed data in an archive file with machine-executable code to extract the information. Run on a compatible operating system, there is no need for a suitable extractor in the target computer to extract the data. The executable part of the file is known as a decompressor stub.

<span class="mw-page-title-main">PeaZip</span> File archive computer program

PeaZip is a free and open-source file manager and file archiver for Microsoft Windows, ReactOS, Linux, MacOS and BSD by Giorgio Tani. It supports its native PEA archive format and other mainstream formats, with special focus on handling open formats. Version 9.4.0 supported 234 file extensions.

XZ Utils is a set of free software command-line lossless data compressors, including the programs lzma and xz, for Unix-like operating systems and, from version 5.0 onwards, Microsoft Windows. For compression/decompression the Lempel–Ziv–Markov chain algorithm (LZMA) is used. XZ Utils started as a Unix port of Igor Pavlov's LZMA-SDK that has been adapted to fit seamlessly into Unix environments and their usual structure and behavior.

FreeArc is a free and open-source high-performance file archiver developed by Bulat Ziganshin. The project is presumably discontinued, since no information has been released by the developers since 2016 and the official website is down.

lzip Data compression utility

lzip is a free, command-line tool for the compression of data; it employs the Lempel–Ziv–Markov chain algorithm (LZMA) with a user interface that is familiar to users of usual Unix compression tools, such as gzip and bzip2.

Zstandard is a lossless data compression algorithm developed by Yann Collet at Facebook. Zstd is the corresponding reference implementation in C, released as open-source software on 31 August 2016.