List of archive formats

Last updated

This is a list of file formats used by archivers and compressors used to create archive files.

Contents

Archive formats by purpose

Archive formats are used for backups, mobility, and archiving. Many archive formats compress the data to consume less storage space and result in quicker transfer times as the same data is represented by fewer bytes. Another benefit is that files are combined into one archive file which has less overhead for managing or transferring. There are numerous compression algorithms available to losslessly compress archived data; some algorithms are designed to work better (smaller archive or faster compression) with particular data types. Archive formats are used by most operating systems to package software for easier distribution and installation than binary executables.

Archiving only

File extension(s) [lower-alpha 1] MIME type [lower-alpha 2] Official name [lower-alpha 3] Platform [lower-alpha 4] Description
.a, .arapplication/x-archive Unix Archiver Unix-likeThe traditional archive format on Unix-like systems, now used mainly for the creation of static libraries.
.cpioapplication/x-cpio cpio Unix-like RPM files consist of metadata concatenated with (usually) a cpio archive. Newer RPM systems also support other archives, as cpio is becoming obsolete. cpio is also used with initramfs.
.sharapplication/x-shar Shell archive Unix-likeA self-extracting archive that uses the Bourne shell (sh).
.LBR .LBR CP/M

DOS

A system for storing multiple files. LBR archives typically contained files processed by SQ, or the archive itself was compressed with SQ. LBR archives that were compressed with SQ ended with the extension .LQR
.isoapplication/x-iso9660-image ISO-9660 image (Various; cross platform)An archive format originally used mainly for archiving and distribution of the exact, nearly-exact, or custom-modified contents of an optical storage medium such as a CD-ROM or DVD-ROM. However, it can be used to archive the contents of other storage media, selected partitions, folders, and/or files. The resulting archive is typically optimized for convenient rendering to (re-)writable CD or DVD media.
.lbrCommodore 64/128A library format used primarily on the Commodore 64 and 128 lines of computers. This bears no resemblance to the DOS LBR format. While library files were quick to implement (a number of programs exist to work with them) they are crippled in that they cannot grow with use: once a file has been created it cannot be amended (files added, changed or deleted) without recreating the entire file.
.mar Mozilla ARchive [1] (Various; cross platform)An archive format used by Mozilla for storing binary diffs. Used in conjunction with bzip2.
.sbxapplication/x-sbx SeqBox [2] (Various; cross platform)A single file container/archive that can be reconstructed even after total loss of file system structures.
.tarapplication/x-tar Tape archive Unix-likeA common archive format used on Unix-like systems. Generally used in conjunction with compressors such as gzip, bzip2, compress or xz to create .tar.gz, .tar.bz2, .tar.Z or tar.xz files.

Compression only

File extension(s) [lower-alpha 1] MIME type [lower-alpha 2] Official name [lower-alpha 3] Platform [lower-alpha 4] Description
.brapplication/x-brotli Brotli allBrotli is a compression algorithm developed by Google for textual web content, and typically achieves higher compression ratios than other algorithms for this use case.
.bz2application/x-bzip2 bzip2 Unix-likeAn open source, patent- and royalty-free compression format. The compression algorithm is a Burrows–Wheeler transform followed by a move-to-front transform and finally Huffman coding.
.F, .?XF [lower-alpha 5] Freeze/melt [3] QNX4, Unix-like and DOS Old compressor for QNX4 OS. The compression algorithm is a modified LZSS, with an adaptive Huffman coding.
.genozipapplication/vnd.genozip genozip Linux, macOS, WindowsGenozip, a compressor for genomic file formats such as FASTQ, BAM, VCF and others. [4]
.gzapplication/gzip [5] gzip Unix-like GNU Zip, the primary compression format used by Unix-like systems. The compression algorithm is Deflate, which combines LZSS with Huffman coding.
.lzapplication/x-lzip lzip Unix-likeAn alternate LZMA algorithm implementation, with support for checksums and ident bytes.
.lz4 LZ4 Unix-likeAlgorithm developed by Yann Collet, designed for very high (de)compression speeds. It is an LZ77 derivative, without entropy encoding.
.lzmaapplication/x-lzma lzma Unix-likeThe LZMA compression algorithm as used by 7-Zip.
.lzoapplication/x-lzop lzop Unix-likeAn implementation of the LZO data compression algorithm.
.rz rzip Unix-likeA compression program designed to do particularly well on very large files containing long distance redundancy.
.sfark sfArk Windows compress/decompress- Linux and macOS decompress onlyA compression program designed to do high compression on SF2 files (SoundFont).
.szapplication/x-snappy-framed Snappy Unix-likeA compression format developed by Google, and open-sourced in 2011. Snappy aims for very high speeds, reasonable compression, and maximum stability rather than maximum compression or compatibility with any other compression library. It is an LZ77 derivative, without entropy encoding.
.?Q? SQ CP/M and DOS Squeeze: A program which compressed files using Huffman coding. A file which was "squeezed" had the middle initial of the name changed to "Q", so that a squeezed text file would end with .TQT, a squeezed executable would end with .CQM or .EQE. Typically used with .LBR archives, either by storing the squeezed files in the archive, or by storing the files decompressed and then compressing the archive, which would have a name ending in ".LQR".
.?Z? CRUNCH CP/M and DOS A compression program written by Steven Greenberg implementing the LZW algorithm. For several years in the CP/M world when no implementation was available of ARC, CRUNCHed files stored in .LBR archives were very popular. CRUNCH's implementation of LZW had a somewhat unusual feature of modifying and occasionally clearing the code table in memory when it became full, resulting in a few percent better compression on many files.
.xzapplication/x-xz xz Unix-likeA compression format using LZMA2 to yield high compression ratios. The LZMA algorithm is an LZ77 derivative, with entropy encoding in the form of range encoding.
.zapplication/x-compress pack Unix-likeThe traditional Huffman coding compression format.
.Zapplication/x-compress compress Unix-likeThe traditional LZW compression format.
.zstapplication/zstd Zstandard Cross-platform Algorithm developed by Yann Collet at Facebook, combining high speed and high compression. It is an LZ77 derivative, with entropy encoding in the form of finite-state entropy and Huffman coding.
.??_MS-DOS/WindowsCompression format(s) used by some DOS and Windows install programs. MS-DOS includes expand.exe to decompress its install files. The compressed files are created with a matching compress.exe command. The compression algorithm is LZSS.

Archiving and compression

File extension(s) [lower-alpha 1] MIME type [lower-alpha 2] Official name [lower-alpha 3] Creation platform [lower-alpha 4] Restoration platform [lower-alpha 6] Restorable with free software [lower-alpha 7] Description
.7zapplication/x-7z-compressed 7z MultipleMultipleYesOpen source file format. Used by 7-Zip.
.s7zapplication/x-7z-compressed7zX macOS macOS, restoration on different platforms is possible although not immediateYesBased on 7z. Preserves Spotlight metadata, resource forks, owner/group information, dates and other data which would be otherwise lost with compression.

Made obsolete by the introduction of AppleDouble-encoded 7z archives (Macintosh only).

.aarapplication/octet-streamApple Archive macOS macOS NoCreated by Apple introduced with iOS 14 in 2020.
.aceapplication/x-ace-compressed ACE Windows MultipleYesProprietary format
.afaapplication/x-astrotite-afaAFAUNIX-likeUNIX-likeNoA format that compresses and doubly encrypt the data (AES256 and CAS256) avoiding brute force attacks, also hide files in an AFA file. It has two ways to safeguard data integrity and subsequent repair of the file if has an error (repair with AstroA2P (online) or Astrotite (offline)).
.alzapplication/x-alz-compressed ALZip Windows MultipleYesA mainly Korean format designed for very large archives.
.apkapplication/vnd.android.package-archive APK MultipleMultipleYes Android application package (variant of JAR file format).
.arc, .arkapplication/octet-stream ARC MultipleMultipleYesVery popular in the early days of BBSes, one of the first to offer compression and archiving in a single program. Largely replaced by PKZIP.
.arc, .cdxapplication/x-freearc FreeArc Windows, Linux Windows, Linux YesOpen source file format developed by Bulat Ziganshin. A "FreeArc Next" version is under development which includes Zstandard support.
.arjapplication/x-arj ARJ Originally DOS, now multipleMultipleYesCompetitor to PKZIP in the 1990s, offered better multi-part archive handling.
.b1application/x-b1 B1 MultipleMultipleYesOpen archive format, used by B1 Free Archiver [6]
.b6zB6Z macOS MultipleYesCompressed archive format B6Zip [7]
.baSciferMultipleMultipleYesBinary Archive with external header
.bhBlakHoleYesProprietary format from the ZipTV Compression Components
.cabapplication/vnd.ms-cab-compressed Cabinet Windows MultipleYesThe Microsoft Windows native archive format, which is also used by many proprietary installers such as WISE.
.carCompressia archiveOriginally DOS, now DOS and Windows Originally DOS, now DOS and Windows YesCreated by Yaakov Gringeler; released last in 2003 (Compressia 1.0.0.1 beta), now apparently defunct. Free trial of 30 days lets user create and extract archives; after that it is possible to extract, but not to create.
.cfsapplication/x-cfs-compressed Compact File Set Windows, Unix-like including macOS MultipleYesOpen source file format.
.cpt Compact Pro Classic Mac OS MultipleYesCompact Pro archive, a common archiver used on Mac platforms until about Mac OS 7.5.x. Competed with StuffIt; now obsolete.
.darapplication/x-dar Disk Archiver Unix-like including macOS Unix-like including macOS, Windows YesOpen source file format. Files are compressed individually with either gzip, bzip2 or lzo.
.dd DiskDoubler Classic Mac OS Obsolete
.dgcapplication/x-dgc-compressed DGCA Windows Windows
.dmg application/x-apple-diskimage Apple Disk Image macOS macOS, Windows, Linux YesSupports "Internet-enabled" disk images, which, once downloaded, are automatically decompressed, mounted, have the contents extracted, and thrown away. Currently, Safari is the only browser that supports this form of extraction; however, the images can be manually extracted as well. This format can also be password-protected or encrypted with 128-bit or 256-bit AES encryption.
.ear EAR MultipleMultipleYesEnterprise Java Archive archive
.gcaapplication/x-gca-compressedGCAThe predecessor of DGCA.
.genozip Genozip [8] Linux, macOS, Windows Linux, macOS, Windows NoOptimized for (but not limited to) genomic file types such as FASTQ, BAM, VCF
.ha [9] Originally DOS, now multipleMultipleYes, but may be covered by patents DOS era format; uses arithmetic/Markov coding
.hki WinHKI [10] MS WindowsMS WindowsNoHKI
.iceICE Windows Windows YesProduced by ICEOWS program. Excels at text file compression.
.jarapplication/java-archive JAR MultipleMultipleYesJava archive, compatible with ZIP files
.kgb KGB Archiver MultipleMultipleYesOpen sourced archiver with compression using the PAQ family of algorithms and optional encryption.
.lzh, .lhaapplication/x-lzh LHA Originally DOS, now multipleMultipleYesThe standard format on Amiga.
.lzxapplication/x-lzx LZX Amiga Archiver originally used on The Amiga. Now copied by Microsoft to use in their .cab and .chm files.
.pak PAK HP NonStop HP NonStop Yes [11] file format from NoGate Consultings, a rival from ARC-Compressor.

.pak was also briefly used by the short lived MSDOS PKPAK program.

.partimg PartImage MultipleMultipleYesA disk image archive format that supports several compression methods as well as splitting the archive into smaller pieces.
.paq6, .paq7, .paq8 and variants PAQ Unix-like and Windows Unix-like and Windows YesAn experimental open source packager [12]
.pea PeaZip Linux and Windows Linux and Windows YesOpen source archiver supporting authenticated encryption, cascaded encryption, volume spanning, customizable object level and volume level integrity checks (form CRCs to SHA-512 and Whirlpool hashes), fast deflate based compression
.phar PHAR MultipleMultipleYesA package format to enable distribution of applications and libraries by bundling many PHP code files and other resources (e.g. images, stylesheets, etc.) into a single archive file
.pimPIM Windows Windows YesThe format from the PIM - a freeware compression tool by Ilia Muraviev. It uses an LZP-based compression algorithm with set of filters for executable, image and audio files.
.pit PackIt Classic Mac OS obsolete
.qdaQuadruple D Windows Windows Used for data in games written using the Quadruple D library for Delphi. Uses byte pair compression.
.rarapplication/x-rar-compressed RAR Originally DOS, now WinRAR MultiplePartial [lower-alpha 8] A popular proprietary archive format.
.rkRK and WinRK [16] MultipleMultipleNoThe format from a proprietary archiving package. Odd among proprietary packages in that they focus on incorporating experimental algorithms with the highest possible compression (at the expense of speed and memory), such as PAQ, PPMD and PPMZ (PPMD with unlimited-length strings), as well as a proprietary algorithms.
.sdaSelf Dissolving ARChiveCommodore 64, Commodore 128Commodore 64, Commodore 128YesSDAs refer to Self Dissolving ARC files, and are based on the Commodore 64 and Commodore 128 versions of ARC, originally written by Chris Smeets. While the files share the same extension, they are not compatible between platforms. That is, an SDA created on a Commodore 64 but run on a Commodore 128 in Commodore 128 mode will crash the machine, and vice versa. The intended successor to SDA is SFX.
.seaSelf Extracting Archive Classic Mac OS Classic Mac OS (implicitly)A pre-Mac OS X Self-Extracting Archive format. StuffIt, Compact Pro, Disk Doubler and others could create .sea files, though the StuffIt versions were the most common.
.senSciferMultipleMultipleYesScifer Archive with internal header
.sfxSelf Extracting ArchiveCommodore 64, Commodore 128Commodore 64, Commodore 128YesSFX is a Self Extracting Archive which uses the LHArc compression algorithm. It was originally developed by Chris Smeets on the Commodore platform, and runs primarily using the CS-DOS extension for the Commodore 128. Unlike its predecessor SDA, SFX files will run on both the Commodore 64 and Commodore 128 regardless of which machine they were created on.
.shk NuFX Originally Apple II, now multipleMultipleYesAn archive format designed for the Apple II series of computers. The canonical implementation is ShrinkIt, which can operate on disk images as well as files. Preferred compression algorithm is a combination of RLE and 12-bit LZW. Archives can be manipulated with the command-line NuLib tool, or the Windows-based CiderPress.
.sitapplication/x-stuffit StuffIt Originally Classic Mac OS, now multipleOriginally Classic Mac OS, now multipleYesA compression format common on Apple Macintosh computers. The free StuffIt Expander is available for Windows and macOS.
.sitxapplication/x-stuffitx StuffIt X MultipleMultipleYesThe replacement for the .sit format that supports more compression methods, UNIX file permissions, long file names, very large files, more encryption options, data specific compressors (JPEG, Zip, PDF, 24-bit image, MP3). The free StuffIt Expander is available for Windows and OS X.
.sqx SQX Windows Windows YesA royalty-free compressing format
.tar.gz, .tgz, .tar.Z, .tar.bz2,
.tbz2, .tar.lz, .tlz, .tar.xz, .txz, .tar.zst
application/x-gtartar with gzip, compress, bzip2, lzip, xz, or zstdMultipleMultipleYesThe "tarball" format combines tar archives with a file-based compression scheme (usually gzip). Commonly used for source and binary distribution on Unix-like platforms, widely available elsewhere. Xarchiver supports the .tar.zst Archive/Compression format on Unix-like platforms.
.uc .uc0 .uc2 .ucn .ur2 .ue2UltraCompressor II DOS DOS UltraCompressor 2.3 was developed to act as an alternative to the then popular PKZIP application. The main feature of the application is its ability to create large archives. This means that compressed archives with the UC2 file extension can hold almost 1 million files.
.ucaPerfectCompress [17] Windows Windows NoBased on PAQ, RZM, CSC, CCM, and 7zip. The format consists of a PAQ, RZM, CSC, or CCM compressed file and a manifest with compression settings stored in a 7z archive.
.uhaUHarc DOS/Windows DOS/Windows YesA high compression rate archive format originally for DOS.
.war WAR MultipleMultipleYesWeb Application archive (Java-based web app)
.wimapplication/x-ms-wim Windows Image Windows Windows YesFile-based disk image format developed to deploy Microsoft Windows.
.xarapplication/x-xar XAR MultipleMultipleYes
.xp3 KiriKiri Windows Windows YesNative format of the Open Source KiriKiri Visual Novel engine. Uses combination of block splitting and zlib compression. The filenames and pathes are stored in UTF-16 format. For integrity check, the Adler-32 hashsum is used. For many proprietary games, the files are encrypted (and decoded on runtime) via so-called "cxdec" module, which implements xor-based encryption.
.yz1 YZ1 Windows, DOS, Linux Windows, DOS, Linux YesYamazaki zipper archive. Compression format used in DeepFreezer archiver utility created by Yamazaki Satoshi. [18] Read and write support exists in TUGZip, IZArc and ZipZag
.zip, .zipxapplication/zip ZIP Originally DOS, now multipleMultipleYesThe most widely used compression format on Microsoft Windows. Commonly used on Macintosh and Unix systems as well.
.zooapplication/x-zoo zoo MultipleMultipleYes
.zpaq ZPAQ MultipleMultipleYesJournaling (append-only) archive format with rollback capability. Supports deduplication and incremental update based on last-modified dates. Multi-threaded. Compresses in LZ77, BWT, and context mixing formats. Open source.
.zz Zzip [19] MultipleMultipleYesArchiver with a compression algorithm based on the Burrows–Wheeler transform method.

Data recovery

File extension(s) [lower-alpha 1] MIME type [lower-alpha 2] Official name [lower-alpha 3] Platform [lower-alpha 4] Description
.ecc dvdisaster error-correction fileMultipleFile format used by dvdisaster to be used for data recovery when discs become damaged or partially unreadable.
.ecsbx Error-correcting SeqBox [20] MultipleArchive with forward error correction and sector level recoverability. Error-correcting version of SeqBox. [2]
.par
.par2
application/x-par2 Parchive fileMultipleFile format used in conjunction with any archive format to provide error correction and file recovery, most often in newsgroup distribution of binary files.
.rev WinRAR recovery volumeMultipleFile format used with WinRAR rar volumes. The data recovery is error correction data which is provided in the form of open recovery records and/or recovery volumes, allowing reconstruction of good archives (including reconstruction of entirely volumes)

Comparison

Containers and compression

FormatFilename
extension [lower-alpha 9]
Created
by
Introduced inBased onCompressionIntegrity check [lower-alpha 10] Recovery record [lower-alpha 11] Encryption supported [lower-alpha 12] Unicode filenamesModification date resolutionPre-processingLicense
Archive (ar).a, .ar CSRG 1971OriginalNoNoNoNoNo1 s ? ?
cpio .cpio Bell Labs 1983 Unix System V  ?NoPartial, select formats onlyNoNoNo1 sNo ?
Shell Archive (shar and makeself).shar, .run James Gosling, Alan Hewett1994 4.4BSD OriginalNoYes, commonly MD5PartialPartialPartialarbitrary (typically 1 s)No Public domain (shar 1.x - 3.x),
GPL (shar 4.x)
Tape Archive (tar).tar Bell Labs 1975 Version 6 Unix  ?NoPartial, metadata only. Full integrity providable by filters such as gzip.NoNoOptional [lower-alpha 13] 1 sNo ?
Extended TAR format (pax).tar OpenGroup 2001Sun proposal + TARNometadataNoNoYesarbitrary (typically 1 ns)No ?
BagIt The Library of Congress 2007file systemNoYesNoNoYesNo ? ?
7z .7zIgor Pavlov2000 LZMA, LZMA2, Bzip2, PPMd, Deflate YesYes,
CRC32
NoYes,
AES-256
Yes100 ns [21] Yes LGPL, Public domain
ACE .aceMarcel Lemke1998 [22] LZ77 YesYesYesYes, BlowfishYes2s ? Proprietary software
AFA.afaVicente Sánchez-Alarcos2009OriginalYesYesYesYes, AES and CASTYes ? ? ?
ARC .arcThom Henderson (SEA)1985 ?YesCRC16Noweak XOR onlyNo2sNo ?
ARJ .arjRobert Jung1991 AR001 and AR002 YesYesYesweak XOR with initial constantNo ? ? Proprietary software
B1 .b1Catalina Group Ltd2011 LZMA YesYesNoYes, AESYes ? ? ?
Cabinet .cab Microsoft 1992 Windows 3.1 Deflate YesOptional PKCS7 Authenticode signatureNoOptional (with SDK)Yes2 s ? ?
Compact File Set .cfsJoe Lowe (Pismo Technic Inc.)2008ZIP/LZMAYesYes ?YesYes ? ? Free software
Compact Pro .cptBill Goodman1990 (as "Compactor")OriginalYesYesNoYes ? ? ? Proprietary software
Disk Archive (DAR).darDenis Corbin2002OriginalYesYesYes [lower-alpha 14] YesYes1 μsYes GPLv2
DGCA .dgcShin-ichi Tsuruta2001GCAYesYesYesYesYes ? ? ?
FreeArc .arcBulat Ziganshin2006 LZMA, PPMd, TTAYesYesYesYes, AES, Blowfish, Twofish and SerpentYes ? ? GPLv2
LHA (also LZH).lzh, .lhaHaruyasu Yoshizaki1988 Frozen YesOnly on recent LHA releasesNoNoNo1–2 s ? ?
LZX .lzx Jonathan Forbes and Tomi Poutanen1995 LZ77 YesOnly on recent LZX releases ? ? ? ? ? ?
ISO image .iso, .img, .ima ISO 9660 1988 High Sierra Format No ? ? ? ? ? ? ?
Spark.arcDavid Pilling1989 ARC YesCRC16Noweak XOR onlyNo1 csNo CDDL 1.0 [23]
WinMount format .mou ?2007 ?YesYesYesYesYes ? ? Proprietary software
Macintosh Disk Image .dmg Apple Computer 2001 Mac OS X OriginalYesYes ?Yes ? ? ? ?
Partition Image (PartImage).partimgFrançois Dupoux and Franck Ladurelle2000 ?Yes ? ? ? ? ? ? GPLv2
PAQ Family (Several formats) [lower-alpha 15] .paq#*, .lpaq#*Matt Mahoney2002–2006OriginalYes ? ? ? ? ? ? ?
PEA .peaGiorgio Tani2006Original, Deflate based compressionYesYes Adler32, CRC32, CRC64, MD5, SHA1, RIPEMD-160, SHA256, SHA512, WhirlpoolNoYes Authenticated Encryption, AES128 and AES256 in EAX modeYes system dependentYes arbitrary ? Public domain
PIM.pimIlia Muraviev2004–2008OriginalYesYesNoNoYesNo ? ?
Quadruple D.qdaTaku Hayase (aka sandman)1997 ?Yes ? ? ? ? ? ? ?
RAR .rar Eugene Roshal 1993OriginalYesYes,
CRC32,
BLAKE2
Yes,
Reed–Solomon
Yes,
AES-256 for RAR 5.0 archives
Yes,
UTF-8
2 s, 1 s, 6.5536 ms, 25.6 μs or 100 ns [lower-alpha 16] Dropped Proprietary software
RK.rkM Software, Ltd.2004OriginalYesYesNoYes, AES, Square, TwofishYes1 s ? ?
NuFX.shkAndy Nicholas1989OriginalYesCRC16NoNoNo1 s ? ?
StuffIt (also SIT).sitRaymond Lau1987 ?Yes ? ?Yes ? ? ? Proprietary software
StuffIt X (also SITx).sitxAladdin/Allume Systems 2002 ?Yes ?OptionalYes, RC4,Blowfish,
AES,DES
Yes ? ? Proprietary software
UltraCompressor II.uc .uc0 .uc2
.ucn .ur2 .ue2
Nico de Vries [26] 1992–1996 LZ77 and Huffman coding YesYesYesYes, triple DES ? ? ? ?
Windows Image .wim, .swm, .esd Microsoft 2006OriginalYesYesNoPartial [lower-alpha 17] Yes100 ns ? ?
ZIP .zip Phil Katz 1989 Deflate YesYesNoYes, AESYes1–2 s, depending on version ? ?
ZPAQ .zpaq Matt Mahoney 2009 PAQ YesYes, SHA-1NoYes, AES-256Yes ? ? MIT, Public domain

Software packaging and distribution

FormatFilename
extension
Created
by
Introduced inBased onIntegrity checkRecovery recordEncryption supported Unicode filenamesModification date resolution
Debian package (deb).deb Debian 1994 Debian 0.91 ar, tar, and gzip, xz YesNoNoYes1 s
Macintosh Installer .pkg, .mpkg (metapackage) NeXT 1989 NeXTSTEP 1.0 pax and gzip Yes ? ?Yes ?
RPM Package Manager (RPM).rpm Red Hat 1995 Red Hat Linux 1.0 cpio and gzip Yes ? ? ?1 s
Slackware Package.tgz Patrick Volkerding 1993 Slackware 1.0 tar and gzip YesNoNo ? ?
Windows Installer (also MSI).msi Microsoft 2000 Windows 2000 OLE Structured Storage, Cabinet and SQL Optional PKCS7 Authenticode SignatureNoNoNo2 s
MSIX .appx, .appxbundle, .emsix, .emsixbundle, .msix, .msixbundle Microsoft 2012 Windows 8 Zip YesNoYesYes1 s
Java Archive (JAR [lower-alpha 18] ).jar Sun Microsystems 1997 JDK 1.1 PKZIP YesNo ?Yes ?
Google Chrome extension package.crx Google 2009 (Chrome 4.0) Zip Yes ?No ? ?
Pacman .pkg.tar.zst, .pkg.tar (no compression) Judd Vinet 2001 (before ArchLinux 0.1) tar and zstd [27] (formerly xz)YesNoNoYes1 s

Features

Archive formatBuilt-in compressionSelf-extractingDirectory structure POSIX attributes ACLs Alternate data streams
cpio No [lower-alpha 19] [lower-alpha 19] NoYesYesNo ?
tar No [lower-alpha 19] NoYesYesSome(in Solaris implementation)
dar Yes [lower-alpha 20] NoYesYesYesYes
ar NoNoNoYesNo ?
pax NoNoYesYesYes ?
dump No [lower-alpha 19] NoYesYesYes ?
shar NoYesYesYesNo ?
makeselfYesYesYesYesYes ?
zip YesYes [lower-alpha 21] YesNo ? ?
rar YesYes [lower-alpha 21] YesNo ?Yes
ace Yes ?YesNo ? ?
arj YesYes [lower-alpha 21] YesNoNo ?
zoo Yes ?YesNo ? ?
ISO 9660 (CD-ROM)No [lower-alpha 19] NoYes(with Rock Ridge extension)No ?
cab YesYes [lower-alpha 21]  ?No ? ?
rpm YesNoYesYes ? ?
deb YesNoYesYes ? ?
7z YesYesYesNo ? ?
Archive formatBuilt-in compressionSelf-extractingDirectory Structure POSIX attributes ACLs Alternate data streams

See also

Notes

  1. 1 2 3 4 File extensions may differ across platforms. The case of these extensions may differ on case-insensitive platforms.
  2. 1 2 3 4 MIME media types may be conjectural. Very few have been officially registered with the IANA. Compression-only formats should often be denoted by the media type of the decompressed data, with a content coding indicating the compression format.
  3. 1 2 3 4 Official names may be disputed.
  4. 1 2 3 4 Creation platform indicates the platform(s) under which a format can be created.
  5. If attaching .F to the file name is not possible with the DOS operating system, the second and third character of the filename extension are replaced by XF.
  6. Restoration platform indicates the platform(s) under which a format can be restored/extracted. Most file formats can be understood by more than one platform.
  7. "Restorable with free software" indicates whether the format can be restored using an extraction tool that is free software.
  8. RARLAB UnRAR is proprietary. The free unar has partial unpacking support for RAR1.3, RAR1.5, RAR2, RAR3 & RAR5 [13] and the free libarchive has partial unpacking support for RAR3 [14] & RAR5 [15] .
  9. The DOS and Windows operating systems required filenames to include an extension (of at least one, and typically 3 characters) to identify the file type. Such extensions must be unique for each type of file. Many operating systems identify a file's type from its contents without the need for an extension in its name. However, the use of three-character extensions has been embraced as a useful and efficient shorthand for identifying file types.
  10. Archive files are often stored on magnetic or other media subject to storage errors. Many archive formats contain extra error detection or correction information which can be used by the software used to read the archive files to detect and possibly correct errors.
  11. Many archive formats contain redundant data embedded in the files in order to detect data storage or transmission errors, and the software used to read the archive files contains logic to detect and correct errors.
  12. Many archive formats include the capability to encrypt contents to prevent unauthorised access, using one of many available encryption methods.
  13. While the original tar format uses the ASCII character encoding, current implementations use the UTF-8 (Unicode) encoding, which is backwards compatible with ASCII.
  14. Supports the external Parchive program (par2).
  15. The PAQ family (with its lighter weight derivative LPAQ) went through many revisions, each revision suggested its own extension. For example: ".paq9a".
  16. From 3.20 release RAR can store modification, creation and last access time with the precision up to 0.0000001 second (0.1 μs). [24] [25]
  17. WIM can store the ciphertext of encrypted files on an NTFS volume, but such files can only by decrypted if an administrator extracts the file to an NTFS volume, and the decryption key is available (typically from the file's original owner on the same Windows installation). Microsoft has also distributed some download versions of the Windows operating system as encrypted WIM files, but via an external encryption process and not a feature of WIM.
  18. Not to be confused with the archiver JAR written by Robert K. Jung, which produces ".j" files.
  19. 1 2 3 4 5 Compression is not a built-in feature of the formats, however, the resulting archive can be compressed with any algorithm of choice. Several implementations include functionality to do this automatically
  20. Per-file compression with gzip, bzip2, lzo, xz, lzma (as opposed to compressing the whole archive). An individual can choose not to compress already compressed filenames based on their suffix as well.
  21. 1 2 3 4 Most implementations can optionally produce a self-extracting executable

Related Research Articles

A file archiver is a computer program that combines a number of files together into one archive file, or a series of archive files, for easier transportation or storage. File archivers may employ lossless data compression in their archive formats to reduce the size of the archive.

gzip GNU file compression/decompression tool

gzip is a file format and a software application used for file compression and decompression. The program was created by Jean-loup Gailly and Mark Adler as a free software replacement for the compress program used in early Unix systems, and intended for use by GNU. Version 0.1 was first publicly released on 31 October 1992, and version 1.0 followed in February 1993.

zlib DEFLATE codec library

zlib is a software library used for data compression as well as a data format. zlib was written by Jean-loup Gailly and Mark Adler and is an abstraction of the DEFLATE compression algorithm used in their gzip file compression program. zlib is also a crucial component of many software platforms, including Linux, macOS, and iOS. It has also been used in gaming consoles such as the PlayStation 4, PlayStation 3, Wii U, Wii, Xbox One and Xbox 360.

NT File System (NTFS) is a proprietary journaling file system developed by Microsoft in the 1990s.

In computing, tar is a computer software utility for collecting many files into one archive file, often referred to as a tarball, for distribution or backup purposes. The name is derived from "tape archive", as it was originally developed to write data to sequential I/O devices with no file system of their own, such as devices that use magnetic tape. The archive data sets created by tar contain various file system parameters, such as name, timestamps, ownership, file-access permissions, and directory organization. POSIX abandoned tar in favor of pax, yet tar sees continued widespread use.

ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed. The ZIP file format permits a number of compression algorithms, though DEFLATE is the most common. This format was originally created in 1989 and was first implemented in PKWARE, Inc.'s PKZIP utility, as a replacement for the previous ARC compression format by Thom Henderson. The ZIP format was then quickly supported by many software utilities other than PKZIP. Microsoft has included built-in ZIP support in versions of Microsoft Windows since 1998 via the "Plus! 98" addon for Windows 98. Native support was added as of the year 2000 in Windows ME. Apple has included built-in ZIP support in Mac OS X 10.3 and later. Most free operating systems have built in support for ZIP in similar manners to Windows and macOS.

RAR is a proprietary archive file format that supports data compression, error correction and file spanning. It was developed in 1993 by Russian software engineer Eugene Roshal and the software is licensed by win.rar GmbH. The name RAR stands for Roshal Archive.

Delta encoding is a way of storing or transmitting data in the form of differences (deltas) between sequential data rather than complete files; more generally this is known as data differencing. Delta encoding is sometimes called delta compression, particularly where archival histories of changes are required.

Cabinet is an archive-file format for Microsoft Windows that supports lossless data compression and embedded digital certificates used for maintaining archive integrity. Cabinet files have .cab filename extensions and are recognized by their first four bytes MSCF. Cabinet files were known originally as Diamond files.

<span class="mw-page-title-main">7-Zip</span> Open-source file archiver

7-Zip is a free and open-source file archiver, a utility used to place groups of files within compressed containers known as "archives". It is developed by Igor Pavlov and was first released in 1999. 7-Zip has its own archive format called 7z, but can read and write several others.

7z is a compressed archive file format that supports several different data compression, encryption and pre-processing algorithms. The 7z format initially appeared as implemented by the 7-Zip archiver. The 7-Zip program is publicly available under the terms of the GNU Lesser General Public License. The LZMA SDK 4.62 was placed in the public domain in December 2008. The latest stable version of 7-Zip and LZMA SDK is version 24.05.

StuffIt is a discontinued family of computer software utilities for archiving and compressing files. Originally produced for Macintosh, versions for Microsoft Windows, Linux (x86), and Sun Solaris were later created. The proprietary compression format used by the StuffIt utilities is also termed StuffIt.

The following tables compare general and technical information for a number of file archivers. Please see the individual products' articles for further information. They are neither all-inclusive nor are some entries necessarily up to date. Unless otherwise specified in the footnotes section, comparisons are based on the stable versions—without add-ons, extensions or external programs.

In computing, an archive file is a computer file that is composed of one or more files along with metadata. Many archive formats also support compression of member files. Archive files are used to collect multiple data files together into a single file for easier portability and storage, or simply to compress files to use less storage space. Archive files often store directory structures, error detection and correction information, comments, and some use built-in encryption.

An image file format is a file format for a digital image. There are many formats that can be used, such as JPEG, PNG, and GIF. Most formats up until 2022 were for storing 2D images, not 3D ones. The data stored in an image file format may be compressed or uncompressed. If the data is compressed, it may be done so using lossy compression or lossless compression. For graphic design applications, vector formats are often used. Some image file formats support transparency.

<span class="mw-page-title-main">Comic book archive</span> File format

A comic book archive or comic book reader file is a type of archive file for the purpose of sequential viewing of images, commonly for comic books. The idea was made popular by the CDisplay sequential image viewer; since then, many viewers for different platforms have been created.

<span class="mw-page-title-main">The Unarchiver</span> File decompression utility

The Unarchiver is a proprietary freeware data decompression utility, which supports more formats than Archive Utility, the built-in archive unpacker program in macOS. It can also handle filenames in various character encodings, created using operating system versions that use those character encodings. The latest version requires Mac OS X Lion or higher. The Unarchiver does not compress files.

<span class="mw-page-title-main">PeaZip</span> File archive computer program

PeaZip is a free and open-source file manager and file archiver for Microsoft Windows, ReactOS, Linux, MacOS and BSD by Giorgio Tani. It supports its native PEA archive format and other mainstream formats, with special focus on handling open formats. Version 9.4.0 supported 234 file extensions.

lzip Data compression utility

lzip is a free, command-line tool for the compression of data; it employs the Lempel–Ziv–Markov chain algorithm (LZMA) with a user interface that is familiar to users of usual Unix compression tools, such as gzip and bzip2.

<span class="mw-page-title-main">Apple Disk Image</span> File format developed by Apple and used by macOS

AppleDisk Image is a disk image format commonly used by the macOS operating system. When opened, an Apple Disk Image is mounted as a volume within the Finder.

References

  1. Mozilla ARchive
  2. 1 2 "MarcoPon/SeqBox". May 13, 2024. Archived from the original on February 12, 2022. Retrieved March 20, 2019 via GitHub.
  3. "Archived copy". Archived from the original on 2022-02-01. Retrieved 2009-03-05.{{cite web}}: CS1 maint: archived copy as title (link)
  4. "Genozip - A Universal Extensible Genomic Data Compressor". Archived from the original on 2022-12-26. Retrieved 2022-12-26.
  5. The 'application/zlib' and 'application/gzip' Media Types. doi: 10.17487/RFC6713 . RFC 6713 . Retrieved 2016-05-03.
  6. "Overview". dev.b1.org. Archived from the original on 2023-04-04. Retrieved 2014-03-03.
  7. "B6Zip - Extract .B6Z and .B6 files - Download". B6Zip. Archived from the original on 2024-05-23. Retrieved 2017-10-06.
  8. "Genozip | Compression for FASTQ, BAM, VCF". Genozip. Archived from the original on 2024-05-27. Retrieved 2021-07-22.
  9. "Archived copy". Archived from the original on 2024-04-17. Retrieved 2009-03-05.{{cite web}}: CS1 maint: archived copy as title (link)
  10. "WinHKI". Archived from the original on 2006-12-05.
  11. "Tandem Unpak GUI". Archived from the original on 2020-09-22. Retrieved 2013-09-16.
  12. "Data Compression Programs". mattmahoney.net. Archived from the original on 2024-05-28. Retrieved 2010-09-29.
  13. "unar RAR parsing implementation" . Retrieved 2024-09-11.
  14. "libarchive RARv3 parsing implementation" . Retrieved 2024-09-11.
  15. "libarchive RARv5 parsing implementation" . Retrieved 2024-09-11.
  16. "Archived copy". Archived from the original on 2018-03-20. Retrieved 2011-03-08.{{cite web}}: CS1 maint: archived copy as title (link)
  17. "PerfectCompress". Archived from the original on 2012-07-23. Retrieved 2009-11-08.
  18. DeepFreezer [ circular reference ]
  19. Zzip
  20. "darrenldl/blockyarchive". July 7, 2024. Archived from the original on May 1, 2019. Retrieved April 11, 2019 via GitHub.
  21. "7-Zip / Discussion / Open Discussion: File times: creation, access - precision". sourceforge.net. Archived from the original on 2021-07-25. Retrieved 2020-02-08.
  22. Lemke, Marcel (1998). "Technical Information on the archiver ACE V1.2". hugi.scene.org. Archived from the original on 4 April 2023. Retrieved 22 May 2023.
  23. "LICENCE · master · RiscOS / Sources / FileSys / ImageFS / SparkFS / Codecs / SparkSpark · GitLab". 28 January 2023. Retrieved 2023-03-26.
  24. WinRAR download and support: Knowledge Base
  25. "Switch -TS[N] - save or restore file time (modification, creat..." archive.ph. February 9, 2013. Archived from the original on February 9, 2013. Retrieved November 28, 2021.
  26. "Professional – Nico de Vries". Archived from the original on 2012-12-30. Retrieved 2020-02-08.
  27. "Arch Linux - News: Now using Zstandard instead of xz for package compression". archlinux.org.