Internet media type | video/FFV1 |
---|---|
Developed by |
|
Initial release | 9 June 2003 |
Latest release | Version 3 (FFV1.3) 3 August 2013 |
Type of format | Video coding format |
Contained by | AVI, MKV, MOV, NUT, etc. |
Standard | RFC 9043 Version 4 (draft) |
Open format? | Yes |
Website | Specification development repository |
FFV1 (short for FF Video 1 [1] ) is a lossless intra-frame video coding format. FFV1 is particularly popular for its performance regarding speed and size, compared to other lossless preservation codecs, such as M-JPEG2000. [2] [3] [4]
The encoder and decoder have been part of the free, open-source library libavcodec in the project FFmpeg since June 2003. [5] FFV1 is also included in ffdshow and LAV Filters, [6] which makes the video codec available to Microsoft Windows applications that support system-wide codecs over Video for Windows (VfW) or DirectShow.
FFV1 has been standardized at the IETF under RFC 9043. [7] The European Broadcasting Union (EBU) lists FFV1 under the codec-family index "31" in their combined list of video codec references. [8]
For long-term preservation of digital video sustainable container formats as well as audio/video codecs are necessary. There is no consensus as of 2013 among the archival community as to which file format or codecs should be used for preservation purposes for digital video. [9] The previously proclaimed encodings were Motion JPEG 2000 (lossless) [10] and uncompressed video. [11]
FFV1 proved to be a viable archival encoding and the U.S. Library of Congress began regarding it as a suitable option for preservation encoding in 2014. [2] [12] Compared to lossless JPEG 2000, FFV1 features comparable compression ratios and lower computing requirements. As of 2014, it is being used by archives, particularly where the collections do not feature extensive broadcast materials and instead consist of oral history and the like. [13] [14]
Since around 2015, the European PREFORMA Project started working on the standardisation of FFV1 through the Internet Engineering Task Force (IETF). [15] It was standardised in August 2021 as RFC 9043. [7] The PERFORMA Project also implemented a conformance checker for FFV1 in the Matroska container. [16] Details of FFV1's standardization plan [17] have been prepared by MediaArea (authors of MediaInfo) as part of their conformance checking tool "Media CONCH". [18]
It is also listed as a format option for long-term preservation of moving images on sites of the U.S. Library of Congress', [12] [19] State Records NSW [9] and others. The Society of American Archivists has published a paper in August 2014, suggesting only FFV1 as preservation codec for video. [20]
The Digital Preservation project at the U.S. Library of Congress identified AVI and Matroska as common container formats for FFV1. [21] [22]
Within the video archiving domain, the interest in FFV1 is increasing, as can be seen in a thread on the AMIA-L mailing list, [23] the PrestoCentre Forum [24] or the Archivematica mailing list. [25] [26] Companies are also picking up FFV1 support. For example, NOA (formerly "NOA Audio Solutions"), announced support for the FFV1 in their product line in July 2013 [27] and KEM-Studiotechnik released a film-scanner with FFV1 output in November 2013. [28]
In an interview for The New York Times magazine about "Tips on Archiving Family History", [29] Bertram Lyons from the U.S. Library of Congress says:
"[...] for video, there are many choices when it comes to codecs (the way the bits are encoded/decoded to represent the visual data, e.g., ffv1, H.264, Apple ProRes) [...]"
In January 2013, the possible use and adoption of FFV1 as an archiving codec was addressed in the issue of PrestoCentre's [30] AV Insider magazine: [31]
"FFV1 has many beneficial technical features [...], but adoption rates are relatively low compared with alternatives, for example JPEG2000. [...] But holding back too long only serves to self-perpetuate the status of FFV1.
The adoption by Archivematica and the Austrian Mediathek with their active promotion of FFV1 along with others may start to break this vicious circle. This could lead to a virtuous circle of wider take-up, to shared development, to incorporation into commercial products and a host of other benefits for the community."
PACKED - the "Centre of Expertise in Digital Heritage" in Belgium, say in an article about video formats for archiving: [32]
"When removing the proprietary codecs from this list, only a few are left. [...] This basically leaves heritage institutions that want to use a lossless codec, with only two options: Jpeg2000 and FFV1."
In 2015, the International Federation of Television Archives (FIAT/IFTA) mentioned FFV1 explicitly in their call-for-presentations for their annual World Conference, asking "Is FFV1 the new JPEG2000"?. [33] A workshop titled "FFV1 for Preservation" is also featured. [34]
The "Österreichische Mediathek" has also developed DVA-Profession a Free Software solution for archive-suitable mass video digitization, mainly using FFV1 as video encoding throughout the whole workflow, without transcoding. [51] Additionally, they have initiated the development of "FFV1.3" (=version 3 of FFV1) together with Michael Niedermayer (FFmpeg), Peter Bubestinger-Steindl and Dave Rice; see #Versions below. [52]
Here is a list of applications known to be able to read and/or write FFV1 video files, either natively or by installing codec packages.
Entries marked with "-" means that they generally only support either encoding or decoding.
The term "built-in" means that the application can handle FFV1 without the necessity to install additional codec packages. Applications that come with FFV1 support out of the box, usually use FFmpeg's or Libav's libraries in order to do so.
The list is far from being complete, and will be augmented over time:
Application | Encoding | Decoding | Method |
---|---|---|---|
Adobe Premiere | Yes | Yes | DirectShow [53] [54] |
Archivematica [55] | Yes | Yes | built-in |
AVID | Unknown | Yes | Their transcoder can handle FFV1 |
Avidemux | Yes | Yes | built-in |
Davinci Resolve [56] | Yes | Yes | built-in |
Blender | Yes [57] | Yes | built-in |
DVA-Profession [58] | Yes | Yes | built-in |
ffdshow-tryouts | Yes | Yes | built-in |
FFmpeg | Yes | Yes | built-in |
HandBrake | Yes | Yes | built-in |
Harris Broadcast Velocity | Yes | Yes | Video for Windows [53] |
kdenlive | Yes | Yes | built-in |
KEM Scan (motion picture film scanner) [28] [59] | Yes | - | built-in |
LAV Filters [54] | Yes | Yes | built-in |
MediaInfo | - | Yes | built-in |
Media Lovin' Toolkit [60] | Yes | Yes | built-in |
Media Player Classic | - | Yes | built-in |
MPlayer/MEncoder | Yes | Yes | built-in |
NOA MediaButler [61] | Yes | Yes | built-in |
QUADRIGA Video [62] | Yes [63] | Unknown | Unknown |
Shotcut [64] | Yes | Yes | built-in |
Sorenson Squeeze | Unknown | Yes | built-in |
VirtualDub | Yes | Yes | Video for Windows [53] |
VLC media player | No | Yes | built-in |
Windows Media Player | Unknown | Yes | DirectShow [53] [54] |
FFV1 is not strictly an intra-frame format; despite not using inter-frame prediction, it allows the context model to adapt over multiple frames. This can be useful for compression due to the very large size of the context table, but can be disabled to force the encoder to generate a strictly intra-frame bitstream. As the gained compression seems to decrease [65] with later versions of FFV1 (version 2,3), the use of GOP size greater than "1" might disappear in the future.
During progressive scanning of a frame, the difference between a current pixel and its predicted value, judging by neighboring pixels, is sent to the entropy-coding process. The prediction is done as follows:
The third value, Top + Left - TopLeft, is effectively equivalent to applying the "top" predictor to the current and the left sample, followed by applying the left predictor to the prediction residual of the top predictor. This method, also known as the gradient, exploits both horizontal and vertical redundancy. So in simple terms the prediction is the median of the top, left, and gradient prediction methods. For improved performance and simplicity, the edges of the frame are assumed to be zero to avoid special cases. The prediction in encoding and decoding is managed using a ring buffer. [66]
The residuals are coded using either Golomb-Rice coding [67] or range coding. Both options use a very large context model. The "small" context model uses (11×11×11+1)/2=666 contexts based on the neighboring values of (Left−TopLeft), (TopLeft-Top), and (Top−TopRight). The "large" context model uses (11×11×5×5×5+1)/2=7563 contexts based on the same values as before, but also (TopTop−Top) and (LeftLeft−Left), where TopTop is the pixel two above the current one vertically, and LeftLeft is the pixel two to the left of the current one. In range coding, each "context" actually has 32 sub-contexts used for various portions of coding each residual, resulting in a grand total of 242,016 contexts for the "large" model.
Early experimental versions of FFV1 used the CABAC Arithmetic coder from H.264, but due to the uncertain patent/royalty situation, as well as its slightly worse performance, CABAC was replaced by range coding. [68]
On April 16, 2006, a commit-message by Michael Niedermayer confirmed that the bitstream of FFV1 (version 1) is frozen: [69]
"ffv1 and ffvhuff haven't changed since a long time and no one proposed any changes within 1 month after my warning so they are officially no longer experimental and we will guarantee decodability of files encoded with the current ffv1/ffvhuff in the future"
The current authoritative documentation was started in April 2012, and stayed in a very basic state until 2015. [75] In 2015, as part of the IETF standardization process, the documentation is now improved and reviewed by the CELLAR working group in close cooperation with Michael Niedermayer. [1]
An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data is called the audio coding format and can be uncompressed, or compressed to reduce the file size, often using lossy compression. The data can be a raw bitstream in an audio coding format, but it is usually embedded in a container format or an audio data format with defined storage layer.
Ogg is a free, open container format maintained by the Xiph.Org Foundation. The authors of the Ogg format state that it is unrestricted by software patents and is designed to provide for efficient streaming and manipulation of high-quality digital multimedia. Its name is derived from "ogging", jargon from the computer game Netrek.
A video file format is a type of file format for storing digital video data on a computer system. Video is almost always stored using lossy compression to reduce the file size.
FLAC is an audio coding format for lossless compression of digital audio, developed by the Xiph.Org Foundation, and is also the name of the free software project producing the FLAC tools, the reference software package that includes a codec implementation. Digital audio compressed by FLAC's algorithm can typically be reduced to between 50 and 70 percent of its original size and decompresses to an identical copy of the original audio data.
Theora is a free lossy video compression format. It was developed by the Xiph.Org Foundation and distributed without licensing fees alongside their other free and open media projects, including the Vorbis audio format and the Ogg container.
Dirac is an open and royalty-free video compression format, specification and software video codec developed by BBC Research & Development. Dirac aimed to provide high-quality video compression for Ultra HDTV and competed with existing formats such as H.264.
FFmpeg is a free and open-source software project consisting of a suite of libraries and programs for handling video, audio, and other multimedia files and streams. At its core is the command-line ffmpeg
tool itself, designed for processing video and audio files. It is widely used for format transcoding, basic editing, video scaling, video post-production effects, and standards compliance.
Matroska is a project to create a container format that can hold an unlimited number of video, audio, picture, or subtitle tracks in one file. The Matroska Multimedia Container is similar in concept to other containers like AVI, MP4, or Advanced Systems Format (ASF), but is an open standard.
Huffyuv is a lossless video codec created by Ben Rudiak-Gould which is meant to replace uncompressed YCbCr as a video capture format. The codec can also compress in the RGB color space.
QuickTime Animation format is a video compression format and codec created by Apple Computer to enable playback of RGB video in real time without expensive hardware. It is generally found in the QuickTime container with the FourCC 'rle '. It can perform either lossless or lossy compression and is one of the few video codecs that supports an alpha channel. Supported color depths are 1-bit (monochrome), 15-bit RGB, 24-bit RGB, 32-bit ARGB, as well as palettized RGB. As a result of reverse-engineering of the format, a decoder is implemented in XAnim as well as an encoder and decoder in libavcodec.
These tables compare features of multimedia container formats, most often used for storing or streaming digital video or digital audio content. To see which multimedia players support which container format, look at comparison of media players.
libavcodec is a free and open-source library of codecs for encoding and decoding video and audio data.
Constrained Energy Lapped Transform (CELT) is an open, royalty-free lossy audio compression format and a free software codec with especially low algorithmic delay for use in low-latency audio communication. The algorithms are openly documented and may be used free of software patent restrictions. Development of the format was maintained by the Xiph.Org Foundation and later coordinated by the Opus working group of the Internet Engineering Task Force (IETF).
WebP is a raster graphics file format developed by Google intended as a replacement for JPEG, PNG, and GIF file formats. It supports both lossy and lossless compression, as well as animation and alpha transparency.
x265 is an encoder for creating digital video streams in the High Efficiency Video Coding (HEVC/H.265) video compression format developed by the Joint Collaborative Team on Video Coding (JCT-VC). It is available as a command-line app or a software library, under the terms of GNU General Public License (GPL) version 2 or later; however, customers may request a commercial license.
Opus is a lossy audio coding format developed by the Xiph.Org Foundation and standardized by the Internet Engineering Task Force, designed to efficiently code speech and general audio in a single format, while remaining low-latency enough for real-time interactive communication and low-complexity enough for low-end embedded processors. Opus replaces both Vorbis and Speex for new applications, and several blind listening tests have ranked it higher-quality than any other standard audio format at any given bitrate until transparency is reached, including MP3, AAC, and HE-AAC.
A video coding format is a content representation format of digital video content, such as in a data file or bitstream. It typically uses a standardized video compression algorithm, most commonly based on discrete cosine transform (DCT) coding and motion compensation. A specific software, firmware, or hardware implementation capable of compression or decompression in a specific video coding format is called a video codec.
VP9 is an open and royalty-free video coding format developed by Google.