High Efficiency Video Coding | |
Status | In force |
---|---|
Year started | 7 June 2013 |
First published | July 7, 2013 |
Latest version | 10.0 July 29, 2024 |
Organization | ITU-T, ISO, IEC |
Committee | SG16 (Secretary: Simao Campos) (VCEG), MPEG |
Base standards | H.261, H.262, H.263, ISO/IEC 14496-2, H.264 |
Related standards | H.266, MPEG-5, MPEG-H |
Domain | Video compression |
License | MPEG LA [1] |
Website | www |
High Efficiency Video Coding (HEVC), also known as H.265 and MPEG-H Part 2, is a video compression standard designed as part of the MPEG-H project as a successor to the widely used Advanced Video Coding (AVC, H.264, or MPEG-4 Part 10). In comparison to AVC, HEVC offers from 25% to 50% better data compression at the same level of video quality, or substantially improved video quality at the same bit rate. It supports resolutions up to 8192×4320, including 8K UHD, and unlike the primarily 8-bit AVC, HEVC's higher fidelity Main 10 profile has been incorporated into nearly all supporting hardware.
While AVC uses the integer discrete cosine transform (DCT) with 4×4 and 8×8 block sizes, HEVC uses both integer DCT and discrete sine transform (DST) with varied block sizes between 4×4 and 32×32. The High Efficiency Image Format (HEIF) is based on HEVC. [2]
In most ways, HEVC is an extension of the concepts in H.264/MPEG-4 AVC. Both work by comparing different parts of a frame of video to find areas that are redundant, both within a single frame and between consecutive frames. These redundant areas are then replaced with a short description instead of the original pixels. The primary changes for HEVC include the expansion of the pattern comparison and difference-coding areas from 16×16 pixel to sizes up to 64×64, improved variable-block-size segmentation, improved "intra" prediction within the same picture, improved motion vector prediction and motion region merging, improved motion compensation filtering, and an additional filtering step called sample-adaptive offset filtering. Effective use of these improvements requires much more signal processing capability for compressing the video but has less impact on the amount of computation needed for decompression.
HEVC was standardized by the Joint Collaborative Team on Video Coding (JCT-VC), a collaboration between the ISO/IEC MPEG and ITU-T Study Group 16 VCEG. The ISO/IEC group refers to it as MPEG-H Part 2 and the ITU-T as H.265. The first version of the HEVC standard was ratified in January 2013 and published in June 2013. The second version, with multiview extensions (MV-HEVC), range extensions (RExt), and scalability extensions (SHVC), was completed and approved in 2014 and published in early 2015. Extensions for 3D video (3D-HEVC) were completed in early 2015, and extensions for screen content coding (SCC) were completed in early 2016 and published in early 2017, covering video containing rendered graphics, text, or animation as well as (or instead of) camera-captured video scenes. In October 2017, the standard was recognized by a Primetime Emmy Engineering Award as having had a material effect on the technology of television. [3] [4] [5] [6] [7]
HEVC contains technologies covered by patents owned by the organizations that participated in the JCT-VC. Implementing a device or software application that uses HEVC may require a license from HEVC patent holders. The ISO/IEC and ITU require companies that belong to their organizations to offer their patents on reasonable and non-discriminatory licensing (RAND) terms. Patent licenses can be obtained directly from each patent holder, or through patent licensing bodies, such as MPEG LA, Access Advance, and Velos Media.
The combined licensing fees currently offered by all of the patent licensing bodies are higher than for AVC. The licensing fees are one of the main reasons HEVC adoption has been low on the web and is why some of the largest tech companies (Amazon, AMD, Apple, ARM, Cisco, Google, Intel, Microsoft, Mozilla, Netflix, Nvidia, and more) have joined the Alliance for Open Media, [8] which finalized royalty-free alternative video coding format AV1 on March 28, 2018. [9]
The HEVC format was jointly developed by more than a dozen organisations across the world. The majority of active patent contributions towards the development of the HEVC format came from five organizations: Samsung Electronics (4,249 patents), General Electric (1,127 patents), [10] M&K Holdings (907 patents), NTT (878 patents), and JVC Kenwood (628 patents). [11] Other patent holders include Fujitsu, Apple, Canon, Columbia University, KAIST, Kwangwoon University, MIT, Sungkyunkwan University, Funai, Hikvision, KBS, KT and NEC. [12]
In 2004, the ITU-T Video Coding Experts Group (VCEG) began a major study of technology advances that could enable the creation of a new video compression standard (or substantial compression-oriented enhancements of the H.264/MPEG-4 AVC standard). [13] In October 2004, various techniques for potential enhancement of the H.264/MPEG-4 AVC standard were surveyed. In January 2005, at the next meeting of VCEG, VCEG began designating certain topics as "Key Technical Areas" (KTA) for further investigation. A software codebase called the KTA codebase was established for evaluating such proposals. [14] The KTA software was based on the Joint Model (JM) reference software that was developed by the MPEG & VCEG Joint Video Team for H.264/MPEG-4 AVC. Additional proposed technologies were integrated into the KTA software and tested in experiment evaluations over the next four years. [15] [13] [16] [17]
Two approaches for standardizing enhanced compression technology were considered: either creating a new standard or creating extensions of H.264/MPEG-4 AVC. The project had tentative names H.265 and H.NGVC (Next-generation Video Coding), and was a major part of the work of VCEG until it evolved into the HEVC joint project with MPEG in 2010. [18] [19] [20]
The preliminary requirements for NGVC were the capability to have a bit rate reduction of 50% at the same subjective image quality compared with the H.264/MPEG-4 AVC High profile, and computational complexity ranging from 1/2 to 3 times that of the High profile. [20] NGVC would be able to provide 25% bit rate reduction along with 50% reduction in complexity at the same perceived video quality as the High profile, or to provide greater bit rate reduction with somewhat higher complexity. [20] [21]
The ISO/IEC Moving Picture Experts Group (MPEG) started a similar project in 2007, tentatively named High-performance Video Coding. [22] [23] An agreement of getting a bit rate reduction of 50% had been decided as the goal of the project by July 2007. [22] Early evaluations were performed with modifications of the KTA reference software encoder developed by VCEG. [13] By July 2009, experimental results showed average bit reduction of around 20% compared with AVC High Profile; these results prompted MPEG to initiate its standardization effort in collaboration with VCEG. [23]
MPEG and VCEG established a Joint Collaborative Team on Video Coding (JCT-VC) to develop the HEVC standard. [13] [24] [25] [26]
A formal joint Call for Proposals on video compression technology was issued in January 2010 by VCEG and MPEG, and proposals were evaluated at the first meeting of the MPEG & VCEG Joint Collaborative Team on Video Coding (JCT-VC), which took place in April 2010. A total of 27 full proposals were submitted. [18] [27] Evaluations showed that some proposals could reach the same visual quality as AVC at only half the bit rate in many of the test cases, at the cost of 2–10× increase in computational complexity, and some proposals achieved good subjective quality and bit rate results with lower computational complexity than the reference AVC High profile encodings. At that meeting, the name High Efficiency Video Coding (HEVC) was adopted for the joint project. [13] [18] Starting at that meeting, the JCT-VC integrated features of some of the best proposals into a single software codebase and a "Test Model under Consideration", and performed further experiments to evaluate various proposed features. [13] [28] The first working draft specification of HEVC was produced at the third JCT-VC meeting in October 2010. Many changes in the coding tools and configuration of HEVC were made in later JCT-VC meetings. [13]
On January 25, 2013, the ITU announced that HEVC had received first stage approval (consent) in the ITU-T Alternative Approval Process (AAP). [29] [30] [31] On the same day, MPEG announced that HEVC had been promoted to Final Draft International Standard (FDIS) status in the MPEG standardization process. [32] [33]
On April 13, 2013, HEVC/H.265 was approved as an ITU-T standard. [34] [35] [36] The standard was formally published by the ITU-T on June 7, 2013, and by the ISO/IEC on November 25, 2013. [24] [17]
On July 11, 2014, MPEG announced that the 2nd edition of HEVC will contain three recently completed extensions which are the multiview extensions (MV-HEVC), the range extensions (RExt), and the scalability extensions (SHVC). [37]
On October 29, 2014, HEVC/H.265 version 2 was approved as an ITU-T standard. [38] [39] [40] It was then formally published on January 12, 2015. [24]
On April 29, 2015, HEVC/H.265 version 3 was approved as an ITU-T standard. [41] [42] [43]
On June 3, 2016, HEVC/H.265 version 4 was consented in the ITU-T and was not approved during a vote in October 2016. [44] [45]
On December 22, 2016, HEVC/H.265 version 4 was approved as an ITU-T standard. [46] [47]
On September 29, 2014, MPEG LA announced their HEVC license which covers the essential patents from 23 companies. [48] The first 100,000 "devices" (which includes software implementations) are royalty free, and after that the fee is $0.20 per device up to an annual cap of $25 million. [49] This is significantly more expensive than the fees on AVC, which were $0.10 per device, with the same 100,000 waiver, and an annual cap of $6.5 million. MPEG LA does not charge any fee on the content itself, something they had attempted when initially licensing AVC, but subsequently dropped when content producers refused to pay it. [50] The license has been expanded to include the profiles in version 2 of the HEVC standard. [51]
When the MPEG LA terms were announced, commenters noted that a number of prominent patent holders were not part of the group. Among these were AT&T, Microsoft, Nokia, and Motorola. Speculation at the time was that these companies would form their own licensing pool to compete with or add to the MPEG LA pool. Such a group was formally announced on March 26, 2015, as HEVC Advance. [52] The terms, covering 500 essential patents, were announced on July 22, 2015, with rates that depend on the country of sale, type of device, HEVC profile, HEVC extensions, and HEVC optional features. Unlike the MPEG LA terms, HEVC Advance reintroduced license fees on content encoded with HEVC, through a revenue sharing fee. [53]
The initial HEVC Advance license had a maximum royalty rate of US$2.60 per device for Region 1 countries and a content royalty rate of 0.5% of the revenue generated from HEVC video services. Region 1 countries in the HEVC Advance license include the United States, Canada, European Union, Japan, South Korea, Australia, New Zealand, and others. Region 2 countries are countries not listed in the Region 1 country list. The HEVC Advance license had a maximum royalty rate of US$1.30 per device for Region 2 countries. Unlike MPEG LA, there was no annual cap. On top of this, HEVC Advance also charged a royalty rate of 0.5% of the revenue generated from video services encoding content in HEVC. [53]
When they were announced, there was considerable backlash from industry observers about the "unreasonable and greedy" fees on devices, which were about seven times that of the MPEG LA's fees. Added together, a device would require licenses costing $2.80, twenty-eight times as expensive as AVC, as well as license fees on the content. This led to calls for "content owners [to] band together and agree not to license from HEVC Advance". [54] Others argued the rates might cause companies to switch to competing standards such as Daala and VP9. [55]
On December 18, 2015, HEVC Advance announced changes in the royalty rates. The changes include a reduction in the maximum royalty rate for Region 1 countries to US$2.03 per device, the creation of annual royalty caps, and a waiving of royalties on content that is free to end users. The annual royalty caps for a company is US$40 million for devices, US$5 million for content, and US$2 million for optional features. [56]
On February 3, 2016, Technicolor SA announced that they had withdrawn from the HEVC Advance patent pool [57] and would be directly licensing their HEVC patents. [58] HEVC Advance previously listed 12 patents from Technicolor. [59] Technicolor announced that they had rejoined on October 22, 2019. [60]
On November 22, 2016, HEVC Advance announced a major initiative, revising their policy to allow software implementations of HEVC to be distributed directly to consumer mobile devices and personal computers royalty free, without requiring a patent license. [61]
On March 31, 2017, Velos Media announced their HEVC license which covers the essential patents from Ericsson, Panasonic, Qualcomm Incorporated, Sharp, and Sony. [62]
As of April 2019, [update] the MPEG LA HEVC patent list is 164 pages long. [63] [64]
The following organizations currently hold the most active patents in the HEVC patent pools listed by MPEG LA and HEVC Advance:
Organization | Active patents | Ref |
---|---|---|
Samsung Electronics | 4249 | [10] |
General Electric (GE) | 1127 | |
M&K Holdings Inc | 907 | [11] |
Nippon Telegraph and Telephone (including NTT Docomo) | 878 | |
JVC Kenwood | 628 | |
Dolby Laboratories | 624 | [10] |
Infobridge Pte. Ltd. | 572 | [11] |
Mitsubishi Electric | 401 | [10] |
SK Telecom (including SK Planet) | 380 | [11] |
MediaTek (through HFI Inc.) | 337 | [10] |
Sejong University | 330 | |
KT Corp | 289 | [11] |
Philips | 230 | [10] |
Godo Kaisha IP Bridge | 219 | |
NEC Corporation | 219 | [11] |
Electronics and Telecommunications Research Institute (ETRI) of Korea | 208 | |
Canon Inc. | 180 | |
Tagivan II | 162 | |
Fujitsu | 144 | |
Kyung Hee University | 103 |
Versions of the HEVC/H.265 standard using the ITU-T approval dates. [24]
On February 29, 2012, at the 2012 Mobile World Congress, Qualcomm demonstrated a HEVC decoder running on an Android tablet, with a Qualcomm Snapdragon S4 dual-core processor running at 1.5 GHz, showing H.264/MPEG-4 AVC and HEVC versions of the same video content playing side by side. In this demonstration, HEVC reportedly showed almost a 50% bit rate reduction compared with H.264/MPEG-4 AVC. [73]
On February 11, 2013, researchers from MIT demonstrated the world's first published HEVC ASIC decoder at the International Solid-State Circuits Conference (ISSCC) 2013. [74] Their chip was capable of decoding a 3840×2160p at 30 fps video stream in real time, consuming under 0.1 W of power. [75] [76]
On April 3, 2013, Ateme announced the availability of the first open source implementation of a HEVC software player based on the OpenHEVC decoder and GPAC video player which are both licensed under LGPL. The OpenHEVC decoder supports the Main profile of HEVC and can decode 1080p at 30 fps video using a single core CPU. [77] A live transcoder that supports HEVC and used in combination with the GPAC video player was shown at the ATEME booth at the NAB Show in April 2013. [77] [78]
On July 23, 2013, MulticoreWare announced, and made the source code available for the x265 HEVC Encoder Library under the GPL v2 license. [79] [80]
On August 8, 2013, Nippon Telegraph and Telephone announced the release of their HEVC-1000 SDK software encoder which supports the Main 10 profile, resolutions up to 7680×4320, and frame rates up to 120 fps. [81]
On November 14, 2013, DivX developers released information on HEVC decoding performance using an Intel i7 CPU at 3.5 GHz with 4 cores and 8 threads. [82] The DivX 10.1 Beta decoder was capable of 210.9 fps at 720p, 101.5 fps at 1080p, and 29.6 fps at 4K. [82]
On December 18, 2013, ViXS Systems announced shipments of their XCode (not to be confused with Apple's Xcode IDE for MacOS) 6400 SoC which was the first SoC to support the Main 10 profile of HEVC. [83]
On April 5, 2014, at the NAB show, eBrisk Video, Inc. and Altera Corporation demonstrated an FPGA-accelerated HEVC Main10 encoder that encoded 4Kp60/10-bit video in real-time, using a dual-Xeon E5-2697-v2 platform. [84] [85]
On August 13, 2014, Ittiam Systems announced availability of its third generation H.265/HEVC codec with 4:2:2 12-bit support. [86]
On September 5, 2014, the Blu-ray Disc Association announced that the 4K Blu-ray Disc specification would support HEVC-encoded 4K video at 60 fps, the Rec. 2020 color space, high dynamic range (PQ and HLG), and 10-bit color depth. [87] [88] 4K Blu-ray Discs have a data rate of at least 50 Mbit/s and disc capacity up to 100 GB. [87] [88] 4K Blu-ray Discs and players became available for purchase in 2015 or 2016. [87] [88]
On September 9, 2014, Apple announced the iPhone 6 and iPhone 6 Plus which support HEVC/H.265 for FaceTime over cellular. [89]
On September 18, 2014, Nvidia released the GeForce GTX 980 (GM204) and GTX 970 (GM204), which includes Nvidia NVENC, the world's first HEVC hardware encoder in a discrete graphics card. [90]
On October 31, 2014, Microsoft confirmed that Windows 10 will support HEVC out of the box, according to a statement from Gabriel Aul, the leader of Microsoft Operating Systems Group's Data and Fundamentals Team. [91] [92] Windows 10 Technical Preview Build 9860 added platform level support for HEVC and Matroska. [93] [94]
On November 3, 2014, Android Lollipop was released with out of the box support for HEVC using Ittiam Systems' software. [95]
On January 5, 2015, ViXS Systems announced the XCode 6800 which is the first SoC to support the Main 12 profile of HEVC. [96]
On January 5, 2015, Nvidia officially announced the Tegra X1 SoC with full fixed-function HEVC hardware decoding. [97] [98]
On January 22, 2015, Nvidia released the GeForce GTX 960 (GM206), which includes the world's first full fixed function HEVC Main/Main10 hardware decoder in a discrete graphics card. [99]
On February 23, 2015, Advanced Micro Devices (AMD) announced that their UVD ASIC to be found in the Carrizo APUs would be the first x86 based CPUs to have a HEVC hardware decoder. [100]
On February 27, 2015, VLC media player version 2.2.0 was released with robust support of HEVC playback. The corresponding versions on Android and iOS are also able to play HEVC.
On March 31, 2015, VITEC announced the MGW Ace which was the first 100% hardware-based portable HEVC encoder that provides mobile HEVC encoding. [101]
On August 5, 2015, Intel launched Skylake products with full fixed function Main/8-bit decoding/encoding and hybrid/partial Main10/10-bit decoding.
On September 9, 2015 Apple announced the Apple A9 chip, first used in the iPhone 6S, its first processor with a hardware HEVC decoder supporting Main 8 and 10. This feature would not be unlocked until the release of iOS 11 in 2017. [102]
On April 11, 2016, full HEVC (H.265) support was announced in the newest MythTV version (0.28). [103]
On August 30, 2016, Intel officially announced 7th generation Core CPUs (Kaby Lake) products with full fixed function HEVC Main10 hardware decoding support. [104]
On September 7, 2016 Apple announced the Apple A10 chip, first used in the iPhone 7, which included a hardware HEVC encoder supporting Main 8 and 10. This feature would not be unlocked until the release of iOS 11 in 2017. [102]
On October 25, 2016, Nvidia released the GeForce GTX 1050Ti (GP107) and GeForce GTX 1050 (GP107), which includes full fixed function HEVC Main10/Main12 hardware encoder.
On June 5, 2017, Apple announced HEVC H.265 support in macOS High Sierra, iOS 11, tvOS, [105] HTTP Live Streaming [106] and Safari. [107] [108]
On June 25, 2017, Microsoft released a free HEVC app extension for Windows 10, enabling some Windows 10 devices with HEVC decoding hardware to play video using the HEVC format inside any app. [109]
On September 19, 2017, Apple released iOS 11 and tvOS 11 with HEVC encoding & decoding support. [110] [105]
On September 25, 2017, Apple released macOS High Sierra with HEVC encoding & decoding support.
On September 28, 2017, GoPro released the Hero6 Black action camera, with 4K60P HEVC video encoding. [111]
On October 17, 2017, Microsoft removed HEVC decoding support from Windows 10 with the Version 1709 Fall Creators Update, making HEVC available instead as a separate, paid download from the Microsoft Store. [112]
On November 2, 2017, Nvidia released the GeForce GTX 1070 Ti (GP104), which includes full fixed function HEVC Main10/Main12 hardware decoder.
On September 20, 2018, Nvidia released the GeForce RTX 2080 (TU104), which includes full fixed function HEVC Main 4:4:4 12 hardware decoder.
On October 25, 2022, Chrome released version 107, which starts supporting HEVC hardware decoding for all platforms "out of the box", if the hardware is supported.
HEVC is implemented in these web browsers:
In June 2023, an estimated 88.31% of browsers in use on desktop and mobile systems were able to play HEVC videos in HTML5 webpages, based on data from Can I Use. [117]
Microsoft Windows | macOS | Android | iOS | |
---|---|---|---|---|
Codec support | Yes | Yes | Yes | Yes |
Container support | MP4 (.mp4, .m4v) QuickTime File Format (.mov) Matroska (.mkv) | MP4 (.mp4, .m4v) QuickTime File Format (.mov) | MP4 (.mp4, .m4v) Matroska (.mkv) | MP4 (.mp4, .m4v) QuickTime File Format (.mov) |
Notes | - Support introduced in Windows 10 version 1507. - Built-in support was removed in Windows 10 version 1709 due to licensing costs. The HEVC Video Extensions add-on can be purchased from the Microsoft Store to enable HEVC playback on the default media player app Microsoft Movies & TV. [112] - Since Windows 11 version 22H2, the HEVC Video Extensions is built-in by default installation. [118] | Support introduced in macOS 10.13 High Sierra [119] | - Support introduced in Android 5.0 [113] - Some Android devices may only support 8-bit (Main profile) hardware decoding, but not 10-bit (Main 10 profile). | - Support introduced in iOS 11.0 - Playback with software decoding is possible on iPhone 5s (at 720p/240 fps, 1080p/60 fps) and iPhone 6 (at 1080p/240 fps). - Hardware decoding is available on Apple A9 (iPhone 6s), while hardware decoding & encoding is available on Apple A10 (iPhone 7). [120] |
Most video coding standards are designed primarily to achieve the highest coding efficiency. Coding efficiency is the ability to encode video at the lowest possible bit rate while maintaining a certain level of video quality. There are two standard ways to measure the coding efficiency of a video coding standard, which are to use an objective metric, such as peak signal-to-noise ratio (PSNR), or to use subjective assessment of video quality. Subjective assessment of video quality is considered to be the most important way to measure a video coding standard since humans perceive video quality subjectively. [121]
HEVC benefits from the use of larger coding tree unit (CTU) sizes. This has been shown in PSNR tests with a HM-8.0 HEVC encoder where it was forced to use progressively smaller CTU sizes. For all test sequences, when compared with a 64×64 CTU size, it was shown that the HEVC bit rate increased by 2.2% when forced to use a 32×32 CTU size, and increased by 11.0% when forced to use a 16×16 CTU size. In the Class A test sequences, where the resolution of the video was 2560×1600, when compared with a 64×64 CTU size, it was shown that the HEVC bit rate increased by 5.7% when forced to use a 32×32 CTU size, and increased by 28.2% when forced to use a 16×16 CTU size. The tests showed that large CTU sizes increase coding efficiency while also reducing decoding time. [121]
The HEVC Main Profile (MP) has been compared in coding efficiency to H.264/MPEG-4 AVC High Profile (HP), MPEG-4 Advanced Simple Profile (ASP), H.263 High Latency Profile (HLP), and H.262/MPEG-2 Main Profile (MP). The video encoding was done for entertainment applications and twelve different bitrates were made for the nine video test sequences with a HM-8.0 HEVC encoder being used. Of the nine video test sequences, five were at HD resolution, while four were at WVGA (800×480) resolution. The bit rate reductions for HEVC were determined based on PSNR with HEVC having a bit rate reduction of 35.4% compared with H.264/MPEG-4 AVC HP, 63.7% compared with MPEG-4 ASP, 65.1% compared with H.263 HLP, and 70.8% compared with H.262/MPEG-2 MP. [121]
HEVC MP has also been compared with H.264/MPEG-4 AVC HP for subjective video quality. The video encoding was done for entertainment applications and four different bitrates were made for nine video test sequences with a HM-5.0 HEVC encoder being used. The subjective assessment was done at an earlier date than the PSNR comparison and so it used an earlier version of the HEVC encoder that had slightly lower performance. The bit rate reductions were determined based on subjective assessment using mean opinion score values. The overall subjective bitrate reduction for HEVC MP compared with H.264/MPEG-4 AVC HP was 49.3%. [121]
École Polytechnique Fédérale de Lausanne (EPFL) did a study to evaluate the subjective video quality of HEVC at resolutions higher than HDTV. The study was done with three videos with resolutions of 3840×1744 at 24 fps, 3840×2048 at 30 fps, and 3840×2160 at 30 fps. The five second video sequences showed people on a street, traffic, and a scene from the open source computer animated movie Sintel . The video sequences were encoded at five different bitrates using the HM-6.1.1 HEVC encoder and the JM-18.3 H.264/MPEG-4 AVC encoder. The subjective bit rate reductions were determined based on subjective assessment using mean opinion score values. The study compared HEVC MP with H.264/MPEG-4 AVC HP and showed that, for HEVC MP, the average bitrate reduction based on PSNR was 44.4%, while the average bitrate reduction based on subjective video quality was 66.5%. [122] [123] [124] [125]
In a HEVC performance comparison released in April 2013, the HEVC MP and Main 10 Profile (M10P) were compared with H.264/MPEG-4 AVC HP and High 10 Profile (H10P) using 3840×2160 video sequences. The video sequences were encoded using the HM-10.0 HEVC encoder and the JM-18.4 H.264/MPEG-4 AVC encoder. The average bit rate reduction based on PSNR was 45% for inter frame video.
In a video encoder comparison released in December 2013, the HM-10.0 HEVC encoder was compared with the x264 encoder (version r2334) and the VP9 encoder (version v1.2.0-3088-ga81bd12). The comparison used the Bjøntegaard-Delta bit-rate (BD-BR) measurement method, in which negative values tell how much lower the bit rate is reduced, and positive values tell how much the bit rate is increased for the same PSNR. In the comparison, the HM-10.0 HEVC encoder had the highest coding efficiency and, on average, to get the same objective quality, the x264 encoder needed to increase the bit rate by 66.4%, while the VP9 encoder needed to increase the bit rate by 79.4%. [126]
Video coding standard | Average bit rate reduction compared with H.264/MPEG-4 AVC HP | ||||
---|---|---|---|---|---|
480p | 720p | 1080p | 2160p | ||
HEVC | 52% | 56% | 62% | 64% |
In a subjective video performance comparison released in May 2014, the JCT-VC compared the HEVC Main profile to the H.264/MPEG-4 AVC High profile. The comparison used mean opinion score values and was conducted by the BBC and the University of the West of Scotland. The video sequences were encoded using the HM-12.1 HEVC encoder and the JM-18.5 H.264/MPEG-4 AVC encoder. The comparison used a range of resolutions and the average bit rate reduction for HEVC was 59%. The average bit rate reduction for HEVC was 52% for 480p, 56% for 720p, 62% for 1080p, and 64% for 4K UHD. [127]
In a subjective video codec comparison released in August 2014 by the EPFL, the HM-15.0 HEVC encoder was compared with the VP9 1.2.0–5183 encoder and the JM-18.8 H.264/MPEG-4 AVC encoder. Four 4K resolutions sequences were encoded at five different bit rates with the encoders set to use an intra period of one second. In the comparison, the HM-15.0 HEVC encoder had the highest coding efficiency and, on average, for the same subjective quality the bit rate could be reduced by 49.4% compared with the VP9 1.2.0–5183 encoder, and it could be reduced by 52.6% compared with the JM-18.8 H.264/MPEG-4 AVC encoder. [128] [129] [130]
In August, 2016, Netflix published the results of a large-scale study comparing the leading open-source HEVC encoder, x265, with the leading open-source AVC encoder, x264 and the reference VP9 encoder, libvpx. [131] Using their advanced Video Multimethod Assessment Fusion (VMAF) video quality measurement tool, Netflix found that x265 delivered identical quality at bit rates ranging from 35.4% to 53.3% lower than x264, and from 17.8% to 21.8% lower than VP9. [132]
HEVC was designed to substantially improve coding efficiency compared with H.264/MPEG-4 AVC HP, i.e. to reduce bitrate requirements by half with comparable image quality, at the expense of increased computational complexity. [13] HEVC was designed with the goal of allowing video content to have a data compression ratio of up to 1000:1. [133] Depending on the application requirements, HEVC encoders can trade off computational complexity, compression rate, robustness to errors, and encoding delay time. [13] Two of the key features where HEVC was improved compared with H.264/MPEG-4 AVC was support for higher resolution video and improved parallel processing methods. [13]
HEVC is targeted at next-generation HDTV displays and content capture systems which feature progressive scanned frame rates and display resolutions from QVGA (320×240) to 4320p (7680×4320), as well as improved picture quality in terms of noise level, color spaces, and dynamic range. [21] [134] [135] [136]
The HEVC video coding layer uses the same "hybrid" approach used in all modern video standards, starting from H.261, in that it uses inter-/intra-picture prediction and 2D transform coding. [13] A HEVC encoder first proceeds by splitting a picture into block shaped regions for the first picture, or the first picture of a random access point, which uses intra-picture prediction. [13] Intra-picture prediction is when the prediction of the blocks in the picture is based only on the information in that picture. [13] For all other pictures, inter-picture prediction is used, in which prediction information is used from other pictures. [13] After the prediction methods are finished and the picture goes through the loop filters, the final picture representation is stored in the decoded picture buffer. [13] Pictures stored in the decoded picture buffer can be used for the prediction of other pictures. [13]
HEVC was designed with the idea that progressive scan video would be used and no coding tools were added specifically for interlaced video. [13] Interlace specific coding tools, such as MBAFF and PAFF, are not supported in HEVC. [137] HEVC instead sends metadata that tells how the interlaced video was sent. [13] Interlaced video may be sent either by coding each frame as a separate picture or by coding each field as a separate picture. [13] For interlaced video HEVC can change between frame coding and field coding using Sequence Adaptive Frame Field (SAFF), which allows the coding mode to be changed for each video sequence. [138] This allows interlaced video to be sent with HEVC without needing special interlaced decoding processes to be added to HEVC decoders. [13]
The HEVC standard supports color spaces such as generic film (colour filters using Illuminant C), NTSC, PAL, Rec. 601 (SMPTE 170M), Rec. 709, Rec. 2020, Rec. 2100, SMPTE 240M, sRGB, sYCC, xvYCC, XYZ, and externally specified color spaces such as Dolby Vision or HDR Vivid. [24] HEVC supports color encoding representations such as RGB, YCbCr and ICtCp, and YCoCg. [24]
HEVC replaces 16×16 pixel macroblocks, which were used with previous standards, with coding tree units (CTUs) which can use larger block structures of up to 64×64 samples and can better sub-partition the picture into variable sized structures. [13] [139] HEVC initially divides the picture into CTUs which can be 64×64, 32×32, or 16×16 with a larger pixel block size usually increasing the coding efficiency. [13]
HEVC specifies four transform units (TUs) sizes of 4×4, 8×8, 16×16, and 32×32 to code the prediction residual. [13] A CTB may be recursively partitioned into 4 or more TUs. [13] TUs use integer basis functions based on the discrete cosine transform (DCT). [13] [2] In addition, 4×4 luma transform blocks that belong to an intra coded region are transformed using an integer transform that is derived from discrete sine transform (DST). [13] This provides a 1% bit rate reduction but was restricted to 4×4 luma transform blocks due to marginal benefits for the other transform cases. [13] Chroma uses the same TU sizes as luma so there is no 2×2 transform for chroma. [13]
HEVC uses a context-adaptive binary arithmetic coding (CABAC) algorithm that is fundamentally similar to CABAC in H.264/MPEG-4 AVC. [13] CABAC is the only entropy encoder method that is allowed in HEVC while there are two entropy encoder methods allowed by H.264/MPEG-4 AVC. [13] CABAC and the entropy coding of transform coefficients in HEVC were designed for a higher throughput than H.264/MPEG-4 AVC, [140] while maintaining higher compression efficiency for larger transform block sizes relative to simple extensions. [141] For instance, the number of context coded bins have been reduced by 8× and the CABAC bypass-mode has been improved in terms of its design to increase throughput. [13] [140] [142] Another improvement with HEVC is that the dependencies between the coded data has been changed to further increase throughput. [13] [140] Context modeling in HEVC has also been improved so that CABAC can better select a context that increases efficiency when compared with H.264/MPEG-4 AVC. [13]
HEVC specifies 33 directional modes for intra prediction compared with the 8 directional modes for intra prediction specified by H.264/MPEG-4 AVC. [13] HEVC also specifies DC intra prediction and planar prediction modes. [13] The DC intra prediction mode generates a mean value by averaging reference samples and can be used for flat surfaces. [13] The planar prediction mode in HEVC supports all block sizes defined in HEVC while the planar prediction mode in H.264/MPEG-4 AVC is limited to a block size of 16×16 pixels. [13] The intra prediction modes use data from neighboring prediction blocks that have been previously decoded from within the same picture. [13]
For the interpolation of fractional luma sample positions HEVC uses separable application of one-dimensional half-sample interpolation with an 8-tap filter or quarter-sample interpolation with a 7-tap filter while, in comparison, H.264/MPEG-4 AVC uses a two-stage process that first derives values at half-sample positions using separable one-dimensional 6-tap interpolation followed by integer rounding and then applies linear interpolation between values at nearby half-sample positions to generate values at quarter-sample positions. [13] HEVC has improved precision due to the longer interpolation filter and the elimination of the intermediate rounding error. [13] For 4:2:0 video, the chroma samples are interpolated with separable one-dimensional 4-tap filtering to generate eighth-sample precision, while in comparison H.264/MPEG-4 AVC uses only a 2-tap bilinear filter (also with eighth-sample precision). [13]
As in H.264/MPEG-4 AVC, weighted prediction in HEVC can be used either with uni-prediction (in which a single prediction value is used) or bi-prediction (in which the prediction values from two prediction blocks are combined). [13]
HEVC defines a signed 16-bit range for both horizontal and vertical motion vectors (MVs). [24] [143] [144] [145] This was added to HEVC at the July 2012 HEVC meeting with the mvLX variables. [24] [143] [144] [145] HEVC horizontal/vertical MVs have a range of −32768 to 32767 which given the quarter pixel precision used by HEVC allows for a MV range of −8192 to 8191.75 luma samples. [24] [143] [144] [145] This compares to H.264/MPEG-4 AVC which allows for a horizontal MV range of −2048 to 2047.75 luma samples and a vertical MV range of −512 to 511.75 luma samples. [144]
HEVC allows for two MV modes which are Advanced Motion Vector Prediction (AMVP) and merge mode. [13] AMVP uses data from the reference picture and can also use data from adjacent prediction blocks. [13] The merge mode allows for the MVs to be inherited from neighboring prediction blocks. [13] Merge mode in HEVC is similar to "skipped" and "direct" motion inference modes in H.264/MPEG-4 AVC but with two improvements. [13] The first improvement is that HEVC uses index information to select one of several available candidates. [13] The second improvement is that HEVC uses information from the reference picture list and reference picture index. [13]
HEVC specifies two loop filters that are applied sequentially, with the deblocking filter (DBF) applied first and the sample adaptive offset (SAO) filter applied afterwards. [13] Both loop filters are applied in the inter-picture prediction loop, i.e. the filtered image is stored in the decoded picture buffer (DPB) as a reference for inter-picture prediction. [13]
The DBF is similar to the one used by H.264/MPEG-4 AVC but with a simpler design and better support for parallel processing. [13] In HEVC the DBF only applies to a 8×8 sample grid while with H.264/MPEG-4 AVC the DBF applies to a 4×4 sample grid. [13] DBF uses a 8×8 sample grid since it causes no noticeable degradation and significantly improves parallel processing because the DBF no longer causes cascading interactions with other operations. [13] Another change is that HEVC only allows for three DBF strengths of 0 to 2. [13] HEVC also requires that the DBF first apply horizontal filtering for vertical edges to the picture and only after that does it apply vertical filtering for horizontal edges to the picture. [13] This allows for multiple parallel threads to be used for the DBF. [13]
The SAO filter is applied after the DBF and is designed to allow for better reconstruction of the original signal amplitudes by applying offsets stored in a lookup table in the bitstream. [13] [146] Per CTB the SAO filter can be disabled or applied in one of two modes: edge offset mode or band offset mode. [13] [146] The edge offset mode operates by comparing the value of a sample to two of its eight neighbors using one of four directional gradient patterns. [13] [146] Based on a comparison with these two neighbors, the sample is classified into one of five categories: minimum, maximum, an edge with the sample having the lower value, an edge with the sample having the higher value, or monotonic. [13] [146] For each of the first four categories an offset is applied. [13] [146] The band offset mode applies an offset based on the amplitude of a single sample. [13] [146] A sample is categorized by its amplitude into one of 32 bands (histogram bins). [13] [146] Offsets are specified for four consecutive of the 32 bands, because in flat areas which are prone to banding artifacts, sample amplitudes tend to be clustered in a small range. [13] [146] The SAO filter was designed to increase picture quality, reduce banding artifacts, and reduce ringing artifacts. [13] [146]
Range extensions in MPEG are additional profiles, levels, and techniques that support needs beyond consumer video playback: [24]
Within these new profiles came enhanced coding features, many of which support efficient screen encoding or high-speed processing:
HEVC version 2 adds several supplemental enhancement information (SEI) messages:
Additional coding tool options have been added in the March 2016 draft of the screen content coding (SCC) extensions: [150]
The ITU-T version of the standard that added the SCC extensions (approved in December 2016 and published in March 2017) added support for the hybrid log–gamma (HLG) transfer function and the ICtCp color matrix. [65] This allows the fourth version of HEVC to support both of the HDR transfer functions defined in Rec. 2100. [65]
The fourth version of HEVC adds several supplemental enhancement information (SEI) messages which include:
Feature | Version 1 | Version 2 | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Main | Main 10 | Main 12 | Main 4:2:2 10 | Main 4:2:2 12 | Main 4:4:4 | Main 4:4:4 10 | Main 4:4:4 12 | Main 4:4:4 16 Intra | ||
Bit depth | 8 | 8 to 10 | 8 to 12 | 8 to 10 | 8 to 12 | 8 | 8 to 10 | 8 to 12 | 8 to 16 | |
Chroma sampling formats | 4:2:0 | 4:2:0 | 4:2:0 | 4:2:0/ 4:2:2 | 4:2:0/ 4:2:2 | 4:2:0/ 4:2:2/ 4:4:4 | 4:2:0/ 4:2:2/ 4:4:4 | 4:2:0/ 4:2:2/ 4:4:4 | 4:2:0/ 4:2:2/ 4:4:4 | |
4:0:0 (Monochrome) | No | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | |
High precision weighted prediction | No | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | |
Chroma QP offset list | No | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | |
Cross-component prediction | No | No | No | No | No | Yes | Yes | Yes | Yes | |
Intra smoothing disabling | No | No | No | No | No | Yes | Yes | Yes | Yes | |
Persistent Rice adaptation | No | No | No | No | No | Yes | Yes | Yes | Yes | |
RDPCM implicit/explicit | No | No | No | No | No | Yes | Yes | Yes | Yes | |
Transform skip block sizes larger than 4×4 | No | No | No | No | No | Yes | Yes | Yes | Yes | |
Transform skip context/rotation | No | No | No | No | No | Yes | Yes | Yes | Yes | |
Extended precision processing | No | No | No | No | No | No | No | No | Yes |
Version 1 of the HEVC standard defines three profiles: Main, Main 10, and Main Still Picture. [24] Version 2 of HEVC adds 21 range extensions profiles, two scalable extensions profiles, and one multi-view profile. [24] HEVC also contains provisions for additional profiles. [24] Extensions that were added to HEVC include increased bit depth, 4:2:2/4:4:4 chroma sampling, Multiview Video Coding (MVC), and Scalable Video Coding (SVC). [13] [153] The HEVC range extensions, HEVC scalable extensions, and HEVC multi-view extensions were completed in July 2014. [154] [155] [156] In July 2014 a draft of the second version of HEVC was released. [154] Screen content coding (SCC) extensions were under development for screen content video, which contains text and graphics, with an expected final draft release date of 2015. [157] [158]
A profile is a defined set of coding tools that can be used to create a bitstream that conforms to that profile. [13] An encoder for a profile may choose which coding tools to use as long as it generates a conforming bitstream while a decoder for a profile must support all coding tools that can be used in that profile. [13]
The Main profile allows for a bit depth of 8 bits per sample with 4:2:0 chroma sampling, which is the most common type of video used with consumer devices. [13] [24] [155]
The Main 10 (Main10
) profile was added at the October 2012 HEVC meeting based on proposal JCTVC-K0109 which proposed that a 10-bit profile be added to HEVC for consumer applications. The proposal said this was to allow for improved video quality and to support the Rec. 2020 color space that has become widely used in UHDTV systems and to be able to deliver higher dynamic range and color fidelity avoiding the banding artifacts. A variety of companies supported the proposal which included Ateme, BBC, BSkyB, Cisco, DirecTV, Ericsson, Motorola Mobility, NGCodec, NHK, RAI, ST, SVT, Thomson Video Networks, Technicolor, and ViXS Systems. [159] The Main 10 profile allows for a bit depth of 8 to 10 bits per sample with 4:2:0 chroma sampling. HEVC decoders that conform to the Main 10 profile must be capable of decoding bitstreams made with the following profiles: Main and Main 10. [24] A higher bit depth allows for a greater number of colors. 8 bits per sample allows for 256 shades per primary color (a total of 16.78 million colors) while 10 bits per sample allows for 1024 shades per primary color (a total of 1.07 billion colors). A higher bit depth allows for a smoother transition of color which resolves the problem known as color banding. [160] [161]
The Main 10 profile allows for improved video quality since it can support video with a higher bit depth than what is supported by the Main profile. [159] Additionally, in the Main 10 profile 8-bit video can be coded with a higher bit depth of 10 bits, which allows improved coding efficiency compared to the Main profile. [162] [163] [164]
Ericsson said the Main 10 profile would bring the benefits of 10 bits per sample video to consumer TV. They also said that for higher resolutions there is no bit rate penalty for encoding video at 10 bits per sample. [160] Imagination Technologies said that 10-bit per sample video would allow for larger color spaces and is required for the Rec. 2020 color space that will be used by UHDTV. They also said the Rec. 2020 color space would drive the widespread adoption of 10-bit-per-sample video. [161] [165]
In a PSNR based performance comparison released in April 2013 the Main 10 profile was compared to the Main profile using a set of 3840×2160 10-bit video sequences. The 10-bit video sequences were converted to 8 bits for the Main profile and remained at 10 bits for the Main 10 profile. The reference PSNR was based on the original 10-bit video sequences. In the performance comparison the Main 10 profile provided a 5% bit rate reduction for inter frame video coding compared to the Main profile. The performance comparison states that for the tested video sequences the Main 10 profile outperformed the Main profile. [166]
Still image coding standard (test method) | Average bit rate reduction compared to | |
---|---|---|
JPEG 2000 | JPEG | |
HEVC (PSNR) | 20% | 62% |
HEVC (MOS) | 31% | 43% |
The Main Still Picture (MainStillPicture
) profile allows for a single still picture to be encoded with the same constraints as the Main profile. As a subset of the Main profile the Main Still Picture profile allows for a bit depth of 8 bits per sample with 4:2:0 chroma sampling. [13] [24] [155] An objective performance comparison was done in April 2012 in which HEVC reduced the average bit rate for images by 56% compared to JPEG. [168] A PSNR based performance comparison for still image compression was done in May 2012 using the HEVC HM 6.0 encoder and the reference software encoders for the other standards. For still images HEVC reduced the average bit rate by 15.8% compared to H.264/MPEG-4 AVC, 22.6% compared to JPEG 2000, 30.0% compared to JPEG XR, 31.0% compared to WebP, and 43.0% compared to JPEG. [169]
A performance comparison for still image compression was done in January 2013 using the HEVC HM 8.0rc2 encoder, Kakadu version 6.0 for JPEG 2000, and IJG version 6b for JPEG. The performance comparison used PSNR for the objective assessment and mean opinion score (MOS) values for the subjective assessment. The subjective assessment used the same test methodology and images as those used by the JPEG committee when it evaluated JPEG XR. For 4:2:0 chroma sampled images the average bit rate reduction for HEVC compared to JPEG 2000 was 20.26% for PSNR and 30.96% for MOS while compared to JPEG it was 61.63% for PSNR and 43.10% for MOS. [167]
A PSNR based HEVC performance comparison for still image compression was done in April 2013 by Nokia. HEVC has a larger performance improvement for higher resolution images than lower resolution images and a larger performance improvement for lower bit rates than higher bit rates. For lossy compression to get the same PSNR as HEVC took on average 1.4× more bits with JPEG 2000, 1.6× more bits with JPEG-XR, and 2.3× more bits with JPEG. [170]
A compression efficiency study of HEVC, JPEG, JPEG XR, and WebP was done in October 2013 by Mozilla. The study showed that HEVC was significantly better at compression than the other image formats that were tested. Four different methods for comparing image quality were used in the study which were Y-SSIM, RGB-SSIM, IW-SSIM, and PSNR-HVS-M. [171] [172]
Version 2 of HEVC adds 21 range extensions profiles, two scalable extensions profiles, and one multi-view profile: Monochrome, Monochrome 12, Monochrome 16, Main 12, Main 4:2:2 10, Main 4:2:2 12, Main 4:4:4, Main 4:4:4 10, Main 4:4:4 12, Monochrome 12 Intra, Monochrome 16 Intra, Main 12 Intra, Main 4:2:2 10 Intra, Main 4:2:2 12 Intra, Main 4:4:4 Intra, Main 4:4:4 10 Intra, Main 4:4:4 12 Intra, Main 4:4:4 16 Intra, Main 4:4:4 Still Picture, Main 4:4:4 16 Still Picture, High Throughput 4:4:4 16 Intra, Scalable Main, Scalable Main 10, and Multiview Main. [24] [173] All of the inter frame range extensions profiles have an Intra profile. [24]
HbrFactor
12 times higher than other HEVC profiles, allowing it to have a maximum bit rate 12 times higher than the Main 4:4:4 16 Intra profile. [24] [174] The High Throughput 4:4:4 16 Intra profile is designed for high end professional content creation and decoders for this profile are not required to support other profiles. [174] This section may require cleanup to meet Wikipedia's quality standards. The specific problem is: Description length is getting a little out of hand. Try saying "this profile" and "following profiles, in addition to those mandatory for $PROFILE".(November 2023) |
Version 3 of HEVC added one 3D profile: 3D Main. The February 2016 draft of the screen content coding extensions added seven screen content coding extensions profiles, three high throughput extensions profiles, and four scalable extensions profiles: Screen-Extended Main, Screen-Extended Main 10, Screen-Extended Main 4:4:4, Screen-Extended Main 4:4:4 10, Screen-Extended High Throughput 4:4:4, Screen-Extended High Throughput 4:4:4 10, Screen-Extended High Throughput 4:4:4 14, High Throughput 4:4:4, High Throughput 4:4:4 10, High Throughput 4:4:4 14, Scalable Monochrome, Scalable Monochrome 12, Scalable Monochrome 16, and Scalable Main 4:4:4. [24] [150]
The HEVC standard defines two tiers, Main and High, and thirteen levels. A level is a set of constraints for a bitstream. For levels below level 4 only the Main tier is allowed. The Main tier is a lower tier than the High tier. The tiers were made to deal with applications that differ in terms of their maximum bit rate. The Main tier was designed for most applications while the High tier was designed for very demanding applications. A decoder that conforms to a given tier/level is required to be capable of decoding all bitstreams that are encoded for that tier/level and for all lower tiers/levels. [13] [24]
Level | Max luma sample rate (samples/s) | Max luma picture size (samples) | Max bit rate for Main and Main 10 profiles (kbit/s) [A] | Example picture resolution @ highest frame rate [B] (MaxDpbSize [C] ) More/Fewer examples | |
---|---|---|---|---|---|
Main tier | High tier | ||||
1 | 552,960 | 36,864 | 128 | – | 128×96@33.7 (6) 176×144@15 (6) |
2 | 3,686,400 | 122,880 | 1,500 | – | 176×144@100 (16) 352×288@30 (6) |
2.1 | 7,372,800 | 245,760 | 3,000 | – | 352×288@60 (12) 640×360@30 (6) |
3 | 16,588,800 | 552,960 | 6,000 | – | 640×360@67.5 (12) 960×540@30 (6)720×576@37.5 (8) |
3.1 | 33,177,600 | 983,040 | 10,000 | – | 720×576@75 (12) 1280×720@33.7 (6)960×540@60 (8) |
4 | 66,846,720 | 2,228,224 | 12,000 | 30,000 | 1,280×720@68 (12) 2,048×1,080@30.0 (6)1,920×1,080@32 (6) |
4.1 | 133,693,440 | 20,000 | 50,000 | 1,280×720@136 (12) 2,048×1,080@60 (6)1,920×1,080@64 (6) | |
5 | 267,386,880 | 8,912,896 | 25,000 | 100,000 | 1,920×1,080@128 (16) 4,096×2,160@30 (6)3,840×2,160@32 (6) |
5.1 | 534,773,760 | 40,000 | 160,000 | 1,920×1,080@256 (16) 4,096×2,160@60 (6)3,840×2,160@64 (6) | |
5.2 | 1,069,547,520 | 60,000 | 240,000 | 1,920×1,080@300 (16) 4,096×2,160@120 (6)3,840×2,160@128 (6) | |
6 | 1,069,547,520 | 35,651,584 | 60,000 | 240,000 | 3,840×2,160@128 (16) 8,192×4,320@30 (6)7,680×4,320@32 (6) |
6.1 | 2,139,095,040 | 120,000 | 480,000 | 3,840×2,160@256 (16) 8,192×4,320@60 (6)7,680×4,320@64 (6) | |
6.2 | 4,278,190,080 | 240,000 | 800,000 | 3,840×2,160@300 (16) 8,192×4,320@120 (6)7,680×4,320@128 (6) |
Previously decoded pictures are stored in a decoded picture buffer (DPB), and are used by HEVC encoders to form predictions for subsequent pictures. The maximum number of pictures that can be stored in the DPB, called the DPB capacity, is 6 (including the current picture) for all HEVC levels when operating at the maximum picture size supported by the level. The DPB capacity (in units of pictures) increases from 6 to 8, 12, or 16 as the picture size decreases from the maximum picture size supported by the level. The encoder selects which specific pictures are retained in the DPB on a picture-by-picture basis, so the encoder has the flexibility to determine for itself the best way to use the DPB capacity when encoding the video content. [24]
MPEG has published an amendment which added HEVC support to the MPEG transport stream used by ATSC, DVB, and Blu-ray Disc; MPEG decided not to update the MPEG program stream used by DVD-Video. [175] [176] MPEG has also added HEVC support to the ISO base media file format. [177] [178] HEVC is also supported by the MPEG media transport standard. [175] [179] Support for HEVC was added to Matroska starting with the release of MKVToolNix v6.8.0 after a patch from DivX was merged. [180] [181] A draft document has been submitted to the Internet Engineering Task Force which describes a method to add HEVC support to the Real-time Transport Protocol. [182]
Using HEVC's intra frame encoding, a still-image coded format called Better Portable Graphics (BPG) has been proposed by the programmer Fabrice Bellard. [183] It is essentially a wrapper for images coded using the HEVC Main 4:4:4 16 Still Picture profile with up to 14 bits per sample, although it uses an abbreviated header syntax and adds explicit support for Exif, ICC profiles, and XMP metadata. [183] [184]
License terms and fees for HEVC patents, compared with its main competitors:
Video format | Licensor | Codec royalties | Codec royalty exemptions | Codec royalty annual cap | Content distribution fee |
---|---|---|---|---|---|
HEVC | MPEG LA | ▪ US$0.20 per unit | ▪ First 100k units each year [49] | ▪ US$25 million | ▪ US$0 |
HEVC Advance | Region 1: ▪ US$0.40 (mobile) ▪ US$1.20 (4K TV) ▪ US$0.20-0.80 (other) Region 2: ▪ US$0.20 (mobile) ▪ US$0.60 (4K TV) ▪ US$0.20–0.40 (other) [185] | ▪ US$25,000 each year [186] ▪ Most software HEVC implementation distributed to consumer devices after first sale [187] | ▪ US$40 million | Physical distribution: ▪ $0.0225 per disc/title (Region 1) [188] ▪ $0.01125 per disc/title (Region 2) [188] Non-physical distribution: ▪ US$0 [189] | |
Technicolor | tailor-made agreements [58] | ▪ US$0 [58] | |||
Velos Media [62] | ? | ▪ Presumed to charge royalty [190] | |||
others (AT&T, Microsoft, Motorola, Nokia, Cisco, ...) [52] [191] [192] | ? | ||||
AVC | MPEG LA | Codecs to end users and OEM for PC but not part of PC OS: ▪ US$0.20: 100k+ units/year ▪ US$0.10: 5M+ units/year Branded OEM Codecs for PC OS: ▪ US$0.20: 100k+ units/year ▪ US$0.10: 5M+ units/year [193] | Codecs to end users and OEM for PC but not part of PC OS: ▪ First 100k units each year Branded OEM Codecs for PC OS: ▪ First 100k units each year [193] | Codecs to end users and OEM for PC but not part of PC OS: ▪ US$9.75 million (for 2017-20 period) Branded OEM Codecs for PC OS: ▪ US$9.75 million (for 2017-20 period) [193] | Free Television: ▪ one time $2,500 per transmission encoder, or ▪ $2,500...$10,000 annual fee Internet Broadcast: ▪ US$0 Paid Subscriber Model: ▪ $0/yr: 0k...100k subscribers ▪ $25,000/yr: 100k...250k subscribers ▪ $50,000/yr: 250k...500k subscribers ▪ $75,000/yr: 500k...1M subscribers ▪ $100,000/yr: 1M+ subscribers Paid by Title Model: ▪ 0...12 min: no royalty ▪ 12+ min: lower of 2% or US$0.02/title Maximum Annual Content Related Royalty: ▪ US$8.125 million |
others (Nokia, Qualcomm, Broadcomm, Blackberry, Texas Instruments, MIT) [194] | ? | ||||
AV1 | Alliance for Open Media | ▪ US$0 | — | ▪ US$0 | |
Daala | Mozilla & Xiph.org | ▪ US$0 | — | ▪ US$0 | |
VP9 | ▪ US$0 | — | ▪ US$0 |
As with its predecessor AVC, software distributors that implement HEVC in products must pay a price per distributed copy. [i] While this licensing model is manageable for paid software, it is an obstacle to most free and open-source software, which is meant to be freely distributable. In the opinion of MulticoreWare, the developer of x265, enabling royalty-free software encoders and decoders is in the interest of accelerating HEVC adoption. [191] [195] [196] HEVC Advance made an exception that specifically waives the royalties on software-only implementations (both decoders and encoders) when not bundled with hardware. [197] However, the exempted software is not free from the licensing obligations of other patent holders (e.g. members of the MPEG LA pool).
While the obstacle to free software is no concern in for example TV broadcast networks, this problem, combined with the prospect of future collective lock-in to the format, makes several organizations like Mozilla (see OpenH264) and the Free Software Foundation Europe [198] wary of royalty-bearing formats for internet use. Competing formats intended for internet use (VP9 and AV1) are intended to steer clear of these concerns by being royalty free (provided there are no third-party claims of patent rights).
^i : Regardless of how the software is licensed from the software authors (see software licensing), if what it does is patented, its use remains bound by the patent holders' rights unless the use of the patents has been authorized by a license.
In October 2015, MPEG and VCEG formed Joint Video Exploration Team (JVET) [199] to evaluate available compression technologies and study the requirements for a next-generation video compression standard. The new algorithm should have 30–50% better compression rate for the same perceptual quality, with support for lossless and subjectively lossless compression. It should also support YCbCr 4:4:4, 4:2:2 and 4:2:0 with 10 to 16 bits per component, BT.2100 wide color gamut and high dynamic range (HDR) of more than 16 stops (with peak brightness of 1,000, 4,000 and 10,000 nits), auxiliary channels (for depth, transparency, etc.), variable and fractional frame rates from 0 to 120 Hz, scalable video coding for temporal (frame rate), spatial (resolution), SNR, color gamut and dynamic range differences, stereo/multiview coding, panoramic formats, and still picture coding. Encoding complexity of 10 times that of HEVC is expected. JVET issued a final "Call for Proposals" in October 2017, with the first working draft of the Versatile Video Coding (VVC) standard released in April 2018. [200] [201] The VVC standard was finalized on July 6, 2020. [202]
A video codec is software or hardware that compresses and decompresses digital video. In the context of video compression, codec is a portmanteau of encoder and decoder, while a device that only compresses is typically called an encoder, and one that only decompresses is a decoder.
Chroma subsampling is the practice of encoding images by implementing less resolution for chroma information than for luma information, taking advantage of the human visual system's lower acuity for color differences than for luminance.
Advanced Video Coding (AVC), also referred to as H.264 or MPEG-4 Part 10, is a video compression standard based on block-oriented, motion-compensated coding. It is by far the most commonly used format for the recording, compression, and distribution of video content, used by 91% of video industry developers as of September 2019. It supports a maximum resolution of 8K UHD.
H.261 is an ITU-T video compression standard, first ratified in November 1988. It is the first member of the H.26x family of video coding standards in the domain of the ITU-T Study Group 16 Video Coding Experts Group. It was the first video coding standard that was useful in practical terms.
H.262 or MPEG-2 Part 2 is a video coding format standardised and jointly maintained by ITU-T Study Group 16 Video Coding Experts Group (VCEG) and ISO/IEC Moving Picture Experts Group (MPEG), and developed with the involvement of many companies. It is the second part of the ISO/IEC MPEG-2 standard. The ITU-T Recommendation H.262 and ISO/IEC 13818-2 documents are identical.
MPEG-4 Part 2, MPEG-4 Visual is a video compression format developed by the Moving Picture Experts Group (MPEG). It belongs to the MPEG-4 ISO/IEC standards. It uses block-wise motion compensation and a discrete cosine transform (DCT), similar to previous standards such as MPEG-1 Part 2 and H.262/MPEG-2 Part 2.
Α video codec is software or a device that provides encoding and decoding for digital video, and which may or may not include the use of video compression and/or decompression. Most codecs are typically implementations of video coding formats.
The macroblock is a processing unit in image and video compression formats based on linear block transforms, typically the discrete cosine transform (DCT). A macroblock typically consists of 16×16 samples, and is further subdivided into transform blocks, and may be further subdivided into prediction blocks. Formats which are based on macroblocks include JPEG, where they are called MCU blocks, H.261, MPEG-1 Part 2, H.262/MPEG-2 Part 2, H.263, MPEG-4 Part 2, and H.264/MPEG-4 AVC. In H.265/HEVC, the macroblock as a basic processing unit has been replaced by the coding tree unit.
Gary Joseph Sullivan is an American electrical engineer who led the development of the AVC, HEVC, and VVC video coding standards and created the DirectX Video Acceleration (DXVA) API/DDI video decoding feature of the Microsoft Windows operating system. He is currently Director of Video Research and Standards at Dolby Laboratories and is the chair of ISO/IEC JTC 1/SC 29 and of the ITU-T Video Coding Experts Group (VCEG).
A deblocking filter is a video filter applied to decoded compressed video to improve visual quality and prediction performance by smoothing the sharp edges which can form between macroblocks when block coding techniques are used. The filter aims to improve the appearance of decoded pictures. It is a part of the specification for both the SMPTE VC-1 codec and the ITU H.264 codec.
Video Acceleration API (VA-API) is an open source application programming interface that allows applications such as VLC media player or GStreamer to use hardware video acceleration capabilities, usually provided by the graphics processing unit (GPU). It is implemented by the free and open-source library libva, combined with a hardware-specific driver, usually provided together with the GPU driver.
AVC-Intra is a type of video coding developed by Panasonic, and then supported in products made by other companies. AVC-Intra is available in Panasonic's high definition broadcast products, such as, for example, their P2 card equipped broadcast cameras.
x265 is an encoder for creating digital video streams in the High Efficiency Video Coding (HEVC/H.265) video compression format developed by the Joint Collaborative Team on Video Coding (JCT-VC). It is available as a command-line app or a software library, under the terms of GNU General Public License (GPL) version 2 or later; however, customers may request a commercial license.
A video coding format is a content representation format of digital video content, such as in a data file or bitstream. It typically uses a standardized video compression algorithm, most commonly based on discrete cosine transform (DCT) coding and motion compensation. A specific software, firmware, or hardware implementation capable of compression or decompression in a specific video coding format is called a video codec.
High Efficiency Video Coding tiers and levels are constraints that define a High Efficiency Video Coding (HEVC) bitstream in terms of maximum bit rate, maximum luma sample rate, maximum luma picture size, minimum compression ratio, maximum number of slices allowed, and maximum number of tiles allowed. Lower tiers are more constrained than higher tiers and lower levels are more constrained than higher levels.
Coding tree unit (CTU) is the basic processing unit of the High Efficiency Video Coding (HEVC) video standard and conceptually corresponds in structure to macroblock units that were used in several previous video standards. CTU is also referred to as largest coding unit (LCU).
High Efficiency Video Coding implementations and products covers the implementations and products of High Efficiency Video Coding (HEVC).
High Efficiency Image File Format (HEIF) is a digital container format for storing individual digital images and image sequences. The standard covers multimedia files that can also include other media streams, such as timed text, audio and video.
Versatile Video Coding (VVC), also known as H.266, ISO/IEC 23090-3, and MPEG-I Part 3, is a video compression standard finalized on 6 July 2020, by the Joint Video Experts Team (JVET) of the VCEG working group of ITU-T Study Group 16 and the MPEG working group of ISO/IEC JTC 1/SC 29. It is the successor to High Efficiency Video Coding. It was developed with two primary goals – improved compression performance and support for a very broad range of applications.
{{cite web}}
: CS1 maint: multiple names: authors list (link){{cite web}}
: CS1 maint: multiple names: authors list (link)Since they haven't, many producers presume that the pool will impose content royalties.
A number of important companies with HEVC patents have not yet joined one of the patent pools. (…) To accelerate HEVC adoption, I propose that HEVC patent licensors agree to the following principles; · Software decoding on consumer devices must be royalty free. · Software encoding on consumer devices must be royalty free. · Content distribution must be royalty free.
Transforms are identical to H.265/HEVC (Cisco IPR)