MPEG-4 Part 2

MPEG-4 Part 2, also known as MPEG-4 Visual (formally ISO/IEC 14496-2 [1]), is a video compression format developed by the Moving Picture Experts Group (MPEG). It is part of the MPEG-4 family of ISO/IEC standards. It uses block-wise motion compensation and a discrete cosine transform (DCT), similar to previous standards such as MPEG-1 Part 2 and H.262/MPEG-2 Part 2.

Several popular codecs, including DivX, Xvid, and Nero Digital, implement this standard. MPEG-4 Part 10, commonly referred to as H.264 or AVC, defines a different format from MPEG-4 Part 2; it was jointly developed by ITU-T and MPEG.

MPEG-4 Part 2 is H.263-compatible in the sense that a basic H.263 bitstream is correctly decoded by an MPEG-4 Video decoder; that is, such a decoder is natively capable of decoding a basic form of H.263. [2] [3] [4] In MPEG-4 Visual, there are two types of video object layer: one that provides full MPEG-4 functionality, and a reduced-functionality layer with short headers, which provides bitstream compatibility with baseline H.263. [5] MPEG-4 Part 2 is partially based on ITU-T H.263. [6] The first MPEG-4 Video Verification Model (a simulation and test model) used ITU-T H.263 coding tools together with shape coding. [7]
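
The distinction between the two layer types can be illustrated with a small Python sketch that looks at the first bytes of a stream. It is a minimal sketch under stated assumptions: that the data begins exactly on a start code, that the full-functionality syntax uses byte-aligned start codes with the prefix 0x000001, and that the short-header syntax begins with the 22-bit H.263 picture start code (sixteen zero bits followed by 100000). These bit patterns are not given in this article and should be checked against the specifications; the function name is hypothetical.

```python
def classify_start(data: bytes) -> str:
    """Guess which header style a stream starts with (illustrative only)."""
    if len(data) < 3:
        return "unknown"
    if data[0] == 0x00 and data[1] == 0x00:
        # Assumed MPEG-4 start code prefix: 0x00 0x00 0x01
        if data[2] == 0x01:
            return "mpeg4-start-code"
        # Assumed short-header marker: 16 zero bits, then the bits 100000
        # in the top six bits of the third byte.
        if data[2] & 0xFC == 0x80:
            return "short-header"
    return "unknown"
```

A real decoder would go on to parse the fields that follow the marker; this sketch only shows that the two cases can be told apart from the first three bytes under the assumptions above.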

History

The MPEG-4 Visual format was developed by the Moving Picture Experts Group (MPEG) committee. The specification was authored by Iranian-Swiss engineer Touradj Ebrahimi (later the president of JPEG) and Dutch engineer Caspar Horne. [5] The standard was developed using patents from over a dozen organizations, listed by MPEG LA in a patent pool. The majority of patents used for the MPEG-4 Visual format were from three Japanese companies: Mitsubishi Electric (255 patents), Hitachi (206 patents), and Panasonic (200 patents). [8] See Patent holders below for a full list of patent holders.

Editions

MPEG-4 Visual editions [9]
| Edition | Release date | Latest amendment | Standard | Description |
|---|---|---|---|---|
| First edition | 1999 | 2000 | ISO/IEC 14496-2:1999 [10] | |
| Second edition | 2001 | 2003 | ISO/IEC 14496-2:2001 [11] | |
| Third edition | 2004 | 2009 [1] | ISO/IEC 14496-2:2004 [1] | |

Profiles

To address various applications ranging from low-quality, low-resolution surveillance cameras to high definition TV broadcasting and DVDs, many video standards group features into profiles and levels. MPEG-4 Part 2 contains approximately 21 profiles.

The most commonly deployed profiles are the Advanced Simple Profile (ASP or AS Profile) and the Simple Profile (SP), which is a subset of the ASP.

Other profiles include the Advanced Coding Efficiency Profile (ACEP or ACE Profile), the Advanced Real Time Simple Profile (ARTSP or ARTS Profile), the Core Profile (CP), the Main Profile (MP), and the Simple Studio Profile (SStP or SSt Profile).

Most video compression schemes standardize the bitstream (and thus the decoder), leaving the encoder design to individual implementations. Implementations of a particular profile, such as DivX and Nero Digital (which implement the Advanced Simple Profile) and Xvid (which implements both the Simple and Advanced Simple profiles), are therefore all technically identical on the decoder side. By comparison, an MP3 file can be played in any MP3 player, whether it was created with iTunes, Windows Media Player, LAME, or the common Fraunhofer encoder.

Simple Profile (SP)

The Simple Profile is mainly intended for situations where a low bit rate and low resolution are imposed by other constraints of the application, such as network bandwidth or device size. Examples include mobile phones, some low-end video conferencing systems, and electronic surveillance systems.

Levels

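| Level | Max. buffer (kbits) | Max. bitrate (kbit/s) | Max. delay at max. bitrate (s) | Max. VP size (bits) | Max. VOP size (macroblocks) | Max. decoder rate (macroblocks/s) | Max. frame size (pixels) at frame rate |
|---|---|---|---|---|---|---|---|
| 0 | 160 | 64 | 2.50 | 2,048 | 99 | 1,485 | QCIF (176×144) @ 15 Hz |
| 0b | 320 | 128 | 2.50 | 2,048 | 99 | 1,485 | QCIF (176×144) @ 15 Hz |
| 1 | 160 | 64 | 2.50 | 2,048 | 99 | 1,485 | 128×96 @ 30 Hz; 144×96 @ 25 Hz; 160×96 @ 24 Hz; QCIF (176×144) @ 15 Hz |
| 2 | 640 | 128 | 5.00 | 4,096 | 396 | 5,940 | 256×192 @ 30 Hz; 304×192, 288×208 @ 25 Hz; 304×208 @ 24 Hz; CIF (352×288) @ 15 Hz |
| 3 | 640 | 384 | 1.66 | 8,192 | 396 | 11,880 | CIF (352×288) @ 30 Hz |
| 4a | 1,280 | 4,000 | 0.32 | 16,384 | 1,200 | 36,000 | VGA (640×480) @ 30 Hz |
| 5 | 1,792 | 8,000 | 0.22 | 16,384 | 1,620 | 40,500 | D1 NTSC (720×480) @ 30 Hz; D1 PAL (720×576) @ 25 Hz |
| 6 | 3,968 | 12,000 | 0.33 | 16,384 | 3,600 | 108,000 | 720p (1280×720) @ 30 Hz |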
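
The VOP size and decoder rate limits in the table are linked by simple arithmetic: a frame is divided into 16×16-pixel macroblocks, the macroblock count must not exceed the maximum VOP size, and the count multiplied by the frame rate must not exceed the maximum decoder rate. The following Python sketch checks a frame size and frame rate against a few Simple Profile levels whose limits are copied from the table. The function and dictionary names are illustrative and not part of the standard, and the check deliberately ignores the bitrate, buffer, and VP size limits.

```python
import math

# level -> (max VOP size in macroblocks, max decoder rate in macroblocks/s),
# copied from the Simple Profile levels table for a subset of levels.
SIMPLE_PROFILE_LIMITS = {
    "0": (99, 1485),
    "2": (396, 5940),
    "4a": (1200, 36000),
    "5": (1620, 40500),
    "6": (3600, 108000),
}

MACROBLOCK_SIZE = 16  # macroblocks cover 16x16 luma pixels


def macroblocks(width, height):
    """Number of macroblocks needed to cover a width x height frame."""
    return math.ceil(width / MACROBLOCK_SIZE) * math.ceil(height / MACROBLOCK_SIZE)


def fits_level(level, width, height, fps):
    """True if the frame size and rate stay within the level's VOP and rate limits."""
    max_vop, max_rate = SIMPLE_PROFILE_LIMITS[level]
    mbs = macroblocks(width, height)
    return mbs <= max_vop and mbs * fps <= max_rate
```

For example, QCIF (176×144) is 11 × 9 = 99 macroblocks, so at 15 Hz it needs exactly 1,485 macroblocks per second and fits level 0, while the same frame size at 30 Hz does not.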

Advanced Simple Profile (ASP)

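The Advanced Simple Profile was not included in the original standard. Its notable technical features relative to the Simple Profile, which is roughly similar to H.263, include MPEG-style quantization, support for interlaced video, B pictures, quarter-pixel motion compensation, and global motion compensation.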

MPEG quantization and interlace support are designed in much the same way as in MPEG-2 Part 2. B-picture support is designed in much the same way as in MPEG-2 Part 2 and H.263v2.

The quarter-pixel motion compensation feature of ASP was innovative, and was subsequently included (in somewhat different forms) in later designs such as MPEG-4 Part 10, HEVC, VC-1 and VVC. Some implementations of MPEG-4 Part 2 omit support for this feature, because it significantly slows down software decoders and does not always improve quality.
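
To illustrate what quarter-pixel precision means, the following Python sketch samples a one-dimensional row of pixel values at a position expressed in quarter-pixel units, using plain linear interpolation between the two nearest integer positions. This is a simplification for illustration only: the standard specifies its own interpolation filter, which this sketch does not reproduce, and the function name is hypothetical. Positions are assumed to lie between 0 and 4 × (len(row) − 1).

```python
def sample_quarter_pel(row, pos_q):
    """Value of `row` at position pos_q / 4, by linear interpolation (illustrative)."""
    # Split the position into an integer pixel index and a quarter fraction (0..3).
    whole, frac = divmod(pos_q, 4)
    left = row[whole]
    if frac == 0:
        return float(left)
    # Clamp at the right edge so the last pixel is simply repeated.
    right = row[min(whole + 1, len(row) - 1)]
    return left + (right - left) * frac / 4
```

Even this simplified version shows where the extra decoding cost comes from: every predicted sample at a fractional position requires reading several neighbouring pixels and performing arithmetic on them, rather than a straight copy.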

The global motion compensation feature is not supported in most implementations, although the standard officially requires decoders to support it. Most encoders do not support it either, and some experts say that it does not ordinarily provide any compression benefit. When used, ASP's global motion compensation has a large unfavorable impact on speed and adds considerable complexity to the implementation.

Levels

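| Level | Max. buffer (kbits) | Max. bitrate (kbit/s) | Max. delay at max. bitrate (s) | Max. VP size (bits) | Max. VOP size (macroblocks) | Max. decoder rate (macroblocks/s) | Max. frame size (pixels) at frame rate |
|---|---|---|---|---|---|---|---|
| 0 | 160 | 128 | 1.25 | 2,048 | 99 | 2,970 | QCIF (176×144) @ 30 Hz |
| 1 | 160 | 128 | 1.25 | 2,048 | 99 | 2,970 | QCIF (176×144) @ 30 Hz |
| 2 | 640 | 384 | 1.66 | 4,096 | 396 | 5,940 | 256×192 @ 30 Hz; 304×192, 288×208 @ 25 Hz; 304×208 @ 24 Hz; CIF (352×288) @ 15 Hz |
| 3 | 640 | 768 | 0.86 | 4,096 | 396 | 11,880 | CIF (352×288) @ 30 Hz |
| 3b | 1,040 | 1,500 | 0.69 | 4,096 | 396 | 11,880 | CIF (352×288) @ 30 Hz |
| 4 | 1,280 | 3,000 | 0.43 | 8,192 | 792 | 23,760 | 352×576, 704×288 @ 30 Hz |
| 5 | 1,792 | 8,000 | 0.22 | 16,384 | 1,620 | 48,600 | 720×576 @ 30 Hz |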

Simple Studio Profile (SStP)

The Simple Studio Profile has six levels, ranging from SDTV to 4K resolution. [12] SStP allows for up to 12-bit bit depth and up to 4:4:4 chroma subsampling, [12] using intra-frame coding only. [13] SStP is used by HDCAM SR. [12]

Levels

Levels with maximum property values [12]

| Level | Max. bit depth and chroma subsampling | Max. resolution and frame rate | Max. data rate (Mbit/s) |
|---|---|---|---|
| 1 | 10-bit 4:2:2 | SDTV | 180 |
| 2 | 10-bit 4:2:2 | 1920×1080 30p/30i | 600 |
| 3 | 12-bit 4:4:4 | 1920×1080 30p/30i | 900 |
| 4 | 12-bit 4:4:4 | 2K×2K 30p | 1,350 |
| 5 | 12-bit 4:4:4 | 4K×2K 30p | 1,800 |
| 6 | 12-bit 4:4:4 | 4K×2K 60p | 3,600 |
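
For a sense of scale, the maximum data rates listed for each level can be compared with the uncompressed data rate of the corresponding video. The Python sketch below estimates the raw rate from resolution, frame rate, bit depth, and chroma format. It counts active picture samples only (no blanking or audio), and the helper name and the samples-per-pixel table are illustrative assumptions rather than anything defined by the standard.

```python
# Average samples per pixel (one luma plus two chroma planes) for common
# chroma formats; an assumption of this sketch, not taken from the standard.
SAMPLES_PER_PIXEL = {"4:2:0": 1.5, "4:2:2": 2.0, "4:4:4": 3.0}


def raw_rate_mbit_s(width, height, fps, bit_depth, chroma):
    """Uncompressed video data rate in Mbit/s (active picture only)."""
    bits_per_frame = width * height * SAMPLES_PER_PIXEL[chroma] * bit_depth
    return bits_per_frame * fps / 1_000_000
```

Under these assumptions, 1920×1080 at 30 frames per second in 10-bit 4:2:2 comes to about 1,244 Mbit/s uncompressed, roughly twice the 600 Mbit/s maximum of level 2.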

Patent holders

MPEG-4 Part 2 patents have expired worldwide, with the sole exception of Brazil. The last US patent expired on November 14, 2023. [14] The following organizations held patents for MPEG-4 Visual technology, as listed in the patent pool administered by MPEG LA.

Organization [15] Patents [8]
Mitsubishi Electric 255
Hitachi 206
Panasonic 200
Sun Patent Trust 125
Toshiba 96
Samsung Electronics 92
Sony 84
Philips 73
Sharp Corporation 44
Pantech 36
Robert Bosch GmbH 27
Nippon Telegraph and Telephone 24
GE Technology Development 23
CIF Licensing 20
Dolby 19
Telenor 19
Siemens AG 15
JVC Kenwood 14
Orange S.A. 14
LG Electronics 13
Fujitsu 11
ZTE 10
Google 9
BT Group 3
Calmare Therapeutics 2
Cable Television Laboratories, Inc. 1
Canon Inc. 1
KDDI 1
Microsoft 1
Oki Electric Industry 1
Sanyo 1

Criticisms

MPEG-4 Part 2 has drawn some industry criticism. FFmpeg maintainer Michael Niedermayer has criticised it for, among other things, lacking an in-loop deblocking filter, for global motion compensation (GMC) being too computationally intensive, and for overlapped block motion compensation (OBMC) being defined but not allowed in any profile. [16] Microsoft's Ben Waggoner has stated: "Microsoft (well before my time) went down the codec standard route before with MPEG-4 part 2, which turns out to be a profound disappointment across the industry - it didn't offer that much of a compression advantage over MPEG-2, and the protracted license agreement discussions scared off a lot of adoption. I was involved in many digital media projects that wouldn't even touch MPEG-4 in the late 1990s to early 2000s because there was going to be a 'content fee' that hadn't been fully defined yet." [17]

Notes

  1. ISO. "ISO/IEC 14496-2:2004 - Information technology -- Coding of audio-visual objects -- Part 2: Visual". ISO. Retrieved 2009-11-01.
  2. chiariglione.org (2006-08-10). "Riding the Media Bits, End of the Ride?". Archived from the original on 2011-11-01. Retrieved 2010-03-10.
  3. chiariglione.org (2003-10-25). "Riding the Media Bits, Inside MPEG-4 - Part B". Archived from the original on 2011-11-01. Retrieved 2010-03-10.
  4. ISO/IEC JTC1/SC29/WG11 (March 2000). "MPEG-4 Video - Frequently Asked Questions". chiariglione.org. Retrieved 2010-03-10.
  5. Touradj Ebrahimi and Caspar Horne. "MPEG-4 Natural Video Coding - An overview". chiariglione.org. Archived from the original on 2010-03-22. Retrieved 2010-03-10.
  6. chiariglione.org (2009-09-06). "Riding the Media Bits, The development of MPEG-1 - Part A". Archived from the original on 2011-01-22. Retrieved 2010-03-10.
  7. Fernando Pereira. "MPEG-4: Why, What, How and When?". chiariglione.org. Archived from the original on 2011-10-18. Retrieved 2010-03-10.
  8. "MPEG-4 Visual - Patent List" (PDF). MPEG LA. Retrieved 6 July 2019.
  9. MPEG. "MPEG standards - Full list of standards developed or under development". chiariglione.org. Archived from the original on 2010-04-20. Retrieved 2009-10-31.
  10. ISO. "ISO/IEC 14496-2:1999 - Information technology -- Coding of audio-visual objects -- Part 2: Visual". ISO. Retrieved 2009-11-01.
  11. ISO. "ISO/IEC 14496-2:2001 - Information technology -- Coding of audio-visual objects -- Part 2: Visual". ISO. Retrieved 2009-11-01.
  12. Yasuhiko Mikami; Hugo Gaggioni. "4K End-to-End HPA Technology Retreat 2010" (PDF). Sony. Retrieved 2012-11-28.
  13. Caroline R. Arms; Carl Fleischhauer; Kate Murray (December 2011). "MPEG-4, Visual Coding, Simple Studio Profile". Sustainability of Digital Formats. Library of Congress. Retrieved 9 March 2015.
  14. "MPEG-4 Visual Patent List".
  15. "Licensors Included in the MPEG-4 Visual Patent Portfolio License". MPEG LA. Retrieved 6 July 2019.
  16. Melanson, Mike. "15 reasons why MPEG4 sucks". Lair of the Multimedia Guru. Retrieved 2021-09-11.
  17. "VC-1 and H264 - Page 2". Doom9's Forum.
