JPEG XT

Last updated
JPEG XT
JPEG XT logo.svg
Developed by Joint Photographic Experts Group
Initial releaseJune 8, 2015;9 years ago (2015-06-08)
Type of format lossy and lossless image format
Standard ISO/IEC 18477
Website www.jpeg.org/jpegxt/

JPEG XT (ISO/IEC 18477) is an image compression standard which specifies backward-compatible extensions of the base JPEG standard (ISO/IEC 10918-1 and ITU Rec. T.81).

Contents

JPEG XT extends JPEG with support for higher integer bit depths, high dynamic range imaging and floating-point coding, lossless coding, alpha channel coding, and an extensible file format based on JFIF. It also includes reference software implementation and conformance testing specification.

JPEG XT extensions are backward compatible with base JPEG/JFIF file format - existing software is forward compatible and can read the JPEG XT binary stream, though it would only decode the base 8-bit lossy image. [1]

The JPEG XT standard

JPEG standards are formally named as Information technology – Scalable compression and coding of continuous-tone still images. ISO/IEC 18477 consists of the following parts:

JPEG XT Scalable Compression and Coding of Continuous-Tone Still Images - Parts
PartFirst public release dateISO/IEC NumberITU NumberFormal Title
Part 12015-06 ISO/IEC 18477-1 Core Coding System Specification
Part 22016-07 ISO/IEC 18477-2 Coding of High Dynamic Range Images
Part 32015-12 ISO/IEC 18477-3 Box file format
Part 42017-10 ISO/IEC 18477-4 Conformance Testing
Part 52018-03 ISO/IEC 18477-5 Reference software
Part 62016-02 ISO/IEC 18477-6 IDR Integer coding
Part 72017-05 ISO/IEC 18477-7 HDR Floating-Point Coding
Part 82016-10 ISO/IEC 18477-8 Lossless and Near-lossless Coding
Part 92016-10 ISO/IEC 18477-9 Alpha channel coding

Overview

The core Part 1 of the standard defines the JPEG specifications in common use today, such as ISO/IEC 10918-1 (base format), 10918-5 JPEG File Interchange Format (JFIF), and 10918-6 (printing applications). It restricts the JPEG coding modes to baseline, sequential, and progressive Huffman, and includes JFIF definitions of Rec. 601 color space transformations with YCbCr chroma subsampling. [2] [1] The first specification was authored by Thomas Richter from Germany, Tim Bruylants and Peter Schelkens from Belgium, and Swiss-Iranian engineer Touradj Ebrahimi. [2]

Part 3 Box file format defines an extensible format which is backward-compatible with JFIF. Extensions are based on 'boxes' - 64 KB chunks tagged by application marker 11 ('APP11'), containing enhancement data layers and additional binary metadata describing how to combine them with the base 8-bit layer to form full-precision image. [3] Part 3 builds on the ISO base media file format used by JPEG 2000; similar arrangement was employed in the earlier JPEG-HDR format from Dolby Labs, which is standardized in JPEG XT Part 2. [2]

Part 7 includes floating-point HDR coding tools which produce an enhancement image layer from full-precision image and gamma-corrected tone-mapped 8-bit base image layer. These tools are intended for high dynamic range imaging with multiple photo exposures and computer-generated images which exceed linear 16-bit integer precision. [2]

It defines three main algorithms for reconstructing the HDR image: Profile A uses a common logarithmic scale factor for inverse tone-mapping of the base layer; Profile B uses a divisor image extension layer scaled by the common exposure value; Profile C is similar to A but uses per-component scaling factors and logarithmic space with piece-wise linear functions, which allows lossless encoding. Profile A is based on the Radiance RGBE image format [2] and Profile B is based on the XDepth format from Trellis Management. [4]

Profile D uses a simple algorithm which does not generate an enhancement image – the enhancement layer is used to store extended precision of discrete cosine transform (DCT) transfer coefficients, and non-gamma transfer function is applied to increase dynamic range to 12 bits. Backward compatibility is limited because legacy decoders do not understand new EOTF curves and produce undersaturated colors. [2] Profile D is not implemented in reference software.

JPEG XT also allows mixing of various elements from different profiles in the code stream, allowing extended DCT precision and lossless encoding in all profiles (the 'Full Profile'). [1]

Part 6, Integer coding of Intermediate Dynamic Range (IDR) images, is an extension for coding 9 to 16-bit integer samples typical for RAW sensor data; its coding tools are identical to Part 7 Profile C. [2]

Part 2 defines a HDR imaging implementation based on JPEG-HDR format from Dolby. [5] It uses RGBE image format defined by Part 7 Profile A, supporting both integer and floating point samples; file format is based on Part 3 but uses proprietary text-based metadata syntax. [2]

Part 8 Lossless coding is an extension of integer and floating point coding based on Part 7 Profile C, allowing for scalable lossy to lossless compression. For 10 and 12-bit precision, lossless integer-to-integer DCT is used, which replaces each rotation space with three shearings (similar to wavelet transform in JPEG2000). For 16 bit precision, a lossy fixed-point DCT approximation is specified by the standard and is required for decoders to implement. This makes it possible for the encoder to predict coding errors and store them in the enhancement layer, allowing lossless reconstruction. The error residuals in the enhancement layer can be either uncompressed, or compressed with lossless integer-to-integer DCT. [1] Compression and image quality performance of Part 8 is comparable to PNG. [2] [3]

Part 9 Alpha channel extension allows lossy and lossless coding of transparent images and arbitrarily shaped images. It uses an opacity (transparency) layer, encoded with integer or floating point precision, and metadata to specify if content was pre-multiplied with alpha, or pre-multiplied and blended with background color. [1] [2]

In the future, privacy protection and security extensions would allow encoding of private image regions (or entire images) with reduced resolution, with digitally encrypted enhancement layers to restore full-resolution image only to those having the private decryption key. Only the public regions will be visible to those not having the key. [1] [2]

JPEG-HDR

JPEG XT Part 2 HDR coding is based on Dolby JPEG-HDR format, [5] created in 2005 by Greg Ward [6] from BrightSide Technologies and Maryann Simmons from Walt Disney Feature Animation as a way to store high dynamic range images inside a standard JPEG file. BrightSide Technologies was acquired by Dolby Laboratories in 2007.

The image encoding is based on two-layer RGBE image format used by Radiance renderer, both of which were also created by Ward. Reduction in filesize is achieved by first converting the image into a tone mapped version, then storing a reconstructive multiplier image in APP11 markers in the same JPEG/JFIF file. Ordinary viewing software will ignore the multiplier image allowing anyone to see the tone mapped version of the image presented in a standard dynamic range and color gamut.

JPEG-HDR file format is similar to JPEG XT Part 3 Box file format but uses text-based metadata. [2]

Programs that support JPEG-HDR include Photosphere by Greg Ward [7] and pfstools. [8]

Reference software

libjpeg
Developer(s) Thomas Richter /JPEG Reference AhG
Initial releaseJuly 27, 2012;12 years ago (2012-07-27)
Stable release
1.65 / February 21, 2023;19 months ago (2023-02-21)
License GNU GPLv3; ISO
Website github.com/thorfdbg/libjpeg

ISO/IEC Joint Photography Experts Group maintains a reference software implementation for base JPEG (ISO/IEC 10918-1 and 18477-1) and JPEG XT extensions (ISO/IEC 18477 Parts 2 and 6-9), as well as JPEG-LS (ISO/IEC 14495). A reduced version without JPEG-LS, arithmetic coding, and a mozjpeg-like hierarchical progressive coder is available under the ISO license. [9] As a contestant for the ICIP Grand Challenge, the author also includes some existing JPEG optimization techniques known as "JPEG on steroids" in the library. [10]

A software JPEG-HDR encoder is provided by Dolby Labs; JPEG XT Part 7 Profile B software is provided by XDepth/Trellis Management; an implementation of all remaining parts was provided by the University of Stuttgart.

See also

Related Research Articles

<span class="mw-page-title-main">JPEG</span> Lossy compression method for reducing the size of digital images

JPEG is a commonly used method of lossy compression for digital images, particularly for those images produced by digital photography. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and image quality. JPEG typically achieves 10:1 compression with little perceptible loss in image quality. Since its introduction in 1992, JPEG has been the most widely used image compression standard in the world, and the most widely used digital image format, with several billion JPEG images produced every day as of 2015.

<span class="mw-page-title-main">Image compression</span> Reduction of image size to save storage and transmission costs

Image compression is a type of data compression applied to digital images, to reduce their cost for storage or transmission. Algorithms may take advantage of visual perception and the statistical properties of image data to provide superior results compared with generic data compression methods which are used for other digital data.

<span class="mw-page-title-main">JPEG 2000</span> Image compression standard and coding system

JPEG 2000 (JP2) is an image compression standard and coding system. It was developed from 1997 to 2000 by a Joint Photographic Experts Group committee chaired by Touradj Ebrahimi, with the intention of superseding their original JPEG standard, which is based on a discrete cosine transform (DCT), with a newly designed, wavelet-based method. The standardized filename extension is .jp2 for ISO/IEC 15444-1 conforming files and .jpx for the extended part-2 specifications, published as ISO/IEC 15444-2. The registered MIME types are defined in RFC 3745. For ISO/IEC 15444-1 it is image/jp2.

The JPEG File Interchange Format (JFIF) is an image file format standard published as ITU-T Recommendation T.871 and ISO/IEC 10918-5. It defines supplementary specifications for the container format that contains the image data encoded with the JPEG algorithm. The base specifications for a JPEG container format are defined in Annex B of the JPEG standard, known as JPEG Interchange Format (JIF). JFIF builds over JIF to solve some of JIF's limitations, including unnecessary complexity, component sample registration, resolution, aspect ratio, and color space. Because JFIF is not the original JPG standard, one might expect another MIME type. However, it is still registered as "image/jpeg".

<span class="mw-page-title-main">Joint Photographic Experts Group</span> Non-profit organisation

The Joint Photographic Experts Group (JPEG) is the joint committee between ISO/IEC JTC 1/SC 29 and ITU-T Study Group 16 that created and maintains the JPEG, JPEG 2000, JPEG XR, JPEG XT, JPEG XS, JPEG XL, and related digital image standards. It also has the responsibility for maintenance of the JBIG and JBIG2 standards that were developed by the former Joint Bi-level Image Experts Group.

<span class="mw-page-title-main">Advanced Video Coding</span> Most widely used standard for video compression

Advanced Video Coding (AVC), also referred to as H.264 or MPEG-4 Part 10, is a video compression standard based on block-oriented, motion-compensated coding. It is by far the most commonly used format for the recording, compression, and distribution of video content, used by 91% of video industry developers as of September 2019. It supports a maximum resolution of 8K UHD.

libjpeg Free software library for handling the JPEG image format

libjpeg is a free library with functions for handling the JPEG image data format. It implements a JPEG codec alongside various utilities for handling JPEG data. It is written in C and distributed as free software together with its source code under the terms of a custom permissive (BSD-like) free software license, which demands attribution. The original variant is maintained and published by the Independent JPEG Group (IJG). Meanwhile, there are several forks with additional features.

MPEG-4 Part 2, MPEG-4 Visual is a video compression format developed by the Moving Picture Experts Group (MPEG). It belongs to the MPEG-4 ISO/IEC standards. It uses block-wise motion compensation and a discrete cosine transform (DCT), similar to previous standards such as MPEG-1 Part 2 and H.262/MPEG-2 Part 2.

An image file format is a file format for a digital image. There are many formats that can be used, such as JPEG, PNG, and GIF. Most formats up until 2022 were for storing 2D images, not 3D ones. The data stored in an image file format may be compressed or uncompressed. If the data is compressed, it may be done so using lossy compression or lossless compression. For graphic design applications, vector formats are often used. Some image file formats support transparency.

Lossless JPEG is a 1993 addition to JPEG standard by the Joint Photographic Experts Group to enable lossless compression. However, the term may also be used to refer to all lossless compression schemes developed by the group, including JPEG 2000, JPEG-LS, and JPEG XL.

JPEG XR is an image compression standard for continuous tone photographic images, based on the HD Photo specifications that Microsoft originally developed and patented. It supports both lossy and lossless compression, and is the preferred image format for Ecma-388 Open XML Paper Specification documents.

RGBE or Radiance HDR is an image format invented by Gregory Ward Larson for the Radiance rendering system. It stores pixels as one byte each for RGB values with a one byte shared exponent. Thus it stores four bytes per pixel.

<span class="mw-page-title-main">Progressive Graphics File</span> File format

PGF is a wavelet-based bitmapped image format that employs lossless and lossy data compression. PGF was created to improve upon and replace the JPEG format. It was developed at the same time as JPEG 2000 but with a focus on speed over compression ratio.

High Efficiency Video Coding (HEVC), also known as H.265 and MPEG-H Part 2, is a video compression standard designed as part of the MPEG-H project as a successor to the widely used Advanced Video Coding. In comparison to AVC, HEVC offers from 25% to 50% better data compression at the same level of video quality, or substantially improved video quality at the same bit rate. It supports resolutions up to 8192×4320, including 8K UHD, and unlike the primarily 8-bit AVC, HEVC's higher fidelity Main 10 profile has been incorporated into nearly all supporting hardware.

A video coding format is a content representation format of digital video content, such as in a data file or bitstream. It typically uses a standardized video compression algorithm, most commonly based on discrete cosine transform (DCT) coding and motion compensation. A specific software, firmware, or hardware implementation capable of compression or decompression in a specific video coding format is called a video codec.

Motion JPEG 2000 is a file format for motion sequences of JPEG 2000 images and associated audio, based on the MP4 and QuickTime format. Filename extensions for Motion JPEG 2000 video files are .mj2 and .mjp2, as defined in RFC 3745.

High Efficiency Image File Format (HEIF) is a [[Digital container for storing individual digital images and image sequences. The standard covers multimedia files that can also include other media streams, such as timed text, audio and video.

JPEG XL is a royalty-free raster-graphics file format that supports both lossy and lossless compression. It is designed to outperform existing raster formats and thus become their universal replacement.

JPEG XS is an interoperable, visually lossless, low-latency and lightweight image and video coding system used in professional applications. Applications of the standard include streaming high-quality content for virtual reality, drones, autonomous vehicles using cameras, gaming, and broadcasting. It was the first ISO codec ever designed for this specific purpose. JPEG XS, built on core technology from both intoPIX and Fraunhofer IIS, is formally standardized as ISO/IEC 21122 by the Joint Photographic Experts Group with the first edition published in 2019. Although not official, the XS acronym was chosen to highlight the eXtra Small and eXtra Speed characteristics of the codec. Today, the JPEG committee is still actively working on further improvements to XS, with the second edition scheduled for publication and initial efforts being launched towards a third edition.

References

  1. 1 2 3 4 5 6 Thomas Richter; Alessandro Artusi; Touradj Ebrahimi (2016). "JPEG XT: A new family of JPEG backward-compatible standards" (PDF). IEEE MultiMedia. 23 (July/Sept 2016): 80–88. doi:10.1109/MMUL.2016.49.
  2. 1 2 3 4 5 6 7 8 9 10 11 12 Thomas Richter, Tim Bruylants, Peter Schelkens, Touradj Ebrahimi (22 September 2015). The JPEG XT Suite of Standards: Status and Future Plans Archived 2017-08-09 at the Wayback Machine
  3. 1 2 Alessandro Artusi, Rafal K. Mantiuk, Thomas Richter, Pavel Korshunov, Philippe Hanhart, Touradj Ebrahimi, Massimiliano Agostinelli. JPEG XT: A Compression Standard for HDR and WCG Images (Standards in a Nutshell)
  4. Artusi, Alessandro; Mantiuk, Rafał K.; Richter, Thomas; Hanhart, Philippe; Korshunov, Pavel; Agostinelli, Massimiliano; Ten, Arkady; Ebrahimi, Touradj (2015-12-19). "Overview and evaluation of the JPEG XT HDR image compression standard". Journal of Real-Time Image Processing. 16 (2): 413–428. doi:10.1007/s11554-015-0547-x. ISSN   1861-8200. S2CID   14247192.
  5. 1 2 "Discover home entertainment in Dolby".
  6. "OutBox page for Gregory Ward Larson".
  7. "Anyhere Software". anyhere.com.
  8. "pfstools About". pfstools.sourceforge.net.
  9. "Jpeg - Jpeg Xt".
  10. Richter, Thomas (September 2016). "JPEG on STEROIDS: Common optimization techniques for JPEG image compression". 2016 IEEE International Conference on Image Processing (ICIP). pp. 61–65. doi:10.1109/ICIP.2016.7532319. ISBN   978-1-4673-9961-6. S2CID   14922251.

Publications