Pixel aspect ratio

Last updated
Pixel aspect ratio 1:1 PAR-1to1.svg
Pixel aspect ratio 1:1
Pixel aspect ratio 2:1 PAR-2to1.svg
Pixel aspect ratio 2:1

A Pixel aspect ratio (often abbreviated PAR) is a mathematical ratio that describes how the width of a pixel in a digital image compared to the height of that pixel.

Contents

Most digital imaging systems display an image as a grid of tiny, square pixels. However, some imaging systems, especially those that must be compatible with standard-definition television motion pictures, display an image as a grid of rectangular pixels, in which the pixel width and height are different. Pixel aspect ratio describes this difference.

Use of pixel aspect ratio mostly involves pictures pertaining to standard-definition television and some other exceptional cases. Most other imaging systems, including those that comply with SMPTE standards and practices, use square pixels.

PAR is also known as sample aspect ratio and abbreviated SAR, though it can be confused with storage aspect ratio.

Introduction

The ratio of the width to the height of an image is known as the aspect ratio, or more precisely the display aspect ratio (DAR) – the aspect ratio of the image as displayed; for TV, DAR was traditionally 4:3 (a.k.a. fullscreen), with 16:9 (a.k.a. widescreen) now the standard for HDTV. In digital images, there is a distinction with the storage aspect ratio (SAR), which is the ratio of pixel dimensions. If an image is displayed with square pixels, then these ratios agree; if not, then non-square, "rectangular" pixels are used, and these ratios disagree. The aspect ratio of the pixels themselves is known as the pixel aspect ratio (PAR) – for square pixels this is 1:1 – and these are related by the identity:

SAR × PAR = DAR

Rearranging (solving for PAR) yields:

PAR = DAR / SAR

For example:

In analog images such as film there is no notion of pixel, nor notion of SAR or PAR, but in the digitization of analog images the resulting digital image has pixels, hence SAR (and accordingly PAR, if displayed at the same aspect ratio as the original).

Non-square pixels arise often in early digital TV standards, related to digitalization of analog TV signals – whose vertical and "effective" horizontal resolutions differ and are thus best described by non-square pixels – and also in some digital video cameras and computer display modes, such as Color Graphics Adapter (CGA). Today they arise also in transcoding between resolutions with different SARs.

Actual displays do not generally have non-square pixels, though digital sensors might; they are rather a mathematical abstraction used in resampling images to convert between resolutions.

There are several complicating factors in understanding PAR, particularly as it pertains to digitization of analog video:

Background

Video is presented as a sequential series of images called video frames. Historically, video frames were created and recorded in analog form. As digital display technology, digital broadcast technology, and digital video compression evolved separately, it resulted in video frame differences that must be addressed using pixel aspect ratio. Digital video frames are generally defined as a grid of pixels used to present each sequential image. The horizontal component is defined by pixels (or samples), and is known as a video line. The vertical component is defined by the number of lines, as in 480 lines.

Standard-definition television standards and practices were developed as broadcast technologies and intended for terrestrial broadcasting, and were therefore not designed for digital video presentation. Such standards define an image as an array of well-defined horizontal " Lines ", well-defined vertical "Line Duration" and a well-defined picture center. However, there is not a standard-definition television standard that properly defines image edges or explicitly demands a certain number of picture elements per line. Furthermore, analog video systems such as NTSC 480i and PAL 576i, instead of employing progressively displayed frames, employ fields or interlaced half-frames displayed in an interwoven manner to reduce flicker and double the image rate for smoother motion.

Analog-to-digital conversion

As a result of computers becoming powerful enough to serve as video editing tools, video digital-to-analog converters and analog-to-digital converters were made to overcome this incompatibility. To convert analog video lines into a series of square pixels, the industry adopted a default sampling rate at which luma values were extracted into pixels. The luma sampling rate for 480i pictures was 12+311 MHz and for 576i pictures was 14+34 MHz.

The term pixel aspect ratio was first coined when ITU-R BT.601 (commonly known as Rec. 601 ) specified that standard-definition television pictures are made of lines of exactly 720 non-square pixels. ITU-R BT.601 did not define the exact pixel aspect ratio but did provide enough information to calculate the exact pixel aspect ratio based on industry practices: The standard luma sampling rate of precisely 13+12 MHz. Based on this information:

SMPTE RP 187 further attempted to standardize the pixel aspect ratio values for 480i and 576i. It designated 177:160 for 480i or 1035:1132 for 576i. However, due to significant difference with practices in effect by industry and the computational load that they imposed upon the involved hardware, SMPTE RP 187 was simply ignored. SMPTE RP 187 information annex A.4 further suggested the use of 10:11 for 480i.

As of this writing, ITU-R BT.601-6, which is the latest edition of ITU-R BT.601, still implies that the pixel aspect ratios mentioned above are correct.

Digital video processing

As stated above, ITU-R BT.601 specified that standard-definition television pictures are made of lines of 720 non-square pixels, sampled with a precisely specified sampling rate. A simple mathematical calculation reveals that a 704 pixel width would be enough to contain a 480i or 576i standard 4:3 picture:

Unfortunately, not all standard TV pictures are exactly 4:3: As mentioned earlier, in analog video, the center of a picture is well-defined but the edges of the picture are not standardized. As a result, some analog devices (mostly PAL devices but also some NTSC devices) generated motion pictures that were horizontally (slightly) wider. This also proportionately applies to anamorphic widescreen (16:9) pictures. Therefore, to maintain a safe margin of error, ITU-R BT.601 required sampling 16 more non-square pixels per line (8 more at each edge) to ensure saving all video data near the margins.

This requirement, however, had implications for PAL motion pictures. PAL pixel aspect ratios for standard (4:3) and anamorphic wide screen (16:9), respectively 59:54 and 118:81, were awkward for digital image processing, especially for mixing PAL and NTSC video clips. Therefore, video editing products chose the almost equivalent values, respectively 12:11 and 16:11, which were more elegant and could create PAL digital images at exactly 704 pixels wide, as illustrated:

Inconsistency in defined pixel aspect ratio values

Commonly found on the Internet and in various other published media are numerous sources that introduce different and highly incompatible values as the pixel aspect ratios of various video pictures and video systems. (See the Supplementary sources section.)

To neutrally judge the accuracy and/or feasibility of these sources, please note that as the digital motion picture was invented years after the traditional motion picture, all video pictures targeted for standard definition television and compatible media, digital or otherwise, have (and must have) specifications compatible with standard definition television. Therefore, the pixel aspect ratio of digital video must be calculated from the specification of common traditional equipment rather than the specifications of digital video. Otherwise, any pixel aspect ratio that is calculated from a digital video source is only usable in certain cases for the same kind of video sources and cannot be considered/used as a general pixel aspect ratio of any standard definition television system.

In addition, unlike digital video that has well-defined picture edges, traditional video systems have never standardized a well-defined edge for the picture. Therefore, the pixel aspect ratio of common standard television systems cannot be calculated based on edges of pictures. Such a calculated aspect ratio value would not be entirely wrong, but also cannot be considered as the general pixel aspect ratio of any specific video system. The use of such values would be restricted only to certain cases.

Modern standards and practices

In modern digital imaging systems and high-definition televisions, especially those that comply with SMPTE standards and practices, only square pixels are used for broadcast and display. However, some formats (ex., HDV, DVCPRO HD) use non-square pixels internally for image storage, as a way to reduce the amount of data that must be processed, thus limiting the necessary transfer rates and maintaining compatibility with existing interfaces.

Issues of non-square pixels

Directly mapping an image with a certain pixel aspect ratio on a device whose pixel aspect ratio is different makes the image look unnaturally stretched or squashed in either the horizontal or vertical direction. For example, a circle generated for a computer display with square pixels looks like a vertical ellipse on a standard-definition NTSC television that uses vertically rectangular pixels. This issue is more evident on wide-screen TVs.

Pixel aspect ratio must be taken into consideration by video editing software products that edit video files with non-square pixels, especially when mixing video clips with different pixel aspect ratios. This would be the case when creating a video montage from various cameras employing different video standards (a relatively rare situation). Special effects software products must also take the pixel aspect ratio into consideration, since some special effects require calculation of the distances from a certain point so that they look visually correct. An example of such effects would be radial blur, motion blur, or even a simple image rotation.

Use of pixel aspect ratio

Pixel aspect ratio value is used mainly in digital video software, where motion pictures must be converted or reconditioned to use video systems other than the original. The video player software may use pixel aspect ratio to properly render digital video on screen. Video editing software uses pixel aspect ratio to properly scale and render a video into a new format.

The pixel aspect ratio support is also required to display, without distortion, legacy digital images from computer standards and video-games what existed in the 80s. In that generation, square pixels were too expensive to produce, so machines and video cards like the SNES, CGA, EGA, Hercules, C64, MSX, PC-88, X68000 etc had non-square pixels. [1]

Confusion with display aspect ratio

DVD Flick 1.3.0.7: Example of a computer program that has mislabeled picture aspect ratio as pixel aspect ratio Screenshot of DVD Flick 1.3.0.7 build 738.png
DVD Flick 1.3.0.7: Example of a computer program that has mislabeled picture aspect ratio as pixel aspect ratio

Pixel aspect ratio is often confused with different types of image aspect ratios; the ratio of the image width and height. Due to non-squareness of pixels in Standard-definition TV, there are two types of such aspect ratios: storage aspect ratio (SAR) and display aspect ratio (abbreviated DAR, also known as image aspect ratio and picture aspect ratio ). Also, pixel aspect ratio (PAR) is also known as sample aspect ratio (abbreviated SAR) in some industrial standards (such as H.264 [2] ) and output of programs (such as ffmpeg [3] ). Note the reuse of the abbreviations PAR and SAR. This article uses only the terms pixel aspect ratio, display aspect ratio and storage aspect ratio to avoid ambiguity.

Storage aspect ratio is the ratio of the image width to height in pixels, and can be easily calculated from the video file. Display aspect ratio is the ratio of image width to height (in a unit of length such as centimeters or inches) when displayed on screen, and is calculated from the combination of pixel aspect ratio and storage aspect ratio.

However, users who know the definition of these concepts may get confused as well. Poorly crafted user-interfaces or poorly written documentations can easily cause such confusion: Some video-editing software applications often ask users to specify an "aspect ratio" for their video file, presenting him or her with the choices of "4:3" and "16:9". Sometimes, these choices may be "PAL 4:3", "NTSC 4:3", "PAL 16:9" and "NTSC 16:9". In such situations, the video editing program is implicitly asking for the pixel aspect ratio of the video file by asking for information about the video system from which the video file originated. The program then uses a table (similar to the one below) to determine the correct pixel aspect ratio value.

Generally speaking, to avoid confusion, it can be assumed that video editing products never ask for the storage aspect ratio as they can directly retrieve or calculate it. Non-square-pixel–aware applications also need only to ask for either pixel aspect ratio or display aspect ratio, from either of which they can calculate the other.

Pixel aspect ratios of common video formats

Pixel aspect ratio values for common standard-definition video formats are listed below. Note that for PAL video formats, two different types of pixel aspect ratio values are listed:

  1. Rec.601 , a Rec.601-compliant value, which is considered the real pixel aspect ratio of standard-definition video of that type.
  2. Digital , which is roughly equivalent to Rec.601 and is more suitable to use in Digital Video Editing software.

Note that sources differ on PARs for common formats – for example, 576 lines (PAL) displayed at 4:3 (DAR) corresponds to either PAR of 12:11 (if 704×576, SAR = 11:9), or a PAR of 16:15 (if 720×576, SAR = 5:4). See references for sources giving both, and SDTV: Resolution for a table of storage, display and pixel aspect ratios. Also note that CRT televisions do not have pixels, but scanlines.


Non-square aspect ratios of common video formats
Video systemDARPicture
dimensions
(px × px)
SARPARPAR (decimal)Width (px)Type
PAL 4∶3 704 × 576 72∶59 [N 1] 59∶541.0925769, 385Rec.601
11∶912∶111.09768, 384digital
720 × 576 5∶416∶151.06
16∶9 704 × 576 72∶59 [N 2] 118∶811.4567901231026, 513Rec.601
11∶916∶111.451024, 512digital
720 × 576 5∶464∶451.42
NTSC 4∶3 704 × 480 22∶1510∶110.90640, 320
16∶940∶331.21853, 427
HDV / HDCAM 16∶9 1440 × 1080 4∶34∶31.31920
  1. SAR = DAR : PAR = 4/3 : 59/54 = 72/59  (But it is still not explained why 704 × 576 can have two Storage Aspect Ratios: 11:9 and 72:59)
  2. SAR = DAR : PAR = 16/9 : 118/81 = 72/59 
Sample aspect ratios with predefined indicators in the H.26x family of video codecs
IndexAspect ratio
0unspecified
11∶1
212∶11
310∶11
416∶11
540∶33
624∶11
720∶11
832∶11
980∶33
1018∶11
1115∶11
1264∶33
13160∶99
144∶3
153∶2
162∶1
255extended

Related Research Articles

<span class="mw-page-title-main">MPEG-2</span> Video encoding standard

MPEG-2 is a standard for "the generic coding of moving pictures and associated audio information". It describes a combination of lossy video compression and lossy audio data compression methods, which permit storage and transmission of movies using currently available storage media and transmission bandwidth. While MPEG-2 is not as efficient as newer standards such as H.264/AVC and H.265/HEVC, backwards compatibility with existing hardware and software means it is still widely used, for example in over-the-air digital television broadcasting and in the DVD-Video standard.

<span class="mw-page-title-main">Standard-definition television</span> Digital television with a similar definition to legacy analog systems

Standard-definition television is a television system which uses a resolution that is not considered to be either high or enhanced definition. "Standard" refers to offering a similar resolution to the analog broadcast systems used when it was introduced.

<span class="mw-page-title-main">Video</span> Electronic moving image

Video is an electronic medium for the recording, copying, playback, broadcasting, and display of moving visual media. Video was first developed for mechanical television systems, which were quickly replaced by cathode-ray tube (CRT) systems which, in turn, were replaced by flat panel displays of several types.

<span class="mw-page-title-main">Rec. 601</span> Standard from the International Telecommunication Union

ITU-R Recommendation BT.601, more commonly known by the abbreviations Rec. 601 or BT.601, is a standard originally issued in 1982 by the CCIR for encoding interlaced analog video signals in digital video form. It includes methods of encoding 525-line 60 Hz and 625-line 50 Hz signals, both with an active region covering 720 luminance samples and 360 chrominance samples per line. The color encoding system is known as YCbCr 4:2:2.

<span class="mw-page-title-main">D-1 (Sony)</span> Magnetic tape-based videocassette format

D-1 or 4:2:2 Component Digital is an SMPTE digital recording video standard, introduced in 1986 through efforts by SMPTE engineering committees. It started as a Sony and Bosch - BTS product and was the first major professional digital video format. SMPTE standardized the format within ITU-R 601, also known as Rec. 601, which was derived from SMPTE 125M and EBU 3246-E standards.

Enhanced-definition television, or extended-definition television (EDTV) is a Consumer Electronics Association (CEA) marketing shorthand term for certain digital television (DTV) formats and devices. Specifically, this term defines an extension of the standard-definition television (SDTV) format that enables a clearer picture during high-motion scenes compared to previous iterations of SDTV, but not producing images as detailed as high-definition television (HDTV).

Anamorphic widescreen is a process by which a comparatively wide widescreen image is horizontally compressed to fit into a storage medium with a narrower aspect ratio, reducing the horizontal resolution of the image while keeping its full original vertical resolution. Compatible play-back equipment can then expand the horizontal dimension to show the original widescreen image. This is typically used to allow one to store widescreen images on a medium that was originally intended for a narrower ratio, while using as much of the frame – and therefore recording as much detail – as possible.

<span class="mw-page-title-main">Common Intermediate Format</span>

CIF, also known as FCIF, is a standardized format for the picture resolution, frame rate, color space, and color subsampling of digital video sequences used in video teleconferencing systems. It was first defined in the H.261 standard in 1988.

<span class="mw-page-title-main">Display resolution</span> Number of distinct pixels in each dimension that can be displayed

The display resolution or display modes of a digital television, computer monitor or display device is the number of distinct pixels in each dimension that can be displayed. It can be an ambiguous term especially as the displayed resolution is controlled by different factors in cathode ray tube (CRT) displays, flat-panel displays and projection displays using fixed picture-element (pixel) arrays.

HD-MAC was a broadcast television standard proposed by the European Commission in 1986, as part of Eureka 95 project. It belongs to the MAC - Multiplexed Analogue Components standard family. It is an early attempt by the EEC to provide High-definition television (HDTV) in Europe. It is a complex mix of analogue signal, multiplexed with digital sound, and assistance data for decoding (DATV). The video signal was encoded with a modified D2-MAC encoder.

480p is the shorthand name for a family of video display resolutions. The p stands for progressive scan, i.e. non-interlaced. The 480 denotes a vertical resolution of 480 pixels, usually with a horizontal resolution of 640 pixels and 4:3 aspect ratio or a horizontal resolution of 854 pixels for an approximate 16:9 aspect ratio. Since a pixel count must be a whole number, in Wide VGA displays it is generally rounded up to 854 to ensure inclusion of the entire image. The frames are displayed progressively as opposed to interlaced. 480p was used for many early plasma televisions. Standard definition has always been a 4:3 aspect ratio with a pixel resolution of 720 × 480 at 60 Hz for NTSC regions, and 720 or 768 × 576 for PAL regions. However, standard definition defines a 15.7k Hz horizontal scanrate, which means that interlacing has to be used for those resolution modes. The lowercase letter "p" in 480p stands for progressive, so the two must not be confused.

1080i is a combination of frame resolution and scan type. 1080i is used in high-definition television (HDTV) and high-definition video. The number "1080" refers to the number of horizontal lines on the screen. The "i" is an abbreviation for "interlaced"; this indicates that only the even lines, then the odd lines of each frame are drawn alternately, so that only half the number of actual image frames are used to produce video. A related display resolution is 1080p, which also has 1080 lines of resolution; the "p" refers to progressive scan, which indicates that the lines of resolution for each frame are "drawn" on the screen in sequence.

<span class="mw-page-title-main">480i</span> Standard-definition video mode

480i is the video mode used for standard-definition digital video in the Caribbean, Japan, South Korea, Taiwan, Philippines, Western Sahara, and most of the Americas. The other common standard definition digital standard, used in the rest of the world, is 576i.

<span class="mw-page-title-main">576i</span> Standard-definition video mode

576i is a standard-definition digital video mode, originally used for digitizing analog television in most countries of the world where the utility frequency for electric power distribution is 50 Hz. Because of its close association with the legacy colour encoding systems, it is often referred to as PAL, PAL/SECAM or SECAM when compared to its 60 Hz NTSC-colour-encoded counterpart, 480i.

576p is the shorthand name for a video display resolution. The p stands for progressive scan, i.e. non-interlaced, the 576 for a vertical resolution of 576 pixels. Usually it corresponds to a digital video mode with a 4:3 anamorphic resolution of 720x576 and a frame rate of 25 frames per second (576p25), and thus using the same bandwidth and carrying the same amount of pixel data as 576i, but other resolutions and frame rates are possible.

Overscan is a behaviour in certain television sets, in which part of the input picture is cut off by the visible bounds of the screen. It exists because cathode-ray tube (CRT) television sets from the 1930s to the early 2000s were highly variable in how the video image was positioned within the borders of the screen. It then became common practice to have video signals with black edges around the picture, which the television was meant to discard in this way.

Source Input Format (SIF) defined in MPEG-1, is a video format that was developed to allow the storage and transmission of digital video.

Nominal analogue blanking is the outermost part of the overscan of a standard definition digital television image. It consists of a gap of black pixels at the left and right sides, which correspond to the end and start of the horizontal blanking interval: the front porch at the right side, and the back porch at the left side. Digital television ordinarily contains 720 pixels per line, but only 702 (PAL) to 704 (NTSC) of them contain picture content. The location is variable, since analogue equipment may shift the picture sideways in an unexpected amount or direction.

The aspect ratio of an image is the ratio of its width to its height, and is expressed with two numbers separated by a colon, such as 16:9, sixteen-to-nine. For the x:y aspect ratio, the image is x units wide and y units high. Common aspect ratios are 1.85:1 and 2.39:1 in cinematography, 4:3 and 16:9 in television photography, and 3:2 in still photography.

References

Main sources

Supplementary sources

Notes

  1. "Dot clock rates". pineight.com. Retrieved 2018-09-23.
  2. "Advanced video coding for generic audiovisual services". p. 25 (7). Retrieved 2022-06-10.
  3. "ffprobe shows PAR as SAR". ffmpeg.org. Retrieved 2022-06-10.