Photo CD | |
---|---|
Media type | Optical disc |
Encoding | PhotoYCC |
Capacity | Up to 2500 photos in 256 × 384 resolution |
Read mechanism | 780 nm wavelength semiconductor laser |
Standard | Beige Book |
Developed by | Eastman Kodak |
Usage | Picture storage |
Extended from | White Book |
Extended to | Picture CD |
Released | 1991 |
Photo CD is a system designed by Kodak for digitizing and saving photos onto a CD. Launched in 1991, [1] the discs were designed to hold nearly 100 high-quality images (scanned prints and slides) using a special proprietary encoding. Photo CDs are defined in the Beige Book and also conform to the CD-ROM XA and CD-i Bridge specifications. They were intended to play on CD-i players, Photo CD players (Apple's PowerCD, for example), and any computer with suitable software (LaserSoft Imaging's SilverFast DC or HDR, for example).
The system failed to gain mass usage among consumers, partly due to its proprietary nature, rapidly decreasing scanner prices, and the lack of CD-ROM drives in most home personal computers of the day. Furthermore, Photo CD relied on CRT-based TV sets for home use. These were designed for moving pictures, however, and their typical flicker became an issue when viewing still photographs. The Photo CD system nevertheless gained a fair level of acceptance among professional photographers due to the low cost of its high-quality film scans. Prior to Photo CD, professionals who wished to digitize their film images were forced to pay much higher fees to obtain drum scans of their film negatives and transparencies. Both JPEG [2] and JPEG 2000 [3] support the PhotoYCC color space used in Photo CD files, which is described below.
Designation | Resolution (px × px) | Resolution (Mpx) | Uncompressed size (MB) | Intended typical use |
---|---|---|---|---|
Base/16 | 128 × 192 | 0.025 | 0.07 | Preview (index print, thumbnail) |
Base/4 | 256 × 384 | 0.098 | 0.28 | Web |
Base | 512 × 768 | 0.393 | 1.13 | Computer screen, TV, Web |
4 Base | 1024 × 1536 | 1.573 | 4.50 | HDTV screen |
16 Base | 2048 × 3072 | 6.291 | 18.00 | Print-out up to ca. 20 × 30 cm |
64 Base | 4096 × 6144 | 25.166 | 72.00 | Professional print, pre-press, archiving (optional) |
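The figures in the table follow from simple arithmetic, which the short Python sketch below reproduces. It assumes that each level doubles both dimensions of the level below it, that an uncompressed pixel takes 3 bytes, that megapixels are decimal (10^6) and that megabytes are binary (2^20); these assumptions are inferred from the tabulated values rather than stated by Kodak, and the function names are illustrative only.

```python
# Designations as used in the table above, from smallest to largest.
LEVELS = ["Base/16", "Base/4", "Base", "4 Base", "16 Base", "64 Base"]


def level_dimensions(index):
    """Return (short side, long side) in pixels for LEVELS[index]."""
    scale = 2 ** index  # assumption: each level doubles both dimensions
    return 128 * scale, 192 * scale


def megapixels(short_side, long_side):
    """Pixel count in decimal megapixels."""
    return short_side * long_side / 1_000_000


def uncompressed_size_mb(short_side, long_side, bytes_per_pixel=3):
    """Uncompressed size in binary megabytes, assuming 3 bytes per pixel."""
    return short_side * long_side * bytes_per_pixel / 2 ** 20


for i, name in enumerate(LEVELS):
    short_side, long_side = level_dimensions(i)
    print(
        f"{name:8s} {short_side} x {long_side}  "
        f"{megapixels(short_side, long_side):.3f} Mpx  "
        f"{uncompressed_size_mb(short_side, long_side):.2f} MB"
    )
```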
The Kodak Pro Photo CD Master Disc contains 25 images with a maximum resolution of 6144 × 4096 pixels (six resolutions per file, Base/16 to 64 Base). This type is appropriate for 120 film and 4x5 sheet film, but also for 35 mm film if the highest resolution is required.
Separate from the Photo CD format is Kodak's proprietary "Portfolio CD" format, which combines Red Book CD audio and Beige Book PCD with interactive menus and hotspots on PCD images. Some standalone Philips Photo/Audio CD players could play Portfolio CDs, and a Windows player application was freely available. The Kodak Portfolio CD is not defined in any particular Rainbow Book.
The Photo CD system was announced by Kodak in 1990. [4] Photo CD targeted a full range of photographic needs, ranging from consumer level point-and-shoot cameras to high-end professionals using large format 4x5 sheet film. The first Photo CD products, including scanners for processing labs and Photo CD players for consumers, became available in 1992. [4] The project was expected to be a $600 million business by 1997 with $100 million in operational earnings. [5] Kodak entered into a number of partnerships in order to grow the usage of Photo CD. This included, for example, an arrangement with L.L. Bean in 1992 by which the catalog would be distributed in Photo CD format, [6] and an arrangement with Silicon Graphics in 1993 to make all Silicon Graphics image-processing workstations capable of accepting Kodak Photo CD optical disks. [7] These measures, together with the then relatively low cost of $3 per image and convenience, made Photo CD the digital imaging solution of choice for many photographers in the mid to late 1990s. [8]
By 2000, over 140 Photo CD processing labs in the U.S. were active, with many more outside the U.S. [8] However, by the late 1990s, Photo CD was being eclipsed by alternative formats, mainly based on the industry standard JPEG format. [9] In the consumer segment, the Photo CD format's relatively inefficient compression scheme meant that Photo CD files were significantly larger than JPEG files of similar quality, and thus less convenient to transmit over the internet. For example, a 16Base Photo CD image of 5.5 Mb can be encoded as a JPEG image of 2.1 Mb at 80% quality, visually indistinguishable from the original. [10] When the Photo CD format was designed in the early 1990s, a design goal was to allow low cost playback-to-TV devices. At that time the available technology precluded 2-dimensional compression schemes such as JPEG, but by the late 1990s, advances in microprocessor technology had moved JPEG/PNG compression to well within the range of even very low cost consumer electronics.
In the professional and advanced amateur segments, Photo CD had been eclipsed by low cost desktop scanners such as those from Nikon and Minolta in the mid range, and by drum scanners at the very high end. [11] While the pixel resolution of Photo CD was still comparable to or better than that of the alternatives, Photo CD suffered from a number of other disadvantages. Firstly, the Photo CD color space, designed for TV display, is smaller than what can be achieved by even a low cost desktop scanner. Secondly, the color rendition of Photo CD images changed over time and with different scanner versions; 4050 scanners had a different color rendition from earlier versions. [12] [13] Thirdly, the dynamic range of scans was lower than for desktop scanners. Tests at the time indicated that the Dmax rating (a measure of the maximum density obtainable) of Photo CD was 2.8-3.0, while commonly available desktop scanners were reaching 4.2, [14] a substantial difference. As a result of this, and of Photo CD's problems with color rendering, by 2004 the professional segment of the user community had generally turned against Photo CD. [11]
In the retail segment, while Photo CD was initially relatively popular with consumers, it was largely an economic failure for processing labs. At the time of its introduction, Kodak claimed that processing costs to labs would be close to $1 per image, [8] which would allow a lab to sell profitably at the $3 per image mark. However, this promise was never realized, and the scanning process was often rushed as a result, with a corresponding fall in quality. [12] As a result of Photo CD's loss of market share and substantial corporate losses, partially attributed by Kodak management to its scanning business, [15] Kodak abandoned the format over the period 2001-2004. By 2004, more than one processing lab was offering Kodak 4050 Photo CD scanners for free to anyone who would pay for their removal. [16] This abandonment generated considerable controversy both at the time and subsequently, as the Photo CD format's technical specifications have never been released by Kodak. Within photographic circles, Photo CD remains an often quoted example of an “orphan format” and of the dangers of proprietary image formats. [17] [18] [19]
Despite Kodak not releasing the specifications for the Photo CD format, it has been reverse engineered, thus allowing images to be converted to more modern formats. The original reverse engineering work was performed by Hadmut Danisch of the University of Karlsruhe, who deciphered the format by studying hex dumps of Photo CD files, and subsequently wrote hpcdtoppm, which converts Photo CD Images to PPM format, in the early 1990s. [20] [21] During the early 1990s, hpcdtoppm was extensively distributed as part of various Linux distributions, but has since been almost entirely abandoned due to concerns about hpcdtoppm's restrictive license conditions [22] [23] [24] and lack of color management. However, Hadmut Danisch's reverse engineering work has been used to create a number of other open source implementations of Photo CD decoders such as ImageMagick. [25] In 2009, pcdtojpeg was created under the GPL open source licence. Also acknowledged by the author to be based on Hadmut Danisch's reverse engineering work, pcdtojpeg allows Photo CD metadata to be decoded, is color managed, and can decode all known variants of Photo CD files. [26] Currently hpcdtoppm and pcdtojpeg together form the core of information available in the public domain on the technical details of the Photo CD format.
Photo CD images are stored as a hierarchy of components ranging from Base/16 to 64Base, which correspond to the image resolutions that can be reconstructed. [27] The Base image, which has a luma resolution of 512 lines by 768 pixels, is typically used for TV systems. The higher-resolution images are usually used for photographic imaging. The lower-resolution images are usually used for file indexes and thumbnails. The components used to create the Base/16 through 16Base resolutions are stored in a single file called an Image Pac. The Base, Base/4, and Base/16 images are stored uncompressed in order to allow easy extraction for display. 4Base and 16Base images are stored compressed, and as deltas (incremental images) from the resolution below them. Note that all resolutions up to the maximum resolution in a file (actually an Image Pac) are present simultaneously. So, for example, a 16Base file contains a Base/16, Base/4, Base and 4Base image as well as the 16Base components. The sixth component, 64Base, is stored in separate files on the Photo CD as part of the IMAGE PAC Extension (IPE). This only exists on Photo CD Pro Master discs.
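PCD images use a gamma-transformed PhotoYCC encoding. [27] [28] To encode a scanned image into a Photo CD file, the first step is to preshape the linear RGB data with a non-linear transform (the Rec. 709 OETF extended to negative inputs as an odd function, as in the later xvYCC [29]), applied separately to each of R, G and B:

$$
R' = \begin{cases}
\alpha R^{0.45} - (\alpha - 1), & R \ge \beta \\
4.5\,R, & -\beta < R < \beta \\
-\left(\alpha\,|R|^{0.45} - (\alpha - 1)\right), & R \le -\beta
\end{cases}
$$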
In Kodak documentation, α = 1.099, β = 0.018. However, the full standard values (1.099296826809442, 0.018053968510807) provide mathematical continuity. [30]
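The continuity claim can be checked numerically. The sketch below assumes the usual Rec. 709 form of the transfer function (a linear segment of slope 4.5 below β and a power segment with exponent 0.45 above it) and measures the jump between the two segments at the breakpoint; the function name `oetf_gap` is illustrative.

```python
def oetf_gap(alpha, beta):
    """Size of the jump between the linear and power segments at v = beta."""
    linear_segment = 4.5 * beta
    power_segment = alpha * beta ** 0.45 - (alpha - 1.0)
    return abs(power_segment - linear_segment)


# Rounded values from Kodak documentation leave a small discontinuity.
print(oetf_gap(1.099, 0.018))

# The full-precision values make the two segments meet.
print(oetf_gap(1.099296826809442, 0.018053968510807))
```

With the rounded constants the gap is on the order of 10^-4, which is well below one 8-bit code value, so the difference matters mainly for mathematical exactness rather than for visible results.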
As a result of this function, colors outside the gamut defined by the CCIR 709 [a] primaries are encoded as negative values. [27] Linear RGB values range from -0.20 to 2.00, while nonlinear R'G'B' values range from -0.43357 to 1.402278. [citation needed] Reference white (a perfect, non-fluorescent, white-reflecting diffuser in the original scene) is 1.0, just as in BT.709.
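The preshaped R'G'B' values are then converted to a luminance component and two chrominance components via a CCIR 601-1-like matrix: [27]

$$
\begin{aligned}
Y' &= 0.299\,R' + 0.587\,G' + 0.114\,B' \\
C_1 &= B' - Y' = -0.299\,R' - 0.587\,G' + 0.886\,B' \\
C_2 &= R' - Y' = 0.701\,R' - 0.587\,G' - 0.114\,B'
\end{aligned}
$$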
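Finally, the luminance component Y' and the chrominance components C1 = B' - Y' and C2 = R' - Y' are scaled to 8-bit values by the following equations: [27]

$$
\begin{aligned}
Y'_{8} &= \frac{255}{1.402}\,Y' \\
C_{1,8} &= 111.40\,C_1 + 156 \\
C_{2,8} &= 135.64\,C_2 + 137
\end{aligned}
$$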
Kodak claims that "the scale factors and offsets for the C1 and C2 channels result from the distribution of real world colors". [27] Given the 8-bit integer range of each of the three components, the representable values are 0 ≤ Y' ≤ 1.402, -1.40036 ≤ B' - Y' ≤ 0.888689 and -1.01003 ≤ R' - Y' ≤ 0.86995.
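The three encoding steps can be put together in a short Python sketch. This is an illustration under stated assumptions, not Kodak's implementation: the preshaping function uses the rounded constants α = 1.099 and β = 0.018 with the usual Rec. 709 slope 4.5 and exponent 0.45, the luma weights are the standard CCIR 601 values, and the scale factors (255/1.402, 111.40, 135.64) and offsets (156, 137) are the ones implied by the component ranges quoted above. The function names are illustrative.

```python
def preshape(v, alpha=1.099, beta=0.018):
    """Odd-symmetric Rec. 709-style transfer function for one linear channel."""
    if v >= beta:
        return alpha * v ** 0.45 - (alpha - 1.0)
    if v <= -beta:
        return -(alpha * (-v) ** 0.45 - (alpha - 1.0))
    return 4.5 * v


def to_byte(x):
    """Round to the nearest integer and clamp to the 8-bit range."""
    return max(0, min(255, round(x)))


def encode_photoycc(r, g, b):
    """Encode linear RGB (1.0 = reference white) as 8-bit (Y, C1, C2)."""
    rp, gp, bp = preshape(r), preshape(g), preshape(b)
    luma = 0.299 * rp + 0.587 * gp + 0.114 * bp  # assumed CCIR 601 weights
    c1 = bp - luma  # blue-difference chroma
    c2 = rp - luma  # red-difference chroma
    return (
        to_byte(255.0 / 1.402 * luma),
        to_byte(111.40 * c1 + 156.0),
        to_byte(135.64 * c2 + 137.0),
    )


print(encode_photoycc(0.0, 0.0, 0.0))  # black
print(encode_photoycc(1.0, 1.0, 1.0))  # reference white
print(encode_photoycc(2.0, 2.0, 2.0))  # brightest encodable neutral
```

Under these assumptions reference white lands at a luma code of 182 rather than 255, which leaves the codes above it for values brighter than 100% white.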
An oddity of this encoding scheme is that it allows Photo CD images to represent colors which are above 100% (up to 140.2%), "whiter than white". [31] At the time of Photo CD's introduction, this allowed the display of images on the analog televisions of the day to show brighter, more vivid colors than would otherwise be the case. This was acceptable as the analog televisions of the day were designed to be driven beyond their 100% level without abruptly clipping. However, this aspect of Photo CD encoding is not compatible with modern image formats, and is the cause of the well documented "Blown Highlights" problem when converting Photo CD images to more modern formats. [32] [33] All modern photo formats enforce hard clipping at 100% (video formats in YUV still have headroom that is present on high level TVs), resulting in highlight clipping in converted images unless some form of Photo CD specific luminance and chrominance mapping is performed. [32]
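The highlight problem can be illustrated with a few lines of Python. The sketch assumes that an 8-bit luma code of 255 corresponds to 1.402 (140.2% of reference white), as the figures above imply. The `soft_compress` function is a purely hypothetical roll-off invented for this illustration; it is not the mapping used by pcdtojpeg or any other converter, and real converters may operate on different quantities.

```python
def decode_luma(code):
    """Map an 8-bit luma code to a nonlinear luma value (1.0 = 100% white)."""
    return code * 1.402 / 255.0


def hard_clip(y):
    """What a naive conversion does: everything above 100% becomes 100%."""
    return min(y, 1.0)


def soft_compress(y, knee=0.8, peak=1.402):
    """Hypothetical roll-off: identity below the knee, then [knee, peak]
    is squeezed linearly into [knee, 1.0] so highlight detail survives."""
    if y <= knee:
        return y
    return knee + (y - knee) * (1.0 - knee) / (peak - knee)


for code in (150, 200, 255):
    y = decode_luma(code)
    print(code, round(y, 3), hard_clip(y), round(soft_compress(y), 3))
```

With hard clipping, codes 200 and 255 both end up at 1.0, so any detail between them is lost; the roll-off keeps them distinct at the cost of some highlight contrast.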
In addition, although "toeroom" (Y < 0) is not provided, Kodak expressly allows the use of R, G, B < 0, allowing out-of-gamut (for Rec. 709) colors to be expressed. [27] The effect is similar to xvYCC, which came much later.
Photo CD images use three forms of compression in order to reduce image storage requirements. [27] Firstly, chroma subsampling reduces the size of the images by approximately 50%. This subsampling is by a factor of 4 for 4Base images, and a factor of 2 (4:2:0) for all other resolutions. Secondly an additional reduction in size is achieved by decomposing the highest-resolution image data, and storing the 4Base, 16Base and 64Base components as residuals (differences from pixels at the previous level of resolution). Thirdly and finally, the Photo CD system employs a form of quantization and Huffman coding to further compress this residual data. This Huffman encoding is performed on an image-row-by-image-row basis. The Huffman tables are encoded into the Photo CD image itself, and have different lengths depending on the compression class. These Huffman classes are: [34]
For example, to recompose an image of 1024 lines by 1536 pixels, the 512-line by 768-pixel luma Base image (which is neither a residual nor Huffman compressed) is interpolated to a 1024-line by 1536-pixel image. A 1024-line by 1536-pixel 4Base residual is then decompressed from its Huffman-encoded form, and its elements are added to each corresponding pixel. The resulting image contains detail for the full 1024-line by 1536-pixel sampled image. To recompose an image to a resolution of 2048 lines by 3072 pixels, the process is essentially repeated, using both the 4Base and 16Base residuals. A similar operation is subsequently used to recompose the 4096-line by 6144-pixel resolution. In each step, an identical process is applied to the chroma channels.
Kodak based the PhotoYCC scheme on existing video standards, notably on aspects of CCIR Recommendation 601, used with PAL and NTSC digital television systems, and on CCIR Recommendation 709 (now ITU-R Recommendation BT.709), used for HDTV. PhotoYCC is defined in a manner that is not constrained by the limitations of the actual video display. [27] [31]
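The recomposition loop described above can be sketched as follows for a single channel. The sketch assumes that the residuals have already been Huffman-decoded into signed integers, and it uses pixel replication as the interpolation step purely for simplicity; the text does not specify the interpolation filter, so that choice and all function names here are assumptions.

```python
def upsample_2x(image):
    """Double both dimensions by pixel replication (assumed interpolation)."""
    out = []
    for row in image:
        wide = [value for value in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def add_residual(predicted, residual):
    """Add a signed residual to each pixel and clamp to the 8-bit range."""
    return [
        [max(0, min(255, p + d)) for p, d in zip(p_row, d_row)]
        for p_row, d_row in zip(predicted, residual)
    ]


def reconstruct(base, residuals):
    """Apply each residual in turn, e.g. [4Base, 16Base] on top of Base."""
    image = base
    for residual in residuals:
        image = add_residual(upsample_2x(image), residual)
    return image


# Tiny stand-in for a Base channel (1 x 2) and one decoded residual (2 x 4).
base = [[10, 20]]
residual = [[1, -1, 0, 2], [0, 0, 5, -30]]
print(reconstruct(base, [residual]))
```

Passing two residuals of the appropriate sizes would repeat the step twice, mirroring the Base to 4Base to 16Base sequence.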
In practice the color space of Photo CD images varies significantly from Rec. 709. Firstly, the Photo CD encoding scheme allows greater than 100% values for color components, thus allowing Photo CD images to display colors outside of the nominal Rec. 709 gamut. [32] In addition, in order to achieve accurate color reproduction, especially when scanning reversal film, Kodak found it necessary to provide ICC color profiles specific to film type and scanner. [35] As a result, by the time that the Photo CD format fell into disuse, five different color spaces were in common use in Photo CD images (PCD 4050 is a Kodak scanner model number):
Thus, while the use of the generic Photo CD color space will provide color reproduction that is acceptable for many purposes e.g., Web viewing of thumbnail images, in order to achieve fully accurate color reproduction, e.g., for photographic purposes, any Photo CD display or conversion software must use the correct color profile for the combination of the original media and scanner model.
For practical use, any Photo CD images will have to be converted to a modern format such as JPEG, TIFF or DNG. While there are a large number of image conversion programs available, both commercial and freeware or open source, most are capable of only basic, low resolution (often only to Base resolution, 512x768) conversions. The main distinguishing features between programs as regards Photo CD capabilities are: