Rainbow storage

Last updated

Rainbow storage is a developing paper-based data storage technique first demonstrated by Indian student Sainul Abideen in November 2006. [1] Abideen received his MCA from MES Engineering College Kuttipuram in Kerala's Malappuram District .

Contents

Initial newspaper reports of the technology were disputed by multiple technical sources, although Abideen says those reports were based on a misunderstanding of the technology. The paper meant to demonstrate the capability of storing relatively large amounts of data (and not necessarily in the gigabyte range) using textures and diagrams. [2]

The Rainbow data storage technology claims to use geometric shapes such as triangles, circles and squares of various colors to store a large amount of data on ordinary paper or plastic surfaces. This would provide several advantages over current forms of optical- or magnetic data storage like less environmental pollution due to the biodegradability of paper, low cost and high capacity. Data could be stored on "Rainbow Versatile Disk" (RVD) or plastic/paper cards of any form factor (like SIM cards). [3]

Criticism

Following the wide media attention this news received, some of the claims have been disputed by various[ which? ] experts. [4] [5]

Printing at 1,200 dots per inch (DPI) leads to a theoretical maximum of 1,440,000 colored dots per square inch. If a scanner can reliably distinguish between 256 unique colors (thus encoding one byte per dot), the maximum possible storage is approximately 140 megabytes for a sheet of A4 papermuch lower when the necessary error correction is employed. If the scanner were able to accurately distinguish between 16,777,216 colors (24 bits, or 3 bytes per dot), the capacity would triple, but it still falls well below the media stories' claims of several hundred gigabytes.

Printing this quantity of unique colors would require specialized equipment to generate many spot colors. The process color model used by most printers provides only four colors, with additional colors simulated by a halftone pattern.

At least one of three things must be true for the claim to be valid:

The theory is: If Rainbow's "geometric" algorithm is to be encoded and decoded by a computer, it would equally viable to store the compressed data on a conventional disk rather than printing it to paper or other non-digital medium. Printing something as dots on a page rather than bits on a disk will not change the underlying compression ratio, so a lossless compression algorithm that could store 250 gigabytes within a few hundred megabytes of data would be revolutionary indeed. Likewise, data can be compressed with any algorithm and subsequently printed to paper as colored dots. The amount of data that can be reliably stored in this way is limited by the printer and scanner, as described above.

However Sainul Abideen says that the articles are based on misunderstandings. He claims, it as a method to store data in the form of colour, in any medium where colour can be represented, not only paper. Density of storage in paper will be very small and the density will be depends on the storage medium, capacity of colour representation and retrieval methods etc.

Demonstrations

Sainul Abdeen demonstrated his technology to the college and members of the Indian press in the MES College of Engineering computer lab, Kerala, and was able to compress 450 sheets plain text from foolscap paper into a 1-inch square. He also demonstrated a 45-second audio clip compressed using this technology on to an A4 sheet. Depending on the sampling frequency, bit depth, and audio compression (if any), a 45-second audio clip can consist of anywhere from a few kilobytes to a few megabytes of data. Abideen claimed that the technology could be extended to 250 gigabytes by using specific materials and devices. [ citation needed ]

This technology is based on two principles:

Principle I
“Every color or color combinations can be converted into some values and from the values the colors or color combinations can be regenerated”.
Principle II
“Every different color or color combinations will produce different values”.

Related Research Articles

In signal processing, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compression reduces bits by identifying and eliminating statistical redundancy. No information is lost in lossless compression. Lossy compression reduces bits by removing unnecessary or less important information. Typically, a device that performs data compression is referred to as an encoder, and one that performs the reversal of the process (decompression) as a decoder.

The gigabyte is a multiple of the unit byte for digital information. The prefix giga means 109 in the International System of Units (SI). Therefore, one gigabyte is one billion bytes. The unit symbol for the gigabyte is GB.

JPEG Lossy compression method for digital images

JPEG or JPG is a commonly used method of lossy compression for digital images, particularly for those images produced by digital photography. The degree of compression can be adjusted, allowing a selectable trade-off between storage size and image quality. JPEG typically achieves 10:1 compression with little perceptible loss in image quality. Since its introduction in 1992, JPEG has been the most widely used image compression standard in the world, and the most widely used digital image format, with several billion JPEG images produced every day as of 2015.

Lossy compression Data compression approach that reduces data size while discarding or changing some of it

In information technology, lossy compression or irreversible compression is the class of data encoding methods that uses inexact approximations and partial data discarding to represent the content. These techniques are used to reduce data size for storing, handling, and transmitting content. The different versions of the photo of the cat on this page show how higher degrees of approximation create coarser images as more details are removed. This is opposed to lossless data compression which does not degrade the data. The amount of data reduction possible using lossy compression is much higher than using lossless techniques.

Lossless compression is a class of data compression algorithms that allows the original data to be perfectly reconstructed from the compressed data. By contrast, lossy compression permits reconstruction only of an approximation of the original data, though usually with greatly improved compression rates.

The megabyte is a multiple of the unit byte for digital information. Its recommended unit symbol is MB. The unit prefix mega is a multiplier of 1000000 (106) in the International System of Units (SI). Therefore, one megabyte is one million bytes of information. This definition has been incorporated into the International System of Quantities.

Portable Network Graphics Family of lossless compression file formats for image files

Portable Network Graphics is a raster-graphics file format that supports lossless data compression. PNG was developed as an improved, non-patented replacement for Graphics Interchange Format (GIF).

PCX, standing for PiCture eXchange, is an image file format developed by the now-defunct ZSoft Corporation of Marietta, Georgia, United States. It was the native file format for PC Paintbrush and became one of the first widely accepted DOS imaging standards, although it has since been succeeded by more sophisticated image formats, such as BMP, JPEG, and PNG. PCX files commonly stored palette-indexed images ranging from 2 or 4 colors to 16 and 256 colors, although the format has been extended to record true-color (24-bit) images as well.

Raster graphics Dot matrix data structure

In computer graphics and digital photography, a raster graphic is a dot matrix data structure that represents a generally rectangular grid of pixels, viewable via a computer display, paper, or other display medium. Raster images are stored in image files with varying dissemination, production, generation, and acquisition formats.

Laser printing Electrostatic digital printing process

Laser printing is an electrostatic digital printing process. It produces high-quality text and graphics by repeatedly passing a laser beam back and forth over a negatively charged cylinder called a "drum" to define a differentially charged image. The drum then selectively collects electrically charged powdered ink (toner), and transfers the image to paper, which is then heated in order to permanently fuse the text, imagery, or both, to the paper. As with digital photocopiers, laser printers employ a xerographic printing process. Laser printing differs from traditional xerography as implemented in analog photocopiers in that in the latter, the image is formed by reflecting light off an existing document onto the exposed drum.

TIFF Series of image file formats

Tag Image File Format, abbreviated TIFF or TIF, is a computer file format for storing raster graphics images, popular among graphic artists, the publishing industry, and photographers. TIFF is widely supported by scanning, faxing, word processing, optical character recognition, image manipulation, desktop publishing, and page-layout applications. The format was created by the Aldus Corporation for use in desktop publishing. It published the latest version 6.0 in 1992, subsequently updated with an Adobe Systems copyright after the latter acquired Aldus in 1994. Several Aldus or Adobe technical notes have been published with minor extensions to the format, and several specifications have been based on TIFF 6.0, including TIFF/EP, TIFF/IT, TIFF-F and TIFF-FX.

Compression artifact Distortion of media caused by lossy data compression

A compression artifact is a noticeable distortion of media caused by the application of lossy compression. Lossy data compression involves discarding some of the media's data so that it becomes small enough to be stored within the desired disk space or transmitted (streamed) within the available bandwidth. If the compressor cannot store enough data in the compressed version, the result is a loss of quality, or introduction of artifacts. The compression algorithm may not be intelligent enough to discriminate between distortions of little subjective importance and those objectionable to the user.

Image scanner Device that optically scans images, printed text

An image scanner—often abbreviated to just scanner, is a device that optically scans images, printed text, handwriting or an object and converts it to a digital image. Commonly used in offices are variations of the desktop flatbed scanner where the document is placed on a glass window for scanning. Hand-held scanners, where the device is moved by hand, have evolved from text scanning "wands" to 3D scanners used for industrial design, reverse engineering, test and measurement, orthotics, gaming and other applications. Mechanically driven scanners that move the document are typically used for large-format documents, where a flatbed design would be impractical.

A space–time or time–memory trade-off in computer science is a case where an algorithm or program trades increased space usage with decreased time. Here, space refers to the data storage consumed in performing a given task, and time refers to the time consumed in performing a given task.

Color digital images are made of pixels, and pixels are made of combinations of primary colors represented by a series of code. A channel in this context is the grayscale image of the same size as a color image, made of just one of these primary colors. For instance, an image from a standard digital camera will have a red, green and blue channel. A grayscale image has just one channel.

Image file formats are standardized means of organizing and storing digital images. An image file format may store data in an uncompressed format, a compressed format, or a vector format. Image files are composed of digital data in one of these formats so that the data can be rasterized for use on a computer display or printer. Rasterization converts the image data into a grid of pixels. Each pixel has a number of bits to designate its color. Rasterizing an image file for a specific device takes into account the number of bits per pixel that the device is designed to handle.

Bigtable is a compressed, high performance, proprietary data storage system built on Google File System, Chubby Lock Service, SSTable and a few other Google technologies. On May 6, 2015, a public version of Bigtable was made available as a service. Bigtable also underlies Google Cloud Datastore, which is available as a part of the Google Cloud Platform.

Magnetic tape data storage is a system for storing digital information on magnetic tape using digital recording.

Paper data storage refers to the use of paper as a data storage device. This includes writing, illustrating, and the use of data that can be interpreted by a machine or is the result of the functioning of a machine. A defining feature of paper data storage is the ability of humans to produce it with only simple tools and interpret it visually.

Virtual memory compression is a memory management technique that utilizes data compression to reduce the size or number of paging requests to and from the auxiliary storage. In a virtual memory compression system, pages to be paged out of virtual memory are compressed and stored in physical memory, which is usually random-access memory (RAM), or sent as compressed to auxiliary storage such as a hard disk drive (HDD) or solid-state drive (SSD). In both cases the virtual memory range, whose contents has been compressed, is marked inaccessible so that attempts to access compressed pages can trigger page faults and reversal of the process. The footprint of the data being paged is reduced by the compression process; in the first instance, the freed RAM is returned to the available physical memory pool, while the compressed portion is kept in RAM. In the second instance, the compressed data is sent to auxiliary storage but the resulting I/O operation is smaller and therefore takes less time.

References

  1. "Data Can Now Be Stored on Paper" by M. A. Siraj, ArabNews (published November 18, 2006; accessed November 29, 2006)
  2. "Paper storage man misunderstood". The Inquirer . 12 December 2006. Archived from the original on 2007-02-23.CS1 maint: unfit URL (link)
  3. "Store 256GB on an A4 sheet" by Chris Mellor, Techworld (published November 24, 2006; accessed November 29, 2006)
  4. IT Soup: Scam of Indian student developing technology to store 450 GB of data on a sheet of paper By ITSoup (published November 25, 2006; accessed November 25, 2006)
  5. "Can you get 256GB on an A4 sheet? No way!" By Chris Mellor, Techworld (published November 24, 2006; accessed November 29, 2006)

Absolute Rainbow Dots

Absolute rainbow dots are used to detect errors caused by scratches, and whether any fading has occurred. Absolute rainbow dots are predefined dots carrying a unique value. These dots can be inserted in the rainbow picture in pre-specified areas. If fading occurs these dot values will change accordingly, and at the reproduction stage this can be checked and corrected. Absolute rainbow dots will be microscopically small so that they occupy very little space in the rainbow picture. These will be colored differently so that each dot will have its own fixed unique value.