Machine Identification Code

Last updated

Yellow dots on white paper, produced by color laser printer (enlarged, dot diameter about 0.1 mm) Printermarkrp.jpg
Yellow dots on white paper, produced by color laser printer (enlarged, dot diameter about 0.1 mm)

A Machine Identification Code (MIC), also known as printer steganography, yellow dots, tracking dots or secret dots, is a digital watermark which certain color laser printers and copiers leave on every printed page, allowing identification of the device which was used to print a document and giving clues to the originator. Developed by Xerox and Canon in the mid-1980s, its existence became public only in 2004. In 2018, scientists developed privacy software to anonymize prints in order to support whistleblowers publishing their work. [1] [2] [3]

Contents

History

In the mid-1980s Xerox pioneered an encoding mechanism for a unique number represented by tiny dots spread over the entire print area. Xerox developed this machine identification code (MIC) "to assuage fears that their color copiers could be used to counterfeit bills" [4] and received U.S. Patent No 5515451 describing the use of the yellow dots to identify the source of a copied or printed document. [5] [6]

The public first became aware of MIC in October 2004, when it was used by Dutch authorities to track counterfeiters who had used a Canon color laser printer. [7] In November 2004, PC World reported the machine identification code had been used for decades in some printers, allowing law enforcement to identify and track counterfeiters. [4] The Central Bank Counterfeit Deterrence Group (CBCDG) has denied that it developed the feature. [5]

The decoding process discovered by the EFF. Machine Identification Code von Druckern.png
The decoding process discovered by the EFF.

In 2005, the civil rights group Electronic Frontier Foundation (EFF) encouraged the public to send in sample printouts and subsequently decoded the pattern. [8] The pattern has been demonstrated on a wide range of printers from different manufacturers and models. [9] The EFF stated in 2015 that the documents that they previously received through the FOIA [10] suggested that all major manufacturers of color laser printers entered a secret agreement with governments to ensure that the output of those printers is forensically traceable. [11]

In 2007, the European Parliament was asked about the question of invasion of privacy. [12] [5]

Technical aspects

Yellow dots produced by an HP Color LaserJet CP1515n MIC 60x.jpg
Yellow dots produced by an HP Color LaserJet CP1515n

The pattern consists of a dot-matrix spread of yellow dots, which can barely be seen with the naked eye. The dots have a diameter of a tenth of a millimeter (0.004") and a spacing of about one millimeter (0.039"). Their arrangement encodes the serial number of the device, date and time of the printing, and is repeated several times across the printing area in case of errors. For example, if the code consists of 8 × 16 dots in a square or hexagonal pattern, it spreads over a surface of about 4 square centimetres (0.62 sq in) and appears on a sheet of size A4 paper about 150 times. Thus, it can be analyzed even if only fragments or excerpts are available. Some printers arrange yellow dots in seemingly random point clouds.

According to the Chaos Computer Club in 2005, color printers leave the code in a matrix of 32 × 16 dots and thus can store 64 bytes of data (64×8). [13]

As of 2011, Xerox was one of the few manufacturers to draw attention to the marked pages, stating in a product description, "The digital color printing system is equipped with an anti-counterfeit identification and banknote recognition system according to the requirements of numerous governments. Each copy shall be marked with a label which, if necessary, allows identification of the printing system with which it was created. This code is not visible under normal conditions." [14]

In 2018, scientists at the TU Dresden analyzed the patterns of 106 printer models from 18 manufacturers and found four different encoding schemes. [2]

Visibility

Yellow Dots: tiny yellow dots on the print-out representing the hidden code of an HP Color LaserJet 3700. HP Color Laserjet 3700 schutz g.jpg
Yellow Dots: tiny yellow dots on the print-out representing the hidden code of an HP Color LaserJet 3700.

The Machine Identification Code can be made visible by printing or copying a page and subsequently scanning a small section with a high-resolution scanner. The yellow color channel can then be enhanced with an image processing program, to make any dots of the MIC clearly visible. Under good lighting conditions, a magnifying glass may be enough to see the pattern. Under UV-light the yellow dots are clearly recognizable. [15]

Machine Identification Code (MIC, yellow dots, tracking dots, secret dots) under UV-light in regular (red and blue markings) and irregular arrangement (green). Machine Identification Code.png
Machine Identification Code (MIC, yellow dots, tracking dots, secret dots) under UV-light in regular (red and blue markings) and irregular arrangement (green).

Using this steganographic process, high-quality copies of an original (e.g. a bank note) under blue light can be made identifiable. Using this process, even shredded prints can be identified: the 2011 "Shredder Challenge" initiated by the DARPA was solved by a team called "All Your Shreds Are Belong To U.S." consisting of Otavio Good and two colleagues. [16] [17]

Practical application

Both journalists and security experts have suggested that The Intercept 's handling of the leaks by whistleblower Reality Winner, which included publishing secret NSA documents unredacted and including the printer tracking dots, was used to identify Winner as the leaker, leading to her arrest in 2017 and conviction. [18] [19]

Protection of privacy and circumvention

Copies or printouts of documents with confidential personal information, for example health care information, account statements, tax declaration or balance sheets, can be traced to the owner of the printer and the inception date of the documents can be revealed. This traceability is unknown to many users and inaccessible, as manufacturers do not publicize the code that produces these patterns. It is unclear which data may be unintentionally passed on with a copy or printout. In particular, there are no mentions of the technique in the support materials of most affected printers. In 2005 the Electronic Frontier Foundation (EFF) sought a decoding method and made available a Python script for analysis. [20]

In 2018, scientists from TU Dresden developed and published a tool to extract and analyze the steganographic codes of a given color printer and subsequently to anonymize prints from that printer. The anonymization works by printing additional yellow dots on top of the Machine Identification Code. [1] [2] [3] The scientists made the software available to support whistleblowers in their efforts to publicize grievances. [21]

Comparable processes

Other methods of identification are not as easily recognizable as yellow dots. For example, a modulation of laser intensity and a variation of shades of grey in texts are already feasible. As of 2006, it was unknown whether manufacturers were also using these techniques. [22]

See also

Related Research Articles

<span class="mw-page-title-main">Printer (computing)</span> Computer peripheral that prints text or graphics

In the field of computing, a printer is considered a peripheral device that serves the purpose of creating a permanent representation of text or graphics, usually on paper. While the majority of outputs produced by printers are readable by humans, there are instances where barcode printers have found a utility beyond this traditional use. Different types of printers are available for use, including inkjet printers, thermal printers, laser printers, and 3D printers.

<span class="mw-page-title-main">PostScript</span> File format and programming language

PostScript is a page description language and dynamically typed, stack-based programming language. It is most commonly used in the electronic publishing and desktop publishing realm, but as a Turing complete programming language, it can be used for many other purposes as well. PostScript was created at Adobe Systems by John Warnock, Charles Geschke, Doug Brotz, Ed Taft and Bill Paxton from 1982 to 1984. The most recent version, PostScript 3, was released in 1997.

<span class="mw-page-title-main">Laser printing</span> Electrostatic digital printing process

Laser printing is an electrostatic digital printing process. It produces high-quality text and graphics by repeatedly passing a laser beam back and forth over a negatively charged cylinder called a "drum" to define a differentially charged image. The drum then selectively collects electrically charged powdered ink (toner), and transfers the image to paper, which is then heated to permanently fuse the text, imagery, or both, to the paper. As with digital photocopiers, laser printers employ a xerographic printing process. Laser printing differs from traditional xerography as implemented in analog photocopiers in that in the latter, the image is formed by reflecting light off an existing document onto the exposed drum.

<span class="mw-page-title-main">Daisy wheel printing</span> Impact printing technology

Daisy wheel printing is an impact printing technology invented in 1970 by Andrew Gabor at Diablo Data Systems. It uses interchangeable pre-formed type elements, each with typically 96 glyphs, to generate high-quality output comparable to premium typewriters such as the IBM Selectric, but two to three times faster. Daisy wheel printing was used in electronic typewriters, word processors and computers from 1972. The daisy wheel is so named because of its resemblance to the daisy flower.

Interpress is a page description language developed at Xerox PARC, based on the Forth programming language and an earlier graphics language called JaM. PARC was unable to commercialize Interpress. Two of its creators, Chuck Geschke and John Warnock, left Xerox, formed Adobe Systems, and produced a similar language called PostScript. Interpress is used in some Xerox printers most notable was the DocuTech Network Production Publisher, and is supported in Xerox Ventura Publisher. Interpress is also used as the output format for PARC's InterScript system, which is an editable word processor format for rich text documents. Interpress describes the desired or ideal appearance of a document that has been completely composed by some other process (emitter). All line ending, hyphenation, and line justification decisions, and in fact all decisions about the shapes and positions of the images, are made before creating the master. Since Interpress describes a document in a device-independent manner, a master can be printed on a variety of devices, each of which renders its best approximation to the ideal represented by the master.

<span class="mw-page-title-main">Dots per inch</span> Measure of dot density

Dots per inch is a measure of spatial printing, video or image scanner dot density, in particular the number of individual dots that can be placed in a line within the span of 1 inch (2.54 cm). Similarly, dots per centimetre refers to the number of individual dots that can be placed within a line of 1 centimetre (0.394 in).

<span class="mw-page-title-main">Microdot</span> Steganograph method of hiding messages

A microdot is text or an image substantially reduced in size to prevent detection by unintended recipients. Microdots are normally circular and around 1 millimetre (0.039 in) in diameter but can be made into different shapes and sizes and made from various materials such as polyester or metal. The name comes from the fact that the microdots have often been about the size and shape of a typographical dot, such as a period or the tittle of a lowercase i or j. Microdots are, fundamentally, a steganographic approach to message protection.

A canary trap is a method for exposing an information leak by giving different versions of a sensitive document to each of several suspects and seeing which version gets leaked. It could be one false statement, to see whether sensitive information gets out to other people as well. Special attention is paid to the quality of the prose of the unique language, in the hopes that the suspect will repeat it verbatim in the leak, thereby identifying the version of the document.

<span class="mw-page-title-main">Thermal-transfer printing</span> Digital printing method

Thermal-transfer printing is a digital printing method in which material is applied to paper by melting a coating of ribbon so that it stays glued to the material on which the print is applied. It contrasts with direct thermal printing, where no ribbon is present in the process.

<span class="mw-page-title-main">Xerography</span> Dry photocopying technique

Xerography is a dry photocopying technique. Originally called electrophotography, it was renamed xerography—from the Greek roots ξηρόςxeros, meaning "dry" and -‍γραφία-‍graphia, meaning "writing"—to emphasize that unlike reproduction techniques then in use such as cyanotype, the process of xerography used no liquid chemicals.

<span class="mw-page-title-main">EURion constellation</span> Pattern of symbols incorporated into a number of banknote designs

The EURion constellation is a pattern of symbols incorporated into a number of secure documents such as banknotes, checks, and ownership title certificates designs worldwide since about 1996. It is added to help imaging software detect the presence of such a document in a digital image. Such software can then block the user from reproducing banknotes to prevent counterfeiting using colour photocopiers.

<span class="mw-page-title-main">Security printing</span> Field of the printing industry for banknotes and other security products

Security printing is the field of the printing industry that deals with the printing of items such as banknotes, cheques, passports, tamper-evident labels, security tapes, product authentication, stock certificates, postage stamps and identity cards. The main goal of security printing is to prevent forgery, tampering, or counterfeiting. More recently many of the techniques used to protect these high-value documents have become more available to commercial printers, whether they are using the more traditional offset and flexographic presses or the newer digital platforms. Businesses are protecting their lesser-value documents such as transcripts, coupons and prescription pads by incorporating some of the features listed below to ensure that they cannot be forged or that alteration of the data cannot occur undetected.

Variable Data Intelligent Postscript Printware is an open language from Xerox that enables highest-performance output of variable-data PostScript documents. It is used by the FreeFlow VI Suite (VIPP) front end.

<span class="mw-page-title-main">Microprinting</span> Printing at a small scale

Microprinting is the production of recognizable patterns or characters in a printed medium at a scale that typically requires magnification to read with the naked eye. To the unaided eye, the text may appear as a solid line. Attempts to reproduce by methods of photocopy, image scanning, or pantograph typically translate as a dotted or solid line, unless the reproduction method can identify and recreate patterns to such scale. Microprint is predominantly used as an anti-counterfeiting technique, due to its inability to be easily reproduced by widespread digital methods.

<span class="mw-page-title-main">Digital paper</span> Paper used to create digital documents

Digital paper, also known as interactive paper, is patterned paper used in conjunction with a digital pen to create handwritten digital documents. The printed dot pattern uniquely identifies the position coordinates on the paper. The digital pen uses this pattern to store handwriting and upload it to a computer.

In the distribution and logistics of many types of products, track and trace or tracking and tracing concerns a process of determining the current and past locations of a unique item or property. Mass serialization is the process that manufacturers go through to assign and mark each of their products with a unique identifier such as an Electronic Product Code (EPC) for track and trace purposes. The marking or "tagging" of products is usually completed within the manufacturing process through the use of various combinations of human readable or machine readable technologies such as DataMatrix barcodes or RFID.

DocuTech is the name given to a line of electronic production-publishing systems produced by Xerox Corporation. It allowed paper documents to be scanned, electronically edited, and then printed on demand. DocuTech systems were the last known to use the XNS protocol for networking.

<span class="mw-page-title-main">Photocopier</span> Device for reproducing documents

A photocopier is a machine that makes copies of documents and other visual images onto paper or plastic film quickly and cheaply. Most modern photocopiers use a technology called xerography, a dry process that uses electrostatic charges on a light-sensitive photoreceptor to first attract and then transfer toner particles onto paper in the form of an image. The toner is then fused onto the paper using heat, pressure, or a combination of both. Copiers can also use other technologies, such as inkjet, but xerography is standard for office copying.

The Electronic Frontier Foundation (EFF) is an international non-profit advocacy and legal organization based in the United States.

The Xerox 9700 was a high-end laser printer manufactured by Xerox Corporation beginning in 1977. Based on the Xerox 9200 copier, the 9700 printed at 300 dots-per-inch on cut-sheet paper at up to two pages per second (pps), one- or two-sided, that is simplex or duplex, landscape or portrait.

References

  1. 1 2 DEDA - tracking Dots Extraction, Decoding and Anonymisation toolkit: deda on GitHub
  2. 1 2 3 Richter, Timo; Escher, Stephan; Schönfeld, Dagmar; Strufe, Thorsten (June 14, 2018). "Forensic Analysis and Anonymisation of Printed Documents". Proceedings of the 6th ACM Workshop on Information Hiding and Multimedia Security. ACM. pp. 127–138. doi:10.1145/3206004.3206019. ISBN   9781450356251. S2CID   49345609.
  3. 1 2 "Dresdner Forscher überlisten Tracking-Punkte bei Laserdruckern". heise online (in German). June 25, 2018.
  4. 1 2 Tuohey, Jason (November 22, 2004). "Government Uses Color Laser Printer Technology to Track Documents". PC World. Archived from the original on August 8, 2019. Retrieved December 3, 2019.
  5. 1 2 3 Escher, Stephan (June 28, 2018). "Tracking Dots unlesbar machen: Interview mit Uli Blumenthal". Deutschlandfunk (in German).
  6. US5515451A,Tsuji, Masato; Seki, Masao& Leng, Svayet al.,"Image processing system for selectively reproducing documents",issued 1996-05-07
  7. de Vries, Wilbert (October 26, 2004). "Dutch track counterfeits via printer serial numbers". PC World. Archived from the original on June 24, 2009.
  8. "DocuColor Tracking Dot Decoding Guide". Electronic Frontier Foundation. 2005. Archived from the original on March 5, 2018. Retrieved July 5, 2018.
  9. "Electronic Frontier Foundation". Electronic Frontier Foundation. Archived from the original on June 29, 2010. Retrieved June 9, 2017.
  10. Lee, Robert (July 27, 2005). "Freedom of Information Act (FOIA) request" (PDF). Letter to Latita M. Huff. Electronic Frontier Foundation. Retrieved November 7, 2016.
  11. "List of Printers Which Do or Do Not Display Tracking Dots". Electronic Frontier Foundation . September 20, 2007. Retrieved December 10, 2018.
  12. "Tracking codes in photocopiers and colour laser printers". Parliamentary questions. European Parliament. November 20, 2007.
  13. Frank Rosengart (2005). "Datenspur Papier" (PDF). Die Datenschleuder, Das wissenschaftliche Fachblatt für Datenreisende[technical factsheet]. Hamburg: Chaos Computer Club. pp. 19–21. ISSN   0930-1054 . Retrieved February 27, 2011.{{cite book}}: |journal= ignored (help)
  14. "Abschnitt „Technische Daten des Digitalen Farbdrucksystems Xerox DocuColor 6060"" (PDF; 1,4 MB). Xerox DocuColor® 6060 Digitales Farbdrucksystem (Prospectus). Neuss: Xerox GmbH. p. 8. Retrieved February 27, 2011.
  15. "Beitrag bei Druckerchannel: Big Brother is watching you: Code bei Farblasern entschlüsselt". Druckerchannel.de. October 26, 2005.
  16. "CONGRATULATIONS to "All Your Shreds Are Belong To U.S."!". Defense Advanced Research Projects Agency. November 21, 2011. Archived from the original on August 25, 2016. Retrieved June 12, 2014.
  17. "Tip for Bad Guys: Burn, Don't Shred". Bloomberg Businessweek . December 15, 2011. Retrieved June 12, 2014.
  18. Anderson, L.V. (June 6, 2017). "Did The Intercept Betray Its NSA Source With Sloppy Reporting?". Digg. Archived from the original on January 1, 2019.
  19. Wemple, Erik (June 6, 2017). "Did the Intercept bungle the NSA leak?". The Washington Post. Archived from the original on July 13, 2018.
  20. "docucolor.cgi - CGI script to interpret Xerox DocuColor forensic dot pattern". Electronic Frontier Foundation. 2005. Archived from the original on May 8, 2017. Retrieved October 3, 2018.
  21. "So verpfeift dich dein Drucker nicht". Deutschlandfunk Nova (in German). June 26, 2018.
  22. Printer Characterization and Signature Embedding for Security and Forensic Applications Pei-Ju Chiang, Aravind K. Mikkilineni, Sungjoo Suh, Jan P. Allebach, George T.-C. Chiu, Edward J. Delp., Purdue University, 2006 (poster)