Mac OS Roman

Last updated
Mac OS Roman
MIME / IANAmacintosh
Language(s) English, various others
Created byApple Computer, Inc.
Classification Extended ASCII, Mac OS script
Extends ASCII, Macintosh character set

Mac OS Roman is a character encoding created by Apple Computer, Inc. for use by Macintosh computers. [1] It is suitable for representing text in English and several other Western languages. Mac OS Roman encodes 256 characters, the first 128 of which are identical to ASCII, with the remaining characters including mathematical symbols, diacritics, and additional punctuation marks. Mac OS Roman is an extension of the original Macintosh character set, which encoded only 217 characters. [1] Full support for Mac OS Roman first appeared in System 6.0.4, released in 1989, [2] and the encoding is still supported in current versions of macOS, though the standard character encoding is now UTF-8. Apple modified Mac OS Roman in 1998 with the release of Mac OS 8.5 by replacing the currency sign with the euro sign, [3] but otherwise the encoding has been unchanged since its release.

Contents

Character set

The following table shows how characters are encoded in Mac OS Roman. The row and column headings give the first and second digit of the hexadecimal code for each character in the table.

Mac OS Roman [4] [5]
0123456789ABCDEF
0x NUL SOH STX ETX EOT ENQ ACK BEL   BS    HT    LF    VT    FF    CR    SO    SI   
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN   EM   SUB ESC   FS    GS    RS    US  
2x  SP   ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL
8x Ä Å Ç É Ñ Ö Ü á à â ä ã å ç é è
9x ê ë í ì î ï ñ ó ò ô ö õ ú ù û ü
Ax ° ¢ £ § ß ® © ´ ¨ Æ Ø
Bx ± ¥ µ π ª º Ω [lower-alpha 1] æ ø
Cx ¿ ¡ ¬ ƒ « » NBSP À Ã Õ Œ œ
Dx ÷ ÿ Ÿ [lower-alpha 2]
Ex · Â Ê Á Ë È Í Î Ï Ì Ó Ô
Fx Apple logo black.svg [lower-alpha 3] Ò Ú Û Ù ı ˆ ˜ ¯ ˘ ˙ ˚ ¸ ˝ ˛ ˇ
  1. Prior to December 1997, Apple's mapping published on Unicode.org mapped this character to U+2126 OHM SIGN. [5]
  2. Before Mac OS 8.5, the character at position 0xDB was the generic currency sign (¤). [lower-alpha 4]
  3. The character at position 0xF0 is a solid Apple logo. Apple uses Unicode character U+F8FF in the Corporate Private Use Area for this character, but it may not be supported on non-Apple systems.

See also

Notes

  1. 1 2 Apple Computer, Inc. (1993). Inside Macintosh: Text (PDF). p. 1-53. ISBN   0-201-63298-5. Archived from the original (PDF) on 2019-12-11. Retrieved July 10, 2021.
  2. Apple Computer, Inc. (1991). Inside Macintosh, Volume VI. p. 14-104. ISBN   0-201-57755-0.
  3. Apple Computer, Inc. (September 14, 1998). "Technical Note TN1104: The Euro Currency Symbol" . Retrieved July 10, 2021.
  4. Inside Macintosh: Text (PDF). Apple Computer, Inc. 1993. pp. 1–54, A-5–A-18. ISBN   0-201-63298-5. Archived from the original (PDF) on 2019-12-11. Retrieved July 10, 2021.
  5. 1 2 "ROMAN.TXT". Unicode.org. Apple Computer, Inc. 5 April 2005. Retrieved 9 October 2023.

Related Research Articles

Big-5 or Big5 is a Chinese character encoding method used in Taiwan, Hong Kong, and Macau for traditional Chinese characters.

Mac OS Cyrillic is a character encoding used on Apple Macintosh computers to represent texts in the Cyrillic script.

Mac OS Central European is a character encoding used on Apple Macintosh computers to represent texts in Central European and Southeastern European languages that use the Latin script. This encoding is also known as Code Page 10029. IBM assigns code page/CCSID 1282 to this encoding. This codepage contains diacritical letters that ISO 8859-2 does not have, and vice versa.

The currency sign¤ is a character used to denote an unspecified currency. It can be described as a circle the size of a lowercase character with four short radiating arms at 45° (NE), 135° (SE), 225° (SW) and 315° (NW). It is raised slightly above the baseline. The character is sometimes called scarab.

Several 8-bit character sets (encodings) were designed for binary representation of common Western European languages, which use the Latin alphabet, a few additional letters and ones with precomposed diacritics, some punctuation, and various symbols. These character sets also happen to support many other languages such as Malay, Swahili, and Classical Latin.

In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.

<span class="mw-page-title-main">JIS X 0201</span> Japanese single byte character encoding

JIS X 0201, a Japanese Industrial Standard developed in 1969, was the first Japanese electronic character set to become widely used. The character set was initially known as JIS C 6220 before the JIS category reform. Its two forms were a 7-bit encoding or an 8-bit encoding, although the 8-bit form was dominant until Unicode replaced it. The full name of this standard is 7-bit and 8-bit coded character sets for information interchange (7ビット及び8ビットの情報交換用符号化文字集合).

Symbol is one of the four standard fonts available on all PostScript-based printers, starting with Apple's original LaserWriter (1985). It contains a complete unaccented Greek alphabet and a selection of commonly used mathematical symbols. Insofar as it fits into any standard classification, it is a serif font designed in the style of Times New Roman.

Mac OS Icelandic is an obsolete character encoding that was used in Apple Macintosh computers to represent Icelandic text. It is largely identical to Mac OS Roman, except for the Icelandic special characters Ý, Þ and Ð which have replaced typography characters.

KPS 9566 is a North Korean standard specifying a character encoding for the Chosŏn'gŭl (Hangul) writing system used for the Korean language. The edition of 1997 specified an ISO 2022-compliant 94×94 two-byte coded character set. Subsequent editions have added additional encoded characters outside of the 94×94 plane, in a manner comparable to UHC or GBK.

KS X 1001, "Code for Information Interchange ", formerly called KS C 5601, is a South Korean coded character set standard to represent Hangul and Hanja characters on a computer.

Mac OS Ukrainian is a character encoding used on Apple Macintosh computers prior to Mac OS 9 to represent texts in Cyrillic script which include the letters ‹Ґ› and ‹ґ›, including the Ukrainian alphabet.

MacGreek encoding or Macintosh Greek encoding is used in Apple Macintosh computers to represent texts in the Greek language that uses the Greek script. This encoding is registered as IBM code page/CCSID 1280 and Windows code page 10006.

Mac OS Romanian is a character encoding used on Apple Macintosh computers to represent the Romanian language. It is a derivative of Mac OS Roman.

Mac OS Croatian is a character encoding used on Apple Macintosh computers to represent Gaj's Latin alphabet. It is a derivative of Mac OS Roman. The three digraphs, Dž, Lj, and Nj, are not encoded.

Mac OS Celtic is a character encoding used by Mac OS to represent Welsh text, replacing 14 of the Mac OS Roman characters with Welsh characters. This character set was developed by Michael Everson and was used for the Irish localizations of Mac OS 6.0.8 and 7.1 and for the Welsh localization of Mac OS 7.1.

Mac OS Gaelic is a character encoding created for the Irish Gaelic language, based on the Welsh Mac OS Celtic encoding but replacing 23 characters with Gaelic characters. It was developed by Michael Everson, and was in his CeltScript fonts and on some fonts included with the Irish localization of Mac OS 6.0.8 and 7.1 and on.

Mac OS Sámi is a character encoding used on classic Mac OS to represent the Sámi languages and the Finnish Kalo language. While not used in any official Apple product, it has been used in various fonts designed to support Sámi languages under classic Mac OS, including those from Evertype.

Mac OS Maltese/Esperanto, called MacOS Esperanto in older sources, is a character encoding for Esperanto, Maltese and Turkish created by Michael Everson on August 15 1997, based on the Mac OS Turkish encoding. It is used in his fonts, but not on official Mac OS fonts.

The VT100 code page is a character encoding used to represent text on the Classic Mac OS for compatibility with the VT100 terminal. It encodes 256 characters, the first 128 of which are identical to ASCII, with the remaining characters including mathematical symbols, diacritics, and additional punctuation marks. It is suitable for English and several other Western languages. It is similar to Mac OS Roman but includes all characters in ISO 8859-1 except for the currency sign, the no-break space, and the soft hyphen. It also includes all characters in DEC Special Graphics, except for the new line and no-break space controls. The VT100 encoding is only used on the VT100 font on the Classic Mac OS and is not an official Mac OS character encoding.