Mac OS Gurmukhi

Last updated
Mac OS Gurmukhi
Alias(es)x-mac-gurmukhi
Created by Apple, Inc
Classification Extended ASCII, Mac OS script
Extends US-ASCII
Based on ISCII

Mac OS Gurmukhi is a character set developed by Apple Inc., based on IS 13194:1991 (ISCII-91). [1]

Code page layout

The following table shows the Mac OS Gurmukhi encoding. [1] Each character is shown with its equivalent Unicode code point. Only the second half of the table (code points 128255) is shown, the first half (code points 0127) being the same as Mac OS Roman.

Mac OS Gurmukhi
0123456789ABCDEF
8x × © ®
9x
Ax
Bx
Cx
Dx ਸ਼ LRM ਿ
Ex
Fx

Byte pairs and ISCII-related features are described in the mapping file. [1]

Related Research Articles

ISO/IEC 8859-11:2001, Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 2001. It is informally referred to as Latin/Thai. It is nearly identical to the national Thai standard TIS-620 (1990). The sole difference is that ISO/IEC 8859-11 allocates non-breaking space to code 0xA0, while TIS-620 leaves it undefined.

Indian Script Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Bengali–Assamese, Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu. ISCII does not encode the writing systems of India that are based on Persian, but its writing system switching codes nonetheless provide for Kashmiri, Sindhi, Urdu, Persian, Pashto and Arabic. The Persian-based writing systems were subsequently encoded in the PASCII encoding.

Mac OS Roman is a character encoding created by Apple Computer, Inc. for use by Macintosh computers. It is suitable for representing text in English and several other Western languages. Mac OS Roman encodes 256 characters, the first 128 of which are identical to ASCII, with the remaining characters including mathematical symbols, diacritics, and additional punctuation marks. Mac OS Roman is an extension of the original Macintosh character set, which encoded only 217 characters. Full support for Mac OS Roman first appeared in System 6.0.4, released in 1989, and the encoding is still supported in current versions of macOS, though the standard character encodings are now UTF-8 or UTF-16. Apple modified Mac OS Roman in 1998 with the release of Mac OS 8.5 by replacing the currency sign at position hexadecimal 0xDB with the euro sign, but otherwise the encoding has been unchanged since its release.

Mac OS Cyrillic is a character encoding used on Apple Macintosh computers to represent texts in the Cyrillic script.

Mac OS Central European is a character encoding used on Apple Macintosh computers to represent texts in Central European and Southeastern European languages that use the Latin script. This encoding is also known as Code Page 10029. IBM assigns code page/CCSID 1282 to this encoding. This codepage contains diacritical letters that ISO 8859-2 does not have, and vice versa.

Mac OS Icelandic is a character encoding used in Apple Macintosh computers to represent Icelandic text. It is largely identical to Mac OS Roman, except for the Icelandic special characters Ý, Þ and Ð which have replaced typography characters.

Mac OS Ukrainian is a character encoding used on Apple Macintosh computers prior to Mac OS 9 to represent texts in Cyrillic script which include the letters ‹Ґ› and ‹ґ›, including the Ukrainian alphabet.

MacGreek encoding or Macintosh Greek encoding is used in Apple Macintosh computers to represent texts in the Greek language that uses the Greek script. This encoding is registered as IBM code page/CCSID 1280 and Windows code page 10006.

Each character is shown with its equivalent Unicode code point. Only the second half of the table is shown, the first half being the same as ASCII.

Mac OS Romanian is a character encoding used on Apple Macintosh computers to represent the Romanian language. It is a derivative of Mac OS Roman.

Mac OS Croatian is a character encoding used on Apple Macintosh computers to represent Gaj's Latin alphabet. It is a derivative of Mac OS Roman. The three digraphs, Dž, Lj, and Nj, are not encoded.

Mac OS Celtic is a character encoding used by the Mac OS to represent Welsh text, replacing 14 of the Mac OS Roman characters with Welsh characters. This character set was developed by Michael Everson and was used for the Irish localizations of Mac OS 6.0.8 and 7.1 and for the Welsh localization of Mac OS 7.1.

Mac OS Gaelic is a character encoding created for the Irish Gaelic language, based on the Welsh Mac OS Celtic encoding but replacing 23 characters with Gaelic characters. It was developed by Michael Everson, and was in his CeltScript fonts and on some fonts included with the Irish localization of Mac OS 6.0.8 and 7.1 and on.

Mac OS Sámi is a character encoding used on classic Mac OS to represent the Sámi languages and the Finnish Kalo language. While not used in any official Apple product, it has been used in various fonts designed to support Sámi languages under classic Mac OS, including those from Evertype.

Macintosh Latin is a character encoding which is used by Kermit to represent text on the Apple Macintosh. It is a modification of Mac OS Icelandic to include all characters in ISO/IEC 8859-1, DEC MCS, the PostScript Standard Encoding, and a Dutch ISO 646 variant. Although Macintosh Latin is designed to be compatible with the standard Macintosh Mac OS Roman encoding for the shared subset of characters, the two should not be confused.

Mac OS Inuit, also called Mac OS Inuktitut or InuitSCII, is an 8-bit, single byte, extended ASCII character encoding supporting the variant of Canadian Aboriginal syllabics used by the Inuktitut language. It was designed by Doug Hitch for the government of the Northwest Territories, and adopted by Michael Everson for his fonts.

The Macintosh Turkic Cyrillic encoding is used in Apple Macintosh computers to represent texts in the Cyrillic script for Turkic languages. It was created by Michael Everson for use in his fonts, but is not an official Mac OS Codepage. It supports Azerbaijani, Bashkir, Kazakh, Kyrgyz, Tajik, Tatar, Turkmen, and Uzbek.

The Macintosh Barents Cyrillic encoding is used in Apple Macintosh computers to represent texts in Kildin Sami, Komi, and Nenets.

Mac OS Ogham is a character encoding for representing Ogham text on Apple Macintosh computers. It is a superset of the Irish Standard I.S. 434:1999 character encoding for Ogham, adding some punctuation characters from Mac OS Roman. It is not an official Mac OS Codepage.

Mac OS Gujarati is a character set developed by Apple Inc. based on IS 13194:1991 (ISCII-91).

References

  1. 1 2 3 Apple, Inc. (2005-04-05) [1998-02-05]. "GURMUKHI.TXT: Map (external version) from Mac OS Gurmukhi character set to Unicode 2.1 and later" (TXT). Unicode, Inc. Retrieved 2020-03-15.