Greek and Coptic | |
---|---|
Range | U+0370..U+03FF (144 code points) |
Plane | BMP |
Scripts | Greek (117 char.) Coptic (14 char.) Common (4 char.) |
Major alphabets | Greek |
Assigned | 135 code points |
Unused | 9 reserved code points |
Source standards | ISO 8859-7 |
Unicode version history | |
1.0.0 (1991) | 112 (+112) |
1.0.1 (1992) | 103 (-9) |
1.1 (1993) | 105 (+2) |
3.0 (1999) | 110 (+5) |
3.1 (2001) | 112 (+2) |
3.2 (2002) | 115 (+3) |
4.0 (2003) | 120 (+5) |
4.1 (2005) | 124 (+4) |
5.0 (2006) | 127 (+3) |
5.1 (2008) | 134 (+7) |
7.0 (2014) | 135 (+1) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] [3] |
Greek and Coptic is the Unicode block for representing modern (monotonic) Greek. It was originally also used for writing Coptic, [1] using the similar Greek letters in addition to the uniquely Coptic additions. Beginning with version 4.1 of the Unicode Standard, a separate Coptic block has been included in Unicode, allowing for mixed Greek/Coptic text that is stylistically contrastive, as is convention in scholarly works. Writing polytonic Greek requires the use of combining characters or the precomposed vowel + tone characters in the Greek Extended character block.
Its block name in Unicode 1.0 was simply Greek, although Coptic letters were already included. [4]
Points were reserved for the uppercase forms of ΐ, ΰ and ς. While letter-diacritic combinations such as ΐ and ΰ are no longer accepted by Unicode, a capital ς remains a theoretical possibility. There is in addition room for three additional casing pairs, or for capital forms of letters such as lunate ϵ and ϶.
Greek and Coptic [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+037x | Ͱ | ͱ | Ͳ | ͳ | ʹ | ͵ | Ͷ | ͷ | ͺ | ͻ | ͼ | ͽ | ; | Ϳ | ||
U+038x | ΄ | ΅ | Ά | · | Έ | Ή | Ί | Ό | Ύ | Ώ | ||||||
U+039x | ΐ | Α | Β | Γ | Δ | Ε | Ζ | Η | Θ | Ι | Κ | Λ | Μ | Ν | Ξ | Ο |
U+03Ax | Π | Ρ | Σ | Τ | Υ | Φ | Χ | Ψ | Ω | Ϊ | Ϋ | ά | έ | ή | ί | |
U+03Bx | ΰ | α | β | γ | δ | ε | ζ | η | θ | ι | κ | λ | μ | ν | ξ | ο |
U+03Cx | π | ρ | ς | σ | τ | υ | φ | χ | ψ | ω | ϊ | ϋ | ό | ύ | ώ | Ϗ |
U+03Dx | ϐ | ϑ | ϒ | ϓ | ϔ | ϕ | ϖ | ϗ | Ϙ | ϙ | Ϛ | ϛ | Ϝ | ϝ | Ϟ | ϟ |
U+03Ex | Ϡ | ϡ | Ϣ | ϣ | Ϥ | ϥ | Ϧ | ϧ | Ϩ | ϩ | Ϫ | ϫ | Ϭ | ϭ | Ϯ | ϯ |
U+03Fx | ϰ | ϱ | ϲ | ϳ | ϴ | ϵ | ϶ | Ϸ | ϸ | Ϲ | Ϻ | ϻ | ϼ | Ͻ | Ͼ | Ͽ |
Notes |
In Unicode 1.0.1, a number of changes were made to this block in order to make Unicode 1.0.1 a proper subset of ISO 10646. [1] [2] [3]
The following Unicode-related documents record the purpose and process of defining specific characters in the Greek and Coptic block:
Version | Final code points [a] | Count | UTC ID | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|---|
1.0.0 | U+0374..0375, 037A, 037E, 0384..0386, 0388..038A, 038C, 038E..03A1, 03A3..03CE, 03D0..03D6, 03DA, 03DC, 03DE, 03E0, 03E2..03F0 | 101 | (to be determined) | |||
L2/06-386 | Davis, Mark (2006-11-09), Properties not preserving canonical equivalence | |||||
L2/07-015 | Moore, Lisa (2007-02-08), "UCD Canonical Check (B.14.9) [U+0374]", UTC #110 Minutes | |||||
L2/07-071 | Davis, Mark (2007-02-08), UCD Canonical Check | |||||
U+03F1 | 1 | UTC/1991-048B | Whistler, Ken (1991-03-27), "Alternate Rho", Draft Minutes from the UTC meeting #46 day 2, 3/27 at Apple | |||
U+03F2 | 1 | UTC/1991-048B | Whistler, Ken (1991-03-27), "Lunate Sigma", Draft Minutes from the UTC meeting #46 day 2, 3/27 at Apple | |||
1.1 | U+0387, 03F3 | 2 | (to be determined) | |||
L2/06-386 | Davis, Mark (2006-11-09), Properties not preserving canonical equivalence | |||||
L2/07-015 | Moore, Lisa (2007-02-08), "UCD Canonical Check (B.14.9) [U+0387]", UTC #110 Minutes | |||||
L2/07-071 | Davis, Mark (2007-02-08), UCD Canonical Check | |||||
3.0 | U+03D7, 03DB, 03DD, 03DF, 03E1 | 5 | L2/98-210 | N1743 | Everson, Michael (1998-05-25), Additional Greek characters for the UCS | |
L2/98-281R (pdf, html) | Aliprand, Joan (1998-07-31), "Greek characters (IV.C.2)", Unconfirmed Minutes – UTC #77 & NCITS Subgroup L2 # 174 JOINT MEETING, Redmond, WA -- July 29-31, 1998 | |||||
L2/98-292R (pdf, html, Figure 1) | "2.2", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-19 | |||||
L2/98-292 | N1840 | "2.2", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-25 | ||||
L2/98-293 | N1885 | "2.3", Comments on proposals to add various characters to ISO/IEC 10646, 1998-08-25 | ||||
L2/98-301 | N1847 | Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals | ||||
L2/98-372 | N1884R2 (pdf, doc) | Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS | ||||
L2/98-329 | N1920 | Combined PDAM registration and consideration ballot on WD for ISO/IEC 10646-1/Amd. 30, AMENDMENT 30: Additional Latin and other characters, 1998-10-28 | ||||
L2/99-010 | N1903 (pdf, html, doc) | Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25 | ||||
3.1 | U+03F4..03F5 | 2 | L2/00-119 [b] | N2191R | Whistler, Ken; Freytag, Asmus (2000-04-19), Encoding Additional Mathematical Symbols in Unicode | |
L2/00-234 | N2203 (rtf, txt) | Umamaheswaran, V. S. (2000-07-21), "8.18", Minutes from the SC2/WG2 meeting in Beijing, 2000-03-21 -- 24 | ||||
L2/00-115R2 | Moore, Lisa (2000-08-08), "Motion 83-M11", Minutes Of UTC Meeting #83 | |||||
3.2 | U+03D8..03D9, 03F6 | 3 | L2/99-018 | N1938 | Everson, Michael (1998-12-12), On GREEK LETTER KOPPA | |
L2/99-077.1 | N1975 | Irish Comments on SC 2 N 3210, 1999-01-20 | ||||
L2/99-054R | Aliprand, Joan (1999-06-21), "Greek Letter Koppa", Approved Minutes from the UTC/L2 meeting in Palo Alto, February 3-5, 1999 | |||||
L2/00-119 [b] | N2191R | Whistler, Ken; Freytag, Asmus (2000-04-19), Encoding Additional Mathematical Symbols in Unicode | ||||
L2/00-234 | N2203 (rtf, txt) | Umamaheswaran, V. S. (2000-07-21), "8.18", Minutes from the SC2/WG2 meeting in Beijing, 2000-03-21 -- 24 | ||||
L2/00-115R2 | Moore, Lisa (2000-08-08), "Motion 83-M11", Minutes Of UTC Meeting #83 | |||||
L2/01-050 | N2253 | Umamaheswaran, V. S. (2001-01-21), "RESOLUTION M39.24", Minutes of the SC2/WG2 meeting in Athens, September 2000 | ||||
4.0 | U+03F7..03F8 | 2 | L2/01-007 | Bunz, Carl-Martin (2000-12-21), "Bactrian", Iranianist Meeting Report: Symposium on Encoding Iranian Scripts in Unicode | ||
L2/02-009 | Bunz, Carl-Martin (2001-11-23), "Bactrian", 2nd Iranian Meeting Report | |||||
L2/02-056 | N2411 | Everson, Michael (2002-01-30), Proposal to add two Greek letters for Bactrian to the UCS | ||||
L2/02-166R2 | Moore, Lisa (2002-08-09), "Scripts and New Characters - Bactrian", UTC #91 Minutes, The UTC accepts two Greek characters for use in Bactrian | |||||
U+03F9 | 1 | L2/02-314R [c] | N2512 | Pantelia, Maria (2002-08-21), Proposal to add the Greek Capital Lunate Sigma Symbol to the UCS | ||
U+03FA..03FB | 2 | L2/02-313R2 [c] | N2522 | Pantelia, Maria (2002-11-07), Proposal to encode the Archaic Greek Letter San and Greek Small Letter San in the UCS | ||
L2/04-034 | March, Jeremy; Kirk, Peter; Rourke, Patrick; Anderson, Deborah (2004-01-23), Request to Change Greek Collation Order for SAN | |||||
4.1 | U+03FC | 1 | L2/03-157 | Pantelia, Maria (2003-05-19), Additional Beta Code Characters not in Unicode (WIP) | ||
L2/03-188R | N2612-5 | Pantelia, Maria (2003-06-11), Proposal to encode the Greek Rho with Stroke Symbol in the UCS | ||||
U+03FD..03FF | 3 | L2/02-031 | Anderson, Deborah (2002-01-21), TLG Miscellanea Proposal | |||
L2/02-033 | Anderson, Deborah (2002-01-21), TLG Unicode Proposal (draft) | |||||
L2/02-053 | Anderson, Deborah (2002-02-04), Description of TLG Documents | |||||
L2/02-273 | Pantelia, Maria (2002-07-31), TLG Unicode Proposal | |||||
L2/02-287 | Pantelia, Maria (2002-08-09), Proposal Summary Form accompanying TLG Unicode Proposal (L2/02-273) | |||||
L2/02-312R | Pantelia, Maria (2002-11-07), Proposal to encode additional Greek editorial and punctuation characters in the UCS | |||||
L2/03-324 | N2642 | Pantelia, Maria (2003-10-06), Proposal to encode additional Greek editorial and punctuation characters in the UCS | ||||
5.0 | U+037B..037D | 3 | L2/05-076 | Davis, Mark (2005-02-10), Stability of Case Folding | ||
N2942 | Freytag, Asmus; Whistler, Ken (2005-08-12), Proposal to add nine lowercase characters | |||||
L2/05-108R | Moore, Lisa (2005-08-26), "Stability of Case Folding (B.14.2)", UTC #103 Minutes | |||||
N2953 (pdf, doc) | Umamaheswaran, V. S. (2006-02-16), "M47.5c, M47.5d, M47.5e", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15 | |||||
5.1 | U+0370..0373, 0376..0377 | 6 | L2/04-388 | Nicholas, Nick (2004-11-10), Proposal to add Greek Letter Lowercase Heta and Greek Letter Capital Heta to the UCS | ||
L2/04-389 | Nicholas, Nick (2004-11-10), Epigraphical Greek Letters: Request for preliminary opinion | |||||
L2/05-002 | Nicholas, Nick (2005-01-01), Proposal to add Greek Letter Lowercase Heta and Greek Letter Capital Heta (Updated fonts 2005-04-28) | |||||
L2/05-003R | N2946 | Nicholas, Nick (2005-01-01), Proposal to add Greek epigraphical letters | ||||
L2/05-054 | Anderson, Deborah (2005-01-31), Feedback on Proposals for Tack Hetas and Greek Epigraphical Letters (L2/05-002, L2/05-003) | |||||
L2/05-098 | Nicholas, Nick (2005-04-03), Proposal to add Greek epigraphical characters to the UCS | |||||
L2/05-108R | Moore, Lisa (2005-08-26), "Greek Epigraphical Letters (C.10)", UTC #103 Minutes | |||||
N2953 (pdf, doc) | Umamaheswaran, V. S. (2006-02-16), "7.4.3", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15 | |||||
N3153 (pdf, doc) | Umamaheswaran, V. S. (2007-02-16), "M49.1c [U+0373]", Unconfirmed minutes of WG 2 meeting 49 AIST, Akihabara, Tokyo, Japan; 2006-09-25/29 | |||||
U+03CF | 1 | L2/06-266 | N3122 | Everson, Michael (2006-08-06), Proposal to add Latin letters and a Greek symbol to the UCS | ||
L2/06-231 | Moore, Lisa (2006-08-17), "C.16", UTC #108 Minutes | |||||
N3153 (pdf, doc) | Umamaheswaran, V. S. (2007-02-16), "M49.3", Unconfirmed minutes of WG 2 meeting 49 AIST, Akihabara, Tokyo, Japan; 2006-09-25/29 | |||||
7.0 | U+037F | 1 | L2/10-474 | N3997 | Bobeck, Michael (2010-12-12), Proposal to encode GREEK CAPITAL LETTER YOT | |
L2/11-016 | Moore, Lisa (2011-02-15), "C.9", UTC #126 / L2 #223 Minutes | |||||
N4103 | "11.2.8 Greek Capital Letter YOT", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03 | |||||
Digamma or wau is an archaic letter of the Greek alphabet. It originally stood for the sound but it has remained in use principally as a Greek numeral for 6. Whereas it was originally called waw or wau, its most common appellation in classical Greek is digamma; as a numeral, it was called episēmon during the Byzantine era and is now known as stigma after the Byzantine ligature combining σ-τ as ϛ.
Epsilon is the fifth letter of the Greek alphabet, corresponding phonetically to a mid front unrounded vowel IPA:[e̞] or IPA:[ɛ̝]. In the system of Greek numerals it also has the value five. It was derived from the Phoenician letter He . Letters that arose from epsilon include the Roman E, Ë and Ɛ, and Cyrillic Е, È, Ё, Є and Э. The name of the letter was originally εἶ, but it was later changed to ἒ ψιλόν in the Middle Ages to distinguish the letter from the digraph αι, a former diphthong that had come to be pronounced the same as epsilon.
Iota is the ninth letter of the Greek alphabet. It was derived from the Phoenician letter Yodh. Letters that arose from this letter include the Latin I and J, the Cyrillic І, Yi, and Je, and iotated letters. In the system of Greek numerals, iota has a value of 10.
Greek numerals, also known as Ionic, Ionian, Milesian, or Alexandrian numerals, is a system of writing numbers using the letters of the Greek alphabet. In modern Greece, they are still used for ordinal numbers and in contexts similar to those in which Roman numerals are still used in the Western world. For ordinary cardinal numbers, however, modern Greece uses Arabic numerals.
Upsilon or ypsilon is the twentieth letter of the Greek alphabet. In the system of Greek numerals, Υʹ has a value of 400. It is derived from the Phoenician waw .
Koppa or qoppa is a letter that was used in early forms of the Greek alphabet, derived from Phoenician qoph (𐤒). It was originally used to denote the sound, but dropped out of use as an alphabetic character and replaced by Kappa (Κ). It has remained in use as a numeral symbol (90) in the system of Greek numerals, although with a modified shape. Koppa is the source of Latin Q, as well as the Cyrillic numeral sign of the same name (Koppa).
Sampi is an archaic letter of the Greek alphabet. It was used as an addition to the classical 24-letter alphabet in some eastern Ionic dialects of ancient Greek in the 6th and 5th centuries BC, to denote some type of a sibilant sound, probably or, and was abandoned when the sound disappeared from Greek.
Omicron is the fifteenth letter of the Greek alphabet. This letter is derived from the Phoenician letter ayin: . In classical Greek, omicron represented the close-mid back rounded vowel IPA:[o] in contrast to omega which represented the open-mid back rounded vowel IPA:[ɔː] and the digraph ου which represented the long close-mid back rounded vowel IPA:[oː]. In modern Greek, both omicron and omega represent the mid back rounded vowel IPA:[o̞] or IPA:[ɔ̝]. Letters that arose from omicron include Roman O and Cyrillic O|Ю (Cyrillic)|Ю. The word literally means "little O" as opposed to "great O". In the system of Greek numerals, omicron has a value of 70.
In the polytonic orthography of Ancient Greek, the rough breathing character is a diacritical mark used to indicate the presence of an sound before a vowel, diphthong, or after rho. It remained in the polytonic orthography even after the Hellenistic period, when the sound disappeared from the Greek language. In the monotonic orthography of Modern Greek phonology, in use since 1982, it is not used at all.
The smooth breathing is a diacritical mark used in polytonic orthography. In Ancient Greek, it marks the absence of the voiceless glottal fricative from the beginning of a word.
San (Ϻ) was an archaic letter of the Greek alphabet. Its shape was similar to modern M or Mu, or to a modern Greek Sigma (Σ) turned sideways, and it was used as an alternative to Sigma to denote the sound. Unlike Sigma, whose position in the alphabet is between Rho and Tau, San appeared between Pi and Qoppa in alphabetic order. In addition to denoting this separate archaic character, the name San was also used as an alternative name to denote the standard letter Sigma.
The European ordering rules define an ordering for strings written in languages that are written with the Latin, Greek and Cyrillic alphabets. The standard covers languages used by the European Union, the European Free Trade Association, and parts of the former Soviet Union. It is a tailoring of the Common Tailorable Template of ISO/IEC 14651. EOR can in turn be tailored for different (European) languages. But in inter-European contexts, EOR can be used without further tailoring.
The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It was derived from the earlier Phoenician alphabet, and is the earliest known alphabetic script to have developed distinct letters for consonants as well as vowels. In Archaic and early Classical times, the Greek alphabet existed in many local variants, but, by the end of the 4th century BC, the Ionic-based Euclidean alphabet, with 24 letters, ordered from alpha to omega, had become standard throughout the Greek-speaking world and is the version that is still used for Greek writing today.
Romanization of Greek is the transliteration (letter-mapping) or transcription (sound-mapping) of text from the Greek alphabet into the Latin alphabet.
Unicode has a certain amount of duplication of characters. These are pairs of single Unicode code points that are canonically equivalent. The reason for this are compatibility issues with legacy systems.
Diacritical marks of two dots¨, placed side-by-side over or under a letter, are used in several languages for several different purposes. The most familiar to English-language speakers are the diaeresis and the umlaut, though there are numerous others. For example, in Albanian, ë represents a schwa. Such diacritics are also sometimes used for stylistic reasons.
The orthography of the Greek language ultimately has its roots in the adoption of the Greek alphabet in the 9th century BC. Some time prior to that, one early form of Greek, Mycenaean, was written in Linear B, although there was a lapse of several centuries between the time Mycenaean stopped being written and the time when the Greek alphabet came into use.
Greek orthography has used a variety of diacritics starting in the Hellenistic period. The more complex polytonic orthography, which includes five diacritics, notates Ancient Greek phonology. The simpler monotonic orthography, introduced in 1982, corresponds to Modern Greek phonology, and requires only two diacritics.
The table of chords, created by the Greek astronomer, geometer, and geographer Ptolemy in Egypt during the 2nd century AD, is a trigonometric table in Book I, chapter 11 of Ptolemy's Almagest, a treatise on mathematical astronomy. It is essentially equivalent to a table of values of the sine function. It was the earliest trigonometric table extensive enough for many practical purposes, including those of astronomy. Since the 8th and 9th centuries, the sine and other trigonometric functions have been used in Islamic mathematics and astronomy, reforming the production of sine tables. Khwarizmi and Habash al-Hasib later produced a set of trigonometric tables.
Greek Extended is a Unicode block containing the accented vowels necessary for writing polytonic Greek. The regular, unaccented Greek characters as well as the characters with tonos and diaeresis can be found in the Greek and Coptic block. Greek Extended was encoded in version 1.1 of the Unicode Standard. As an alternative to Greek Extended, combining characters can be used to represent the tones and breath marks of polytonic Greek.