In computing HP Roman is a family of character sets consisting of HP Roman Extension, HP Roman-8, HP Roman-9 and several variants. Originally introduced by Hewlett-Packard around 1978, revisions and adaptations were published several times up to 1999. The 1985 revisions were later standardized as IBM codepages 1050 and 1051. Supporting many European languages,[ which? ] the character sets were used by various HP workstations, terminals, calculators as well as many printers, also from third-parties.
HP Roman is a family of single byte character encodings supporting several Latin script based languages of Europe. It was originally introduced by Hewlett-Packard around 1978 as 7- and 8-bit HP Roman Extension for some of their computer terminals and printers. Early versions of the 8-bit variant were also used by some HP workstations in 1978/1979. Several revisions led to more characters being added before the 8-bit variant of the character set became officially known as HP Roman-8 in 1983. [1] Soon later, this became the default character set of the HP-UX [2] operating system and the page description language PCL for inkjet [3] and laser printers in 1984. The character set was again expanded in 1985. [4] A modified adaptation of the 1984 definition of Roman-8 was used in the HP Portable series of computers, [5] [6] whereas a derivation of the updated 1985 definition of Roman-8 was used in several early RPL calculators and corresponding thermal printers since 1986. [7] The latest off-spring of the family is HP Roman-9 , which was introduced in 1999 to include the euro sign. [8] PCL Ventura International is based on HP Roman-8.
The character set was originally introduced by Hewlett-Packard as extended ASCII 7-bit codepage named HP Roman Extension, [9] [10] which existed at least since 1978. [11] [12] [13] [14] [15] This character set was used as a secondary character set in conjunction with the primary character set, which was identical to ASCII, except for character 127, which was a medium shaded box instead of the delete character. The first 32 characters, that normally functioned as C0 control codes, also had graphical non-control alternatives, that could appear during self-test or display functions mode. Switching between character sets was done using the Shift Out and Shift In characters, or alternatively, on systems supporting 8-bit mode, using the high bit of the character. Before the name "Roman-8" was established for the 8-bit variant in 1983, this was sometimes called "8-bit Roman Extension" or "HP Roman-8 Extension". Over the years both variants were revised to include more characters. The final 1985 revision of the secondary character set was also standardized by IBM in 1989 as code page 1050 (CP1050 or ibm-1050). [16]
Although strictly speaking not part of Roman Extension, the following table shows those rows of the primary character set that differed from ASCII. Note that the first two rows are normally the same and only appear as graphical characters in special circumstances, as described above. Although some of the Unicode control pictures conventionally use three characters rather than two, those "diagonal lettering glyphs are only exemplary; alternate representations may be, and often are used in the visible display of control codes". [17]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NU | SH | SX | EX | ET | EQ | AK | 🔔︎ | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DL | D1 | D2 | D3 | D4 | NK | SY | EB | CN | EM | SB | EC | FS | GS | RS | US |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | ▒ |
The following table shows the 1982 version; a current variant is shown in the Roman-8 section below. The table assumes 8-bit mode is used; if not, subtract 128 (8016) from the character code.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
Ax | ´ | ˋ | ¨ | ˜ | ₤ [a] | |||||||||||
Bx | ‾ | ˚ | ç | Ñ | ñ | ¡ | ¿ | ¤ | £ [a] | § | ||||||
Cx | â | ê | ô | û | á | é | ó | ú | à | è | ò | ù | ä | ë | ö | ü |
Dx | Å | î | Ø | Æ | å | í | ø | æ | Ä | ì | Ö | Ü | É | ï | ß |
HP Roman-8 is an 8-bit single byte character encoding that is mainly used on HP-UX [2] and many Hewlett-Packard [7] and PCL compatible printers. The name Roman-8 appeared in 1983, [1] but a precursor of the character set was already used by the HP 250 and HP 300 workstations since 1978/1979 as 8-bit Roman Extension . [12] [13] [14] [15]
The original 1983/1984 version of Roman-8 still had some code points undefined. [5] [6] [3] [19] In a 1985 revision code points 177 (Ý), 178 (ý), 242 (·), 243 (µ), 244 (¶) and 245 (¾) were added and the appearance of code point 228 was changed from a stroked d (đ) to an eth (ð). [4] [20] [21] This final revision of the character set was also standardized as codepage 1051 by IBM in 1989. [22] [23]
In contrast to the newer HP Roman-9, HP Roman-8 does not provide a code point for the euro sign.
The following table shows the latest 1985 definition of the HP Roman-8 character set (with some remarks regarding former definitions and alternative interpretations). Each character is shown with a potential Unicode equivalent and its decimal code, however, sources differ in the recommended translations for some of the codes even among definitions from Hewlett-Packard [2] [21] and IBM. [22] [23]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | ▒ [a] |
8x | ||||||||||||||||
9x | ||||||||||||||||
Ax | NBSP | À | Â | È | Ê | Ë | Î | Ï | ´ | ˋ/` | ˆ | ¨ | ˜ | Ù | Û | ₤ [b] |
Bx | ¯/‾ | Ý | ý | °/˚ | Ç | ç | Ñ | ñ | ¡ | ¿ | ¤ | £ | ¥ | § | ƒ | ¢ |
Cx | â | ê | ô | û | á | é | ó | ú | à | è | ò | ù | ä | ë | ö | ü |
Dx | Å | î | Ø | Æ | å | í | ø | æ | Ä | ì | Ö | Ü | É | ï | ß/β | Ô |
Ex | Á | Ã | ã | Ð | ð/đ | Í | Ì | Ó | Ò | Õ | õ | Š | š | Ú | Ÿ | ÿ |
Fx | Þ | þ | · | µ/μ | ¶ | ¾ | SHY /- | ¼ | ½ | ª | º | « | ■ | » | ± |
This character set has over the years acquired a number of different names, such as:
In 1984, Hewlett-Packard introduced the HP 110 / HP Portable personal computer followed by the HP 110 Plus / HP Portable Plus in 1985. In "HP mode" they supported a derivation of the 1984 revision of 8-bit HP Roman-8 (still lacking the six additional characters at code points 177 to 178 and 242 to 245, and with code point 228 still resembling a stroked d (đ)), but with 32 additional graphical symbols at code points 128 to 159, including a rich set of box-drawing characters. [5] [6]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | ▒ |
8x | ◄ | ▲ | ▼ | ► | ╝ | ╗ | ╔ | ╚ | ╣ | ╩ | ╦ | ╠ | ═ | ║ | ╬ | ♦ |
9x | ↑ | ▀ | ▄ | ↓ | ┘ | ┐ | ┌ | └ | ┤ | ┴ | ┬ | ├ | ─ | │ | ┼ | █ |
Ax | NBSP | À | Â | È | Ê | Ë | Î | Ï | ´ | ˋ/` | ˆ | ¨ | ˜ | Ù | Û | ₤ [a] |
In 1986, [66] Hewlett-Packard introduced the HP-18C calculator and HP 82240A thermo printer, [66] which internally used an extended variant of the 1985 revision of the 8-bit HP Roman-8 character set (now with the six additional characters defined and with code point 228 already changed to an eth (ð)), but with the code points 127 (0x7F) and 160 (0xA0) as well as the control codes in the range 128 to 159 (0x80 to 0x9F) being replaced by additional displayable characters, [7] [67] some of which were derived from the HP-41C/CV/CX's FOCAL character set and others incorporated into the revised FOCAL character set used by the HP-42S calculator, although at different code points. On the HP-28 series, characters above 147 (0x93) could not be displayed on the calculator, only be printed. [67] [7] [68]
There is no official code point definition for the euro sign in this modified character set. The HP 49/50 series of calculators use a different character set [69] based on ECMA-94 / ISO 8859-1 which includes the euro symbol.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | ▒ |
8x | NBSP [69] | ÷ | × | √ | ∫ | Σ | ▶ | π | ∂ | ≤ | ≥ | ≠ | α | → | ← | µ/μ |
9x | ␊ | ° | « | » | ⊦ [a] | ₁ | ₂ | ² | ³ | ᵢ | ⱼ | ‥ | ⁱ | ʲ | ᵏ | ⁿ |
Ax | ∡/∠ | À | Â | È | Ê | Ë | Î | Ï | ´ | ˋ/` | ˆ | ¨ | ˜ | Ù | Û | ₤ [b] |
HP Roman-9 (also known as HP Roman 9, hp-roman9, roman9 or R9) is a slight modification of the 8-bit HP Roman-8 character set where the general currency sign (¤) at code point 186 (0xBA) was replaced by the euro sign (€). [70] [71] It was introduced in early 1999. [8] As of 2017, HP Roman-9 still has no known code page number assigned to it.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
Bx | ¯/‾ | Ý | ý | °/˚ | Ç | ç | Ñ | ñ | ¡ | ¿ | € | £ | ¥ | § | ƒ | ¢ |
ISO/IEC 8859-1:1998, Information technology—8-bit single-byte coded graphic character sets—Part 1: Latin alphabet No. 1, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. ISO/IEC 8859-1 encodes what it refers to as "Latin alphabet no. 1", consisting of 191 characters from the Latin script. This character-encoding scheme is used throughout the Americas, Western Europe, Oceania, and much of Africa. It is the basis for some popular 8-bit character sets and the first two blocks of characters in Unicode.
In digital printing, a page description language (PDL) is a computer language that describes the appearance of a printed page in a higher level than an actual output bitmap. An overlapping term is printer control language, which includes Hewlett-Packard's Printer Command Language (PCL). PostScript is one of the most noted page description languages. The markup language adaptation of the PDL is the page description markup language.
ISO/IEC 646 is a set of ISO/IEC standards, described as Information technology — ISO 7-bit coded character set for information interchange, and developed in cooperation with ASCII at least since 1964. Since its first edition in 1967 it has specified a 7-bit character code from which several national standards are derived.
In computing, a code page is a character encoding and as such it is a specific association of a set of printable characters and control characters with unique numbers. Typically each number represents the binary value in a single byte.
Windows-1252 or CP-1252 is a legacy single-byte character encoding that is used by default in Microsoft Windows throughout the Americas, Western Europe, Oceania, and much of Africa.
ISO/IEC 8859-11:2001, Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 2001. It is informally referred to as Latin/Thai. It is nearly identical to the national Thai standard TIS-620 (1990). The sole difference is that ISO/IEC 8859-11 allocates non-breaking space to code 0xA0, while TIS-620 leaves it undefined.
In computer programming, digraphs and trigraphs are sequences of two and three characters, respectively, that appear in source code and, according to a programming language's specification, should be treated as if they were single characters. Trigraphs have been removed from the C++ language, and will be from C as of C23, thus likely aren't used much in practice in C already, nor in any other mainstream language. In the modern world of Unicode/UTF-8 there's no need for trigraphs in language design, which were considered a burden, and neither really digraphs, that likely have very few users, at least in those languages.
ISO/IEC 8859-4:1998, Information technology — 8-bit single-byte coded graphic character sets — Part 4: Latin alphabet No. 4, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1988. It is informally referred to as Latin-4 or North European. It was designed to cover Estonian, Latvian, Lithuanian, Greenlandic, and Sámi. It has been largely superseded by ISO/IEC 8859-10 and Unicode. Microsoft has assigned code page 28594 a.k.a. Windows-28594 to ISO-8859-4 in Windows. IBM has assigned code page 914 to ISO 8859-4.
ISO/IEC 8859-5:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 5: Latin/Cyrillic alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1988. It is informally referred to as Latin/Cyrillic.
Box-drawing characters, also known as line-drawing characters, are a form of semigraphics widely used in text user interfaces to draw various geometric frames and boxes. These characters are characterized by being designed to be connected horizontally and/or vertically with adjacent characters, which requires proper alignment. Box-drawing characters therefore typically only work well with monospaced fonts.
Printer Command Language, more commonly referred to as PCL, is a page description language (PDL) developed by Hewlett-Packard as a printer protocol and has become a de facto industry standard. Originally developed for early inkjet printers in 1984, PCL has been released in varying levels for thermal, matrix, and page printers. HP-GL/2 and PJL are supported by later versions of PCL.
Windows code pages are sets of characters or code pages used in Microsoft Windows from the 1980s and 1990s. Windows code pages were gradually superseded when Unicode was implemented in Windows, although they are still supported both within Windows and other platforms, and still apply when Alt code shortcuts are used.
MIK (МИК) is an 8-bit Cyrillic code page used with DOS. It is based on the character set used in the Bulgarian Pravetz 16 IBM PC compatible system. Kermit calls this character set "BULGARIA-PC" / "bulgaria-pc". In Bulgaria, it was sometimes incorrectly referred to as code page 856. This code page is known by Star printers and FreeDOS as Code page 3021.
The HP 250 was a multiuser business computer by Hewlett-Packard running HP 250 BASIC language as its OS with access to HP's IMAGE database management. It was produced by the General Systems Division (GSD), but was a major repackaging of desktop workstation HP 9835 from the HP 9800 series which had been sold in small business configurations. The HP 9835's processor was initially used in the first HP 250s.
The National Replacement Character Set (NRCS) was a feature supported by later models of Digital's (DEC) computer terminal systems, starting with the VT200 series in 1983. NRCS allowed individual characters from one character set to be replaced by one from another set, allowing the construction of different character sets on the fly. It was used to customize the character set to different local languages, without having to change the terminal's ROM for different countries, or alternately, include many different sets in a larger ROM. Many 3rd party terminals and terminal emulators supporting VT200 codes also supported NRCS.
The HP 95LX Palmtop PC, also known as project Jaguar, is Hewlett Packard's first DOS-based pocket computer, or personal digital assistant, introduced in April 1991 in collaboration with Lotus Development Corporation. The abbreviation "LX" stood for "Lotus Expandable". The computer can be seen as successor to a series of larger portable PCs like the HP 110 and HP 110 Plus.
Extended ASCII is a repertoire of character encodings that include the original 96 ASCII character set, plus up to 128 additional characters. There is no formal definition of "extended ASCII", and even use of the term is sometimes criticized, because it can be mistakenly interpreted to mean that the American National Standards Institute (ANSI) had updated its ANSI X3.4-1986 standard to include more characters, or that the term identifies a single unambiguous encoding, neither of which is the case.
In computing, a hardware code page (HWCP) refers to a code page supported natively by a hardware device such as a display adapter or printer. The glyphs to present the characters are stored in the alphanumeric character generator's resident read-only memory and are thus not user-changeable. They are available for use by the system without having to load any font definitions into the device first. Startup messages issued by a PC's System BIOS or displayed by an operating system before initializing its own code page switching logic and font management and before switching to graphics mode are displayed in a computer's default hardware code page.
The RPL character set is an 8-bit character set and encoding used by most RPL calculators manufactured by Hewlett-Packard as well as by the HP 82240B thermal printer. It is sometimes referred to simply as "ECMA-94" in documentation, although it is for the most part a superset of ISO/IEC 8859-1 / ECMA-94 in terms of printable characters, and it differs from ISO/IEC 8859-1 by using displayable characters rather than control characters in the 0x80 to 0x9F range of code points.
In computing FOCAL character set refers to a group of 8-bit single byte character sets introduced by Hewlett-Packard since 1979. It was used in several RPN calculators supporting the FOCAL programming language, like the HP-41C/CV/CX as well as the later HP-42S, which was introduced in 1988 and produced up to 1995. As such, it is also used by SwissMicros' DM41/L, both introduced in 2015, and is implicitly supported by the DM42, introduced in 2017.
Roman-8 & Roman Extension: The Roman-8 and Roman Extension characters reflect the 1985 updates: 1. Six additional characters added to former blank positions. Ý ý · µ ¶ ¾ 2. Appearance of the lowercase đ character changed to ð.
009 - Roman Extension Set ... option 009 gives all the commonly used European characters a unique code. When this "Roman Extension" set is used in conjunction with the standard ASCII character set, the end result is a 256 character set represented by 8-bit code. The use of shift-in and shift-out characters is not required, since decimal codes 0 through 127 automatically access USASCII while codes 128 through 255 access the European characters. It is important to understand the differences between the two techniques and to know which technique is supported on a given system. The HP 250 and HP 300 support the 8-bit code technique, consequently, 2631A option 009 must be ordered to provide local language printing on these two systems. All other HP computer systems and the 264X terminals support the 7-bit code, shift-in/shift-out method.
Currency Symbols: U+20A0–U+20CF ... Lira Sign. A separate currency sign U+20A4 LIRA SIGN is encoded for compatibility with the HP Roman-8 character set, which is still widely implemented in printers. In general, U+00A3 POUND SIGN may be used for both the various currencies known as pound (or punt) and the currencies known as lira.
{{cite web}}
: CS1 maint: unfit URL (link)