EBCDIC 892

Last updated

IBM code page 892 (CCSID 892) is an EBCDIC code page used in IBM mainframes for optical character recognition. [1]

Codepage layout

Characters are shown with their equivalent Unicode codes.

Host OCR-A [2] [3]
_0_1_2_3_4_5_6_7_8_9_A_B_C_D_E_F
4_ SP
0020

 

 

 

 

 

 

 

 

 
[
005B
.
002E
<
003C
(
0028
+
002B
!
0021
5_ &
0026

 

 

 

 

 

 

 

 

 
]
005D
$
0024
*
002A
)
0029
;
003B

 
6_ -
002D
/
002F

 
Ä
00C4

 

 

 
Å
00C5

 
Ñ
00D1

 
,
002C
%
0025

 
>
003E
?
003F
7_
 

 

 

 

 

 

 
[a]
 

 

 
:
003A
#
0023
@
0040
'
0027
=
003D
"
0022
8_ Ø
00D8
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069

 

 

 

 

 

 
9_
 
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
p
0070
q
0071
r
0072

 

 

 

 
Æ
00C6

 
A_
 

 
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A

 

 

 

 

 

 
B_
 
£
00A3
¥
00A5

 

 

 

 

 

 

 

 
|
007C

 

 

 
[b]
 
C_ {
007B
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049

 

2441

 

 

 

 
D_ }
007D
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
P
0050
Q
0051
R
0052

 

2440

 

 

 
^
005E
E_ \
005C

 
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A

 

2442
Ö
00D6

 

 

 
F_ 0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039

 

 
Ü
00DC

 

 

 

  Letter  Number  Punctuation  Symbol   Other   Undefined

Characters not in Unicode: [3] [4]

Related Research Articles

ISO/IEC 8859-11:2001, Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 2001. It is informally referred to as Latin/Thai. It is nearly identical to the national Thai standard TIS-620 (1990). The sole difference is that ISO/IEC 8859-11 allocates non-breaking space to code 0xA0, while TIS-620 leaves it undefined.

ISO/IEC 8859-7:2003, Information technology — 8-bit single-byte coded graphic character sets — Part 7: Latin/Greek alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. It is informally referred to as Latin/Greek. It was designed to cover the modern Greek language. The original 1987 version of the standard had the same character assignments as the Greek national standard ELOT 928, published in 1986. The table in this article shows the updated 2003 version which adds three characters. Microsoft has assigned code page 28597 a.k.a. Windows-28597 to ISO-8859-7 in Windows. IBM has assigned code page 813 to ISO 8859-7. (IBM CCSID 813 is the original encoding. CCSID 4909 adds the euro sign. CCSID 9005 further adds the drachma sign and ypogegrammeni.)

Windows-1258 is a code page used in Microsoft Windows to represent Vietnamese texts. It makes use of combining diacritical marks.

Mac OS Icelandic is a character encoding used in Apple Macintosh computers to represent Icelandic text. It is largely identical to Mac OS Roman, except for the Icelandic special characters Ý, Þ and Ð which have replaced typography characters.

Code page 1047 is an EBCDIC code page with the full Latin-1 character set. It is closely related to both EBCDIC 037-2 and EBCDIC 037, both of which also encode Latin-1.

IBM code page 500 is an EBCDIC code page with full Latin-1-charset support used in IBM mainframes.

Code page 864 is a code page used to write Arabic in Egypt, Iraq, Jordan, Saudi Arabia, and Syria.

IBM code page 273 is an EBCDIC code page with the full Latin-1 character set used in IBM mainframes in Austria and Germany.

Mac OS Romanian is a character encoding used on Apple Macintosh computers to represent the Romanian language. It is a derivative of Mac OS Roman.

Mac OS Croatian is a character encoding used on Apple Macintosh computers to represent Gaj's Latin alphabet. It is a derivative of Mac OS Roman.

Code page 949 (IBM) IBM/AIX character encoding for Korean

IBM code page 949 (IBM-949) is IBM's PC Data KS code. It implements EUC-KR in addition to encodings for IBM extensions including user defined characters. This code page supports the Korean language. It is a combination of the single-byte Code page 1088 and the double-byte Code page 951.

IBM code page 277 is an EBCDIC code page with the full Latin-1 character set used in IBM mainframes in Denmark and Norway.

IBM code page 278 is an EBCDIC code page with full Latin-1-charset used in IBM mainframes in Finland and Sweden.

IBM code page 297 is an EBCDIC code page with full Latin-1-charset used in IBM mainframes in France.

IBM code page 1026 is an EBCDIC code page with full Latin-5-charset used in IBM mainframes in Turkey.

Code page 896, called Japan 7-Bit Katakana Extended, is IBM's code page for code-set G2 of EUC-JP, a 7-bit code page representing the Kana set of JIS X 0201 and accompanying Code page 895 which corresponds to the lower half of that standard. It encodes half-width katakana.

Code page 921 is a code page used under IBM AIX and DOS to write the Estonian, Latvian, and Lithuanian languages. It is an extension of ISO/IEC 8859-13.

IBM code page 420 is an EBCDIC code page with support for Arabic script and the Latin alphabet. It is used in IBM mainframes.

IBM code page 918 is an EBCDIC code page used on IBM mainframes in Pakistan to support Urdu.

IBM code page 893 is an EBCDIC code page used in IBM mainframes for optical character recognition.

References

  1. "CCSID 892 information document". Archived from the original on 2016-03-26.
  2. Code Page CPGID 00892 (pdf) (PDF), IBM
  3. 1 2 Code Page CPGID 00892 (txt), IBM
  4. Alphanumeric character sets for optical recognition - Part I: Character set OCR-A - Shapes and dimensions of the printed image (preview) (PDF). pp. 2–3.