Portable character set

Last updated

Portable Character Set is a set of 103 characters which, according to the POSIX standard, must be present in any character set. Compared to ASCII, the Portable Character Set lacks some control characters, and does not prescribe any particular value encoding. [1] [2] The Portable Character Set is a superset of the Basic Execution Character Set as defined by ANSI C. [3]

nameglyph C string Unicode Unicode name
NUL \0U+0000NULL (NUL)
alert \aU+0007ALERT (BEL)
backspace \bU+0008BACKSPACE (BS)
tab \tU+0009CHARACTER TABULATION (HT)
newline \nU+000ALINE FEED (LF)
vertical-tab \vU+000BLINE TABULATION (VT)
form-feed \fU+000CFORM FEED (FF)
carriage-return \rU+000DCARRIAGE RETURN (CR)
space  U+0020SPACE
exclamation-mark!!U+0021EXCLAMATION MARK
quotation-mark"\"U+0022QUOTATION MARK
number-sign##U+0023NUMBER SIGN
dollar-sign$$U+0024DOLLAR SIGN
percent-sign%%U+0025PERCENT SIGN
ampersand&&U+0026AMPERSAND
apostrophe'\'U+0027APOSTROPHE
left-parenthesis((U+0028LEFT PARENTHESIS
right-parenthesis))U+0029RIGHT PARENTHESIS
asterisk**U+002AASTERISK
plus-sign++U+002BPLUS SIGN
comma,,U+002CCOMMA
hyphen--U+002DHYPHEN-MINUS
period..U+002EFULL STOP
slash//U+002FSOLIDUS
zero00U+0030DIGIT ZERO
one11U+0031DIGIT ONE
two22U+0032DIGIT TWO
three33U+0033DIGIT THREE
four44U+0034DIGIT FOUR
five55U+0035DIGIT FIVE
six66U+0036DIGIT SIX
seven77U+0037DIGIT SEVEN
eight88U+0038DIGIT EIGHT
nine99U+0039DIGIT NINE
colon::U+003ACOLON
semicolon;;U+003BSEMICOLON
less-than-sign<<U+003CLESS-THAN SIGN
equals-sign==U+003DEQUALS SIGN
greater-than-sign>>U+003EGREATER-THAN SIGN
question-mark??U+003FQUESTION MARK
commercial-at@@U+0040COMMERCIAL AT
AAAU+0041LATIN CAPITAL LETTER A
BBBU+0042LATIN CAPITAL LETTER B
CCCU+0043LATIN CAPITAL LETTER C
DDDU+0044LATIN CAPITAL LETTER D
EEEU+0045LATIN CAPITAL LETTER E
FFFU+0046LATIN CAPITAL LETTER F
GGGU+0047LATIN CAPITAL LETTER G
HHHU+0048LATIN CAPITAL LETTER H
IIIU+0049LATIN CAPITAL LETTER I
JJJU+004ALATIN CAPITAL LETTER J
KKKU+004BLATIN CAPITAL LETTER K
LLLU+004CLATIN CAPITAL LETTER L
MMMU+004DLATIN CAPITAL LETTER M
NNNU+004ELATIN CAPITAL LETTER N
OOOU+004FLATIN CAPITAL LETTER O
PPPU+0050LATIN CAPITAL LETTER P
QQQU+0051LATIN CAPITAL LETTER Q
RRRU+0052LATIN CAPITAL LETTER R
SSSU+0053LATIN CAPITAL LETTER S
TTTU+0054LATIN CAPITAL LETTER T
UUUU+0055LATIN CAPITAL LETTER U
VVVU+0056LATIN CAPITAL LETTER V
WWWU+0057LATIN CAPITAL LETTER W
XXXU+0058LATIN CAPITAL LETTER X
YYYU+0059LATIN CAPITAL LETTER Y
ZZZU+005ALATIN CAPITAL LETTER Z
left-square-bracket[[U+005BLEFT SQUARE BRACKET
backslash\\\U+005CREVERSE SOLIDUS
right-square-bracket]]U+005DRIGHT SQUARE BRACKET
circumflex^^U+005ECIRCUMFLEX ACCENT
underscore__U+005FLOW LINE
grave-accent``U+0060GRAVE ACCENT
aaaU+0061LATIN SMALL LETTER A
bbbU+0062LATIN SMALL LETTER B
cccU+0063LATIN SMALL LETTER C
dddU+0064LATIN SMALL LETTER D
eeeU+0065LATIN SMALL LETTER E
fffU+0066LATIN SMALL LETTER F
gggU+0067LATIN SMALL LETTER G
hhhU+0068LATIN SMALL LETTER H
iiiU+0069LATIN SMALL LETTER I
jjjU+006ALATIN SMALL LETTER J
kkkU+006BLATIN SMALL LETTER K
lllU+006CLATIN SMALL LETTER L
mmmU+006DLATIN SMALL LETTER M
nnnU+006ELATIN SMALL LETTER N
oooU+006FLATIN SMALL LETTER O
pppU+0070LATIN SMALL LETTER P
qqqU+0071LATIN SMALL LETTER Q
rrrU+0072LATIN SMALL LETTER R
sssU+0073LATIN SMALL LETTER S
tttU+0074LATIN SMALL LETTER T
uuuU+0075LATIN SMALL LETTER U
vvvU+0076LATIN SMALL LETTER V
wwwU+0077LATIN SMALL LETTER W
xxxU+0078LATIN SMALL LETTER X
yyyU+0079LATIN SMALL LETTER Y
zzzU+007ALATIN SMALL LETTER Z
left-brace{{U+007BLEFT CURLY BRACKET
vertical-lineU+007CVERTICAL LINE
right-brace}}U+007DRIGHT CURLY BRACKET
tilde~~U+007ETILDE

Related Research Articles

Fortran General-purpose programming language

Fortran is a general-purpose, compiled imperative programming language that is especially suited to numeric computation and scientific computing.

Hexadecimal Base 16 numerical system

In mathematics and computing, hexadecimal is a positional system that represents numbers using a base of 16. Unlike the common way of representing numbers with ten symbols, it uses sixteen distinct symbols, most often the symbols "0"–"9" to represent values zero to nine, and "A"–"F" to represent values ten to fifteen.

The Portable Operating System Interface (POSIX) is a family of standards specified by the IEEE Computer Society for maintaining compatibility between operating systems. POSIX defines the application programming interface (API), along with command line shells and utility interfaces, for software compatibility with variants of Unix and other operating systems.

Regular expression Sequence of characters that forms a search pattern

A regular expression is a sequence of characters that define a search pattern. Usually such patterns are used by string-searching algorithms for "find" or "find and replace" operations on strings, or for input validation. It is a technique developed in theoretical computer science and formal language theory.

The Rich Text Format is a proprietary document file format with published specification developed by Microsoft Corporation from 1987 until 2008 for cross-platform document interchange with Microsoft products. Prior to 2008, Microsoft published updated specifications for RTF with major revisions of Microsoft Word and Office versions.

The Single UNIX Specification (SUS) is the collective name of a family of standards for computer operating systems, compliance with which is required to qualify for using the "UNIX" trademark. The core specifications of the SUS are developed and maintained by the Austin Group, which is a joint working group of IEEE, ISO JTC 1 SC22 and The Open Group. If an operating system is submitted to The Open Group for certification, and passes conformance tests, then it is deemed to be compliant with a UNIX standard such as UNIX 98 or UNIX 03.

Yacc is a computer program for the Unix operating system developed by Stephen C. Johnson. It is a Look Ahead Left-to-Right (LALR) parser generator, generating a parser, the part of a compiler that tries to make syntactic sense of the source code, specifically a LALR parser, based on an analytic grammar written in a notation similar to Backus–Naur Form (BNF). Yacc is supplied as a standard utility on BSD and AT&T Unix. GNU-based Linux distributions include Bison, a forward-compatible Yacc replacement.

Lex is a computer program that generates lexical analyzers.

In computing, tar is a computer software utility for collecting many files into one archive file, often referred to as a tarball, for distribution or backup purposes. The name is derived from "tape archive", as it was originally developed to write data to sequential I/O devices with no file system of their own. The archive data sets created by tar contain various file system parameters, such as name, timestamps, ownership, file access permissions, and directory organization. The command line utility was first introduced in the Version 7 Unix in January 1979, replacing the tp program. The file structure to store this information was standardized in POSIX.1-1988 and later POSIX.1-2001, and became a format supported by most modern file archiving systems.

The FreeBSD Documentation License is the license that covers most of the documentation for the FreeBSD operating system.

pwd command used to output the path of the current working directory

In Unix-like and some other operating systems, the pwd command writes the full pathname of the current working directory to the standard output.

An open standard is a standard that is publicly available and has various rights to use associated with it and may also have various properties of how it was designed. There is no single definition, and interpretations vary with usage.

basename is a standard computer program on Unix and Unix-like operating systems. When basename is given a pathname, it will delete any prefix up to the last slash ('/') character and return the result. basename is described in the Single UNIX Specification and is primarily used in shell scripts.

Portable computer self-contained computer that is designed to be moved from one place to another

A portable computer is a computer designed to be easily moved from one place to another and included a display and keyboard. The first commercially sold portable was the 50-pound (23 kg) IBM 5100, introduced 1975. The next major portables were Osborne's 24-pound (11 kg) CP/M-based Osborne 1 (1981) and Compaq's 28-pound (13 kg) 100% IBM PC compatible Compaq Portable (1983). These "luggable" computers lacked the next technological development, not requiring an external power source; that feature was introduced by the laptop. Laptops were followed by lighter models, so that in the 2000s mobile devices and by 2007 smartphones made the term almost meaningless. The 2010s introduced wearable computers such as smartwatches.

In Unix and related computer operating systems, a file descriptor is an abstract indicator (handle) used to access a file or other input/output resource, such as a pipe or network socket. File descriptors form part of the POSIX application programming interface. A file descriptor is a non-negative integer, generally represented in the C programming language as the type int.

pax (command) archiving utility created by POSIX

pax is an archiving utility created by POSIX, defined since 1995. Rather than sort out the incompatible options that have crept up between tar and cpio, along with their implementations across various versions of Unix, the IEEE designed a new archive utility that could support various archive formats with useful options from both archivers. The pax command is available on Unix and Unix-like operating systems and on Microsoft Windows NT and Windows 2000.

The Austin Group or the Austin Common Standards Revision Group is a joint technical working group formed to develop and maintain a common revision of POSIX.1 and parts of the Single UNIX Specification.

OpenCL Open standard for programming heterogenous computing systems, such as CPUs or GPUs

OpenCL is a framework for writing programs that execute across heterogeneous platforms consisting of central processing units (CPUs), graphics processing units (GPUs), digital signal processors (DSPs), field-programmable gate arrays (FPGAs) and other processors or hardware accelerators. OpenCL specifies programming languages for programming these devices and application programming interfaces (APIs) to control the platform and execute programs on the compute devices. OpenCL provides a standard interface for parallel computing using task- and data-based parallelism.

In computing, mkstemp is a POSIX function for creating a temporary file. It accepts an argument that determines the location of the temporary file, and the prefix of its generated filename. After mkstemp was added to the Single UNIX Specification, the function tempnam was deprecated, because the latter carried the risk that a temporary file with the same name could be created by another thread or process within the time from when the caller obtains the temporary filename and attempts to create it. mkstemp does not suffer from this problem.

IEEE 802.11s is Wireless LAN standard and an IEEE 802.11 amendment for mesh networking, defining how wireless devices can interconnect to create a WLAN mesh network, which may be used for relatively fixed topologies and wireless ad hoc networks. The IEEE 802.11s working group draws upon volunteers from university and industry to provide specifications and possible design solutions for wireless mesh networking. As a standard, the document was iterated and revised many times prior to finalization.

References

  1. "The Open Group Base Specifications Issue 7, 2018 edition". IEEE and The Open Group . 2018. Retrieved 2018-03-21.
  2. "The Open Group Base Specifications Issue 6". IEEE and The Open Group . 2004. Retrieved 18 August 2014.
  3. "Working draft ISO/IEC 9899:202x, Information technology Programming languages C, § 5.2.1" (PDF). International Organization for Standardization . 2018. Retrieved 2020-08-03.