Nibble

Last updated

An octet code page 866 font table ordered by nibbles. Octets in CP866 ordered by nibbles.png
An octet code page 866 font table ordered by nibbles.

In computing, a nibble [1] (occasionally nybble, nyble, or nybl to match the spelling of byte) is a four-bit aggregation, [1] [2] [3] or half an octet. It is also known as half-byte [4] or tetrade. [5] [6] In a networking or telecommunication context, the nibble is often called a semi-octet, [7] quadbit, [8] or quartet. [9] [10] A nibble has sixteen (24) possible values. A nibble can be represented by a single hexadecimal digit (0F) and called a hex digit. [11]

Contents

A full byte (octet) is represented by two hexadecimal digits (00FF); therefore, it is common to display a byte of information as two nibbles. Sometimes the set of all 256-byte values is represented as a 16×16 table, which gives easily readable hexadecimal codes for each value.

Four-bit computer architectures use groups of four bits as their fundamental unit. Such architectures were used in early microprocessors, pocket calculators and pocket computers. They continue to be used in some microcontrollers. In this context, 4-bit groups were sometimes also called characters [12] rather than nibbles. [1]

History

The term nibble originates from its representing "half a byte", with byte a homophone of the English word bite. [4] In 2014, David B. Benson, a professor emeritus at Washington State University, remembered that he playfully used (and may have possibly coined) the term nibble as "half a byte" and unit of storage required to hold a binary-coded decimal (BCD) digit around 1958, when talking to a programmer from Los Alamos Scientific Laboratory. The alternative spelling nybble reflects the spelling of byte, as noted in editorials of Kilobaud and Byte in the early 1980s. Another early recorded use of the term nybble was in 1977 within the consumer-banking technology group at Citibank. It created a pre-ISO 8583 standard for transactional messages between cash machines and Citibank's data centers that used the basic data unit 'nabble'.

Nibble is used to describe the amount of memory used to store a digit of a number stored in packed decimal format (BCD) within an IBM mainframe. This technique is used to make computations faster and debugging easier. An 8-bit byte is split in half and each nibble is used to store one decimal digit. The last (rightmost) nibble of the variable is reserved for the sign. Thus a variable which can store up to nine digits would be "packed" into 5 bytes. Ease of debugging resulted from the numbers’ being readable in a hex dump where two hex numbers are used to represent the value of a byte, as 16×16 = 28. For example, a five-byte BCD value of 31 41 59 26 5C represents a decimal value of +314159265.

Historically, there are cases where nybble was used for a group of bits greater than 4. In the Apple II microcomputer line, much of the disk drive control and group-coded recording was implemented in software. Writing data to a disk was done by converting 256-byte pages into sets of 5-bit (later, 6-bit) nibbles and loading disk data required the reverse. [13] [14] [15] Moreover, 1982 documentation for the Integrated Woz Machine refers consistently to an "8 bit nibble". [16] The term byte once had the same ambiguity and meant a set of bits but not necessarily 8, hence the distinction of bytes and octets or of nibbles and quartets (or quadbits). Today, the terms byte and nibble almost always refer to 8-bit and 4-bit collections respectively and are very rarely used to express any other sizes.

Binary and hexadecimal representation

A nibble can be represented in binary and hexadecimal as follows:

Binary vs Decimal vs Hexadecimal
BinaryDecimalHexadecimal
000000
000111
001022
001133
010044
010155
011066
011177
100088
100199
101010A
101111B
110012C
110113D
111014E
111115F

Low and high nibbles

The terms low nibble and high nibble are used to denote the nibbles containing, respectively, the less significant bits and the more significant bits within a byte. In graphical representations of bits within a byte, the leftmost bit could represent the most significant bit (MSB), corresponding to ordinary decimal notation in which the digit at the left of a number is the most significant. In such illustrations the four bits on the left end of the byte form the high nibble, and the remaining four bits form the low nibble. [17] For example,

ninety-seven = 9710 = (0110 0001)2 = 61hex

the high nibble is 01102 (6hex), and the low nibble is 00012 (1hex). The total value is high-nibble × 1610 + low-nibble (6 × 16 + 1 = 9710).

Extracting a nibble from a byte

A nibble can be extracted from a byte by doing a bitwise logical AND operation and optionally a bit shift depending on if the high or low nibble is to be extracted.

In C:

#define HI_NIBBLE(b) (((b) >> 4) & 0x0F)#define LO_NIBBLE(b) ((b) & 0x0F)

where b must be a variable or constant of an integral data type, and only the least-significant byte of b is used.

For example, HI_NIBBLE(0xAB)==0xA and LO_NIBBLE(0xAB)==0xB.

In Common Lisp:

(defunhi-nibble(b)(ldb(byte44)b))(defunlo-nibble(b)(ldb(byte40)b))

See also

Related Research Articles

The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable unit of memory in many computer architectures. To disambiguate arbitrarily sized bytes from the common 8-bit definition, network protocol documents such as the Internet Protocol refer to an 8-bit byte as an octet. Those bits in an octet are usually counted with numbering from 0 to 7 or 7 to 0 depending on the bit endianness.

<span class="mw-page-title-main">Binary-coded decimal</span> System of digitally encoding numbers

In computing and electronic systems, binary-coded decimal (BCD) is a class of binary encodings of decimal numbers where each digit is represented by a fixed number of bits, usually four or eight. Sometimes, special bit patterns are used for a sign or other indications.

<span class="mw-page-title-main">Character encoding</span> Using numbers to represent text characters

Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that make up a character encoding are known as "code points" and collectively comprise a "code space", a "code page", or a "character map".

In mathematics and computing, the hexadecimal numeral system is a positional numeral system that represents numbers using a radix (base) of sixteen. Unlike the decimal system representing numbers using ten symbols, hexadecimal uses sixteen distinct symbols, most often the symbols "0"–"9" to represent values 0 to 9, and "A"–"F" to represent values from ten to fifteen.

In computer science, an integer is a datum of integral data type, a data type that represents some range of mathematical integers. Integral data types may be of different sizes and may or may not be allowed to contain negative values. Integers are commonly represented in a computer as a group of binary digits (bits). The size of the grouping varies so the set of integer sizes available varies between different types of computers. Computer hardware nearly always provides a way to represent a processor register or memory address as an integer.

A computer number format is the internal representation of numeric values in digital device hardware and software, such as in programmable computers and calculators. Numerical values are stored as groupings of bits, such as bytes and words. The encoding between numerical values and bit patterns is chosen for convenience of the operation of the computer; the encoding used by the computer's instruction set generally requires conversion for external use, such as for printing and display. Different types of processors may have different internal representations of numerical values and different conventions are used for integer and real numbers. Most calculations are carried out with number formats that fit into a processor register, but some software systems allow representation of arbitrarily large numbers using multiple words of memory.

Dot-decimal notation is a presentation format for numerical data. It consists of a string of decimal numbers, using the full stop (dot) as a separation character.

4-bit computing is the use of computer architectures in which integers and other data units are 4 bits wide. 4-bit central processing unit (CPU) and arithmetic logic unit (ALU) architectures are those that are based on registers or data buses of that size. A group of four bits is also called a nibble and has 24 = 16 possible values.

An organizationally unique identifier (OUI) is a 24-bit number that uniquely identifies a vendor, manufacturer, or other organization.

A hex editor is a computer program that allows for manipulation of the fundamental binary data that constitutes a computer file. The name 'hex' comes from 'hexadecimal', a standard numerical format for representing binary data. A typical computer file occupies multiple areas on the storage medium, whose contents are combined to form the file. Hex editors that are designed to parse and edit sector data from the physical segments of floppy or hard disks are sometimes called sector editors or disk editors.

The Intel BCD opcodes are a set of six x86 instructions that operate with binary-coded decimal numbers. The radix used for the representation of numbers in the x86 processors is 2. This is called a binary numeral system. However, the x86 processors do have limited support for the decimal numeral system.

Intel hexadecimal object file format, Intel hex format or Intellec Hex is a file format that conveys binary information in ASCII text form, making it possible to store on non-binary media such as paper tape, punch cards, etc., to display on text terminals or be printed on line-oriented printers. The format is commonly used for programming microcontrollers, EPROMs, and other types of programmable logic devices and hardware emulators. In a typical application, a compiler or assembler converts a program's source code to machine code and outputs it into a object or executable file in hexadecimal format. In some applications, the Intel hex format is also used as a container format holding packets of stream data. Common file extensions used for the resulting files are .HEX or .H86. The HEX file is then read by a programmer to write the machine code into a PROM or is transferred to the target system for loading and execution. There are various tools to convert files between hexadecimal and binary format, and vice versa.

The octet is a unit of digital information in computing and telecommunications that consists of eight bits. The term is often used when the term byte might be ambiguous, as the byte has historically been used for storage units of a variety of sizes.

<span class="mw-page-title-main">HP Saturn</span> Family of 4-bit datapath microprocessors

The Saturn family of 4-bit (datapath) microprocessors was developed by Hewlett-Packard in the 1980s first for the HP-71B handheld computer and then later for various HP calculators. It succeeded the Nut family of processors used in earlier calculators. The original Saturn chip was first used in the HP-71B hand-held BASIC-programmable computer, introduced in 1984. Later models of the family powered the popular HP 48 series of calculators. The HP48SX and HP48S were the last models to use genuine Saturn processors manufactured by HP. Later calculator models used Saturn processors manufactured by NEC. The HP 49 series initially used the Saturn CPU as well, until the NEC fab could no longer manufacture the processor for technical reasons in 2003. Therefore, starting with the HP 49g+ model in 2003, the calculators switched to a Samsung S3C2410 processor with an ARM920T core which ran an emulator of the Saturn hardware in software. In 2000, the HP 39G and HP 40G were the last calculators introduced based on the actual NEC fabricated Saturn hardware. The last calculators based on the Saturn emulator were the HP 39gs, HP 40gs and HP 50g in 2006, as well as the 2007 revision of the hp 48gII. The HP 50g, the last calculator utilizing this emulator, was discontinued in 2015 when Samsung stopped producing the ARM processor on which it was based.

<span class="mw-page-title-main">SREC (file format)</span> File format developed by Motorola

Motorola S-record is a file format, created by Motorola in the mid-1970s, that conveys binary information as hex values in ASCII text form. This file format may also be known as SRECORD, SREC, S19, S28, S37. It is commonly used for programming flash memory in microcontrollers, EPROMs, EEPROMs, and other types of programmable logic devices. In a typical application, a compiler or assembler converts a program's source code to machine code and outputs it into a HEX file. The HEX file is then imported by a programmer to "burn" the machine code into non-volatile memory, or is transferred to the target system for loading and execution.

In digital computing and telecommunications, a unit of information is the capacity of some standard data storage system or communication channel, used to measure the capacities of other systems and channels. In information theory, units of information are also used to measure information contained in messages and the entropy of random variables.

Tektronix hex format and Extended Tektronix hex format / Extended Tektronix Object Format are ASCII-based hexadecimal file formats, created by Tektronix, for conveying binary information for applications like programming microcontrollers, EPROMs, and other kinds of chips.

BCD, also called alphanumeric BCD, alphameric BCD, BCD Interchange Code, or BCDIC, is a family of representations of numerals, uppercase Latin letters, and some special and control characters as six-bit character codes.

A half-carry flag is a condition flag bit in the status register of many CPU families, such as the Intel 8080, Zilog Z80, the x86, and the Atmel AVR series, among others. It indicates when a carry or borrow has been generated out of the least significant four bits of the accumulator register following the execution of an arithmetic instruction. It is primarily used in decimal (BCD) arithmetic instructions.

In computing, a hextet, or a chomp, is a sixteen-bit aggregation, or four nibbles. As a nibble typically is notated in hexadecimal format, a hextet consists of 4 hexadecimal digits. A hextet is the unofficial name for each of the 8 blocks in an IPv6 address.

References

  1. 1 2 3 Raphael, Howard A., ed. (November 1974). "The Functions Of A Computer: Instruction Register And Decoder" (PDF). MCS-40 User's Manual For Logic Designers. Santa Clara, California, USA: Intel Corporation. p. viii. Archived (PDF) from the original on 2020-03-03. Retrieved 2020-03-03. [...] The characteristic eight bit field is sometimes referred to as a byte, a four bit field can be referred to as a nibble. [...]
  2. Hall, Douglas V. (1980). Microprocessors and Digital Systems. McGraw-Hill. ISBN   0-07-025571-7.
  3. Warren Jr., Henry S. (2013) [2002]. Hacker's Delight (2 ed.). Addison WesleyPearson Education, Inc. ISBN   978-0-321-84268-8. 0-321-84268-5.
  4. 1 2 Raymond, Eric S. (1996). The New Hacker's Dictionary. MIT Press. p. 333. ISBN   978-0-262-68092-9.
  5. Carr, John W. Introduction to the use of digital computers: Notes from the Summer Conference Held at the Computation Center of the University of North Carolina, Chapel Hill, N.C., August 17-28, 1959. Frontier Research on Digital Computers. Vol. 1. University of North Carolina at Chapel Hill, Computation Center. p. 211. Each of these letters corresponds to one of the integers from zero to fifteen, therefore requiring 4 bits (one "tetrade") in binary representation.
  6. Speiser, Ambrosius Paul (1965) [1961]. Digitale Rechenanlagen – Grundlagen / Schaltungstechnik / Arbeitsweise / Betriebssicherheit[Digital computers – Basics / Circuits / Operation / Reliability] (in German) (2 ed.). ETH Zürich, Zürich, Switzerland: Springer-Verlag / IBM. pp. 6, 34, 165, 183, 208, 213, 215. LCCN   65-14624. 0978.
  7. Puzman, Josef; Kubin, Boris (2012). Public Data Networks: From Separate PDNs to the ISDN. Springer Science+Business Media. p. 113. ISBN   978-1-4471-1737-7.
  8. Horak, Ray (2007). Webster's New World Telecom Dictionary. John Wiley & Sons. p. 402. ISBN   978-0-470-22571-4.
  9. Brewster, Ronald L. (1994). Data Communications and Networks, Vol. III. IEE telecommunications series. Vol. 31. Institution of Electrical Engineers. p. 155. ISBN   978-0-85296-804-8. A data symbol represents one quartet (4 bits) of binary data.
  10. Courbis, Paul; Lalande, Sébastien (2006-06-27) [1989]. Voyage au centre de la HP28c/s (in French) (2 ed.). Paris, France: Editions de la Règle à Calcul. OCLC   636072913. Archived from the original on 2016-08-06. Retrieved 2015-09-06.
  11. Heller, Steve (1997). Introduction to C++. Morgan Kaufmann. p. 27. ISBN   978-0-12-339099-8. Each hex digit (0–f) represents exactly 4 bits.
  12. "Terms And Abbreviations" (PDF). MCS-4 Assembly Language Programming Manual – The INTELLEC 4 Microcomputer System Programming Manual (Preliminary ed.). Santa Clara, California, USA: Intel Corporation. December 1973. pp. v, 2-6. MCS-030-1273-1. Archived (PDF) from the original on 2020-03-01. Retrieved 2020-03-02. [...] Bit – The smallest unit of information which can be represented. (A bit may be in one of two states I 0 or 1). [...] Byte – A group of 8 contiguous bits occupying a single memory location. [...] Character – A group of 4 contiguous bits of data. [...] (NB. This Intel 4004 manual uses the term character referring to 4-bit rather than 8-bit data entities. Intel switched to use the more common term nibble for 4-bit entities in their documentation for the succeeding processor 4040 in 1974 already.)
  13. Worth, Don D.; Lechner, Pieter M. (May 1982) [1981]. Beneath Apple DOS (4th printing, 1st ed.). Reseda, California, USA: Quality Software . Retrieved 2017-03-21. Archived 9 March 2016 at the Wayback Machine
  14. Worth, Don D.; Lechner, Pieter M. (March 1985) [1984]. Beneath Apple ProDOS – For Users of Apple II Plus, Apple IIe and Apple IIc Computers (PDF) (2nd printing, 1st ed.). Chatsworth, California, USA: Quality Software. ISBN   0-912985-05-4. LCCN   84-61383. Archived (PDF) from the original on 2017-03-21. Retrieved 2017-03-21.
  15. Copy II Plus Version 9 – ProDOS/DOS Utilities – Data Recovery, File Management, Protected Software Backup (PDF). 9.0. Central Point Software, Inc. 1989-10-31 [1982]. Archived from the original (PDF) on 2017-05-07. Retrieved 2017-03-21.
  16. Apple Computer, Inc. (February 1982) [1978]. Integrated Woz Machine (IWM) Specification (PDF) (19 ed.). DigiBarn Computer Museum. Archived (PDF) from the original on 2016-08-06. Retrieved 2016-08-06.
  17. Baccala, Brent (April 1997). "Binary arithmetic". Connected: An Internet Encyclopedia (3rd ed.). Archived from the original on 2016-08-06. Retrieved 2015-07-20.