Sort (Unix)

sort
	The sort command
Original author(s)	Ken Thompson (AT&T Bell Laboratories)
Developer(s)	Various open-source and commercial developers
Initial release	November 3, 1971;53 years ago
Written in	C
Operating system	Multics, Unix, Unix-like, V, Plan 9, Inferno, MSX-DOS, IBM i
Platform	Cross-platform
Type	Command
License	coreutils: GPLv3+ ; Plan 9: MIT License

Last updated April 30, 2025

In computing, sort is a standard command line program of Unix and Unix-like operating systems, that prints the lines of its input or concatenation of all files listed in its argument list in sorted order. Sorting is done based on one or more sort keys extracted from each line of input. By default, the entire input is taken as sort key. Blank space is the default field separator. The command supports a number of command-line options that can vary by implementation. For instance the "-r" flag will reverse the sort order. Sort ordering is affected by the environment's locale settings.^[1]

History

A sort command that invokes a general sort facility was first implemented within Multics.^[2] Later, it appeared in Version 1 Unix. This version was originally written by Ken Thompson at AT&T Bell Laboratories. By Version 4 Thompson had modified it to use pipes, but sort retained an option to name the output file because it was used to sort a file in place. In Version 5, Thompson invented "-" to represent standard input.^[3]

The version of sort bundled in GNU coreutils was written by Mike Haertel and Paul Eggert.^[1] This implementation employs the merge sort algorithm.

Similar commands are available on many other operating systems, for example a sort command is part of ASCII's MSX-DOS2 Tools for MSX-DOS version 2.^[4]

The sort command has also been ported to the IBM i operating system.^[5]

Syntax

sort [OPTION]... [FILE]...

With no FILE, or when FILE is -, the command reads from standard input.

Parameters

Name	Description	Unix	Plan 9	Inferno	FreeBSD	Linux	MSX-DOS	IBM i
-b, --ignore-leading-blanks	Ignores leading blanks.	Yes	Yes	No	Yes	Yes	No	Yes
-c	Check that input file is sorted.	No	Yes	No	Yes	Yes	No	Yes
-C	Like -c, but does not report the first bad line.	No	No	No	Yes	Yes	No	No
-d, --dictionary-order	Considers only blanks and alphanumeric characters.	Yes	Yes	No	Yes	Yes	No	Yes
-f, --ignore-case	Fold lower case to upper case characters.	Yes	Yes	No	Yes	Yes	No	Yes
-g, --general-numeric-sort, --sort=general-numeric	Compares according to general numerical value.	Yes	Yes	No	Yes	Yes	No	No
-h, --human-numeric-sort, --sort=human-numeric	Compare human readable numbers (e.g., 2K 1G).	Yes	No	No	Yes	Yes	No	No
-i, --ignore-nonprinting	Considers only printable characters.	Yes	Yes	No	Yes	Yes	No	Yes
-k, --key=POS1[,POS2]	Start a key at POS1 (origin 1), end it at POS2 (default end of line)	No	No	No	Yes	Yes	No	No
-m	Merge only; input files are assumed to be presorted.	No	Yes	No	Yes	Yes	No	Yes
-M, --month-sort, --sort=month	Compares (unknown) < 'JAN' < ... < 'DEC'.	Yes	Yes	No	Yes	Yes	No	No
-n, --numeric-sort, --sort=numeric	Compares according to string numerical value.	Yes	Yes	Yes	Yes	Yes	No	Yes
-oOUTPUT	Uses OUTPUT file instead of standard output.	No	Yes	No	Yes	Yes	No	Yes
-r, --reverse	Reverses the result of comparisons.	Yes	Yes	Yes	Yes	Yes	No	Yes
-R, --random-sort, --sort=random	Shuffles, but groups identical keys. See also: shuf	Yes	No	No	Yes	Yes	No	No
-s	Stabilizes sort by disabling last-resort comparison.	No	No	No	Yes	Yes	No	No
-Ssize, --buffer-size=size	Use size for the maximum size of the memory buffer.	No	No	No	Yes	No	No	No
-tx	'Tab character' separating fields is x.	No	Yes	No	No	Yes	No	Yes
-tchar, --field-separator=char	Uses char instead of non-blank to blank transition.	No	No	No	Yes	Yes	No	No
-Tdir, --temporary-directory=dir	Uses dir for temporaries.	No	Yes	No	Yes	Yes	No	No
-u, --unique	Unique processing to suppress all but one in each set of lines having equal keys.	No	Yes	No	Yes	Yes	No	Yes
-V, --version-sort	Natural sort of (version) numbers within text	No	No	No	Yes	Yes	No	No
-w	Like -i, but ignore only tabs and spaces.	No	Yes	No	No	No	No	No
-z, --zero-terminated	End lines with 0 byte, not newline	No	No	No	Yes	Yes	No	No
--help	Display help and exit	No	No	No	Yes	Yes	No	No
--version	Output version information and exit	No	No	No	Yes	Yes	No	No
/R	Reverses the result of comparisons.	No	No	No	No	No	Yes	No
/S	Specify the number of digits to determine how many digits of each line should be judged.	No	No	No	No	No	Yes	No
/A	Sort by ASCII code.	No	No	No	No	No	Yes	No
/H	Include hidden files when using wild cards.	No	No	No	No	No	Yes	No

Examples

Sort a file in alphabetical order

$ catphonebook Smith, Brett     555-4321Doe, John        555-1234Doe, Jane        555-3214Avery, Cory      555-4132Fogarty, Suzie   555-2314

$ sortphonebook Avery, Cory      555-4132Doe, Jane        555-3214Doe, John        555-1234Fogarty, Suzie   555-2314Smith, Brett     555-4321

Sort by number

The -n option makes the program sort according to numerical value. The du command produces output that starts with a number, the file size, so its output can be piped to sort to produce a list of files sorted by (ascending) file size:

$ du/bin/*|sort-n 4       /bin/domainname24      /bin/ls102     /bin/sh304     /bin/csh

The find command with the ls option prints file sizes in the 7th field, so a list of the LaTeX files sorted by file size is produced by:

$ find.-name"*.tex"-ls|sort-k7n

Columns or fields

Use the -k option to sort on a certain column. For example, use "-k 2" to sort on the second column. In old versions of sort, the +1 option made the program sort on the second column of data (+2 for the third, etc.). This usage is deprecated.

$ catzipcode Adam  12345Bob   34567Joe   56789Sam   45678Wendy 23456

$ sort-k2nzipcode Adam  12345Wendy 23456Bob   34567Sam   45678Joe   56789

Sort on multiple fields

The -k m,n option lets you sort on a key that is potentially composed of multiple fields (start at column m, end at column n):

$ catquota fred 2000bob 1000an 1000chad 1000don 1500eric 500

$ sort-k2,2n-k1,1quota eric 500an 1000bob 1000chad 1000don 1500fred 2000

Here the first sort is done using column 2. -k2,2n specifies sorting on the key starting and ending with column 2, and sorting numerically. If -k2 is used instead, the sort key would begin at column 2 and extend to the end of the line, spanning all the fields in between. -k1,1 dictates breaking ties using the value in column 1, sorting alphabetically by default. Note that bob, and chad have the same quota and are sorted alphabetically in the final output.

Sorting a pipe delimited file

$ sort-k2,2,-k1,1-t'|'zipcode Adam|12345Wendy|23456Sam|45678Joe|56789Bob|34567

Sorting a tab delimited file

Sorting a file with tab separated values requires a tab character to be specified as the column delimiter. This illustration uses the shell's dollar-quote notation^[6]^[7] to specify the tab as a C escape sequence.

$ sort-k2,2-t$'\t'phonebookDoe, John 555-1234Fogarty, Suzie 555-2314Doe, Jane 555-3214Avery, Cory 555-4132Smith, Brett 555-4321

Sort in reverse

The -r option just reverses the order of the sort:

$ sort-rk2nzipcode Joe   56789Sam   45678Bob   34567Wendy 23456Adam  12345

Sort in random

The GNU implementation has a -R --random-sort option based on hashing; this is not a full random shuffle because it will sort identical lines together. A true random sort is provided by the Unix utility shuf.

Sort by version

The GNU implementation has a -V --version-sort option which is a natural sort of (version) numbers within text. Two text strings that are to be compared are split into blocks of letters and blocks of digits. Blocks of letters are compared alpha-numerically, and blocks of digits are compared numerically (i.e., skipping leading zeros, more digits means larger, otherwise the leftmost digits that differ determine the result). Blocks are compared left-to-right and the first non-equal block in that loop decides which text is larger. This happens to work for IP addresses, Debian package version strings and similar tasks where numbers of variable length are embedded in strings.

References

1 2 "sort(1): sort lines of text files - Linux man page". linux.die.net.
↑ "Multics Commands". www.multicians.org.
↑ McIlroy, M. D. (1987). A Research Unix reader: annotated excerpts from the Programmer's Manual, 1971–1986 (PDF) (Technical report). CSTR. Bell Labs. 139.
↑ "MSX-DOS2 Tools User's Manual - MSX-DOS2 TOOLS ユーザーズマニュアル". April 1, 1993 – via Internet Archive.
↑ IBM. "IBM System i Version 7.2 Programming Qshell" (PDF). IBM . Retrieved 2020-09-05.
↑ "The GNU Bash Reference Manual, for Bash, Version 4.2: Section 3.1.2.4 ANSI-C Quoting". Free Software Foundation, Inc. 28 December 2010. Retrieved 1 February 2013. Words of the form $'string' are treated specially. The word expands to string, with backslash-escaped characters replaced as specified by the ANSI C standard.
↑ Fowler, Glenn S.; Korn, David G.; Vo, Kiem-Phong. "KornShell FAQ". Archived from the original on 2013-05-27. Retrieved 3 March 2015. The $'...' string literal syntax was added to ksh93 to solve the problem of entering special characters in scripts. It uses ANSI-C rules to translate the string between the '...'.

External links

Original Sort manpage The original BSD Unix program's manpage
sort(1) – Linux User Manual – User Commands
sort(1) – Plan 9 Programmer's Manual, Volume 1
sort(1) – Inferno General commands Manual
Further details about sort at Softpanorama

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:0-1] 1 2 "sort(1): sort lines of text files - Linux man page". linux.die.net.

[2] "Multics Commands". www.multicians.org.

[reader-3] McIlroy, M. D. (1987). A Research Unix reader: annotated excerpts from the Programmer's Manual, 1971–1986 (PDF) (Technical report). CSTR. Bell Labs. 139.

[4] "MSX-DOS2 Tools User's Manual - MSX-DOS2 TOOLS ユーザーズマニュアル". April 1, 1993 – via Internet Archive.

[5] IBM. "IBM System i Version 7.2 Programming Qshell" (PDF). IBM . Retrieved 2020-09-05.

[6] "The GNU Bash Reference Manual, for Bash, Version 4.2: Section 3.1.2.4 ANSI-C Quoting". Free Software Foundation, Inc. 28 December 2010. Retrieved 1 February 2013. Words of the form $'string' are treated specially. The word expands to string, with backslash-escaped characters replaced as specified by the ANSI C standard.

[KornShell_93-7] Fowler, Glenn S.; Korn, David G.; Vo, Kiem-Phong. "KornShell FAQ". Archived from the original on 2013-05-27. Retrieved 3 March 2015. The $'...' string literal syntax was added to ksh93 to solve the problem of entering special characters in scripts. It uses ANSI-C rules to translate the string between the '...'.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

v t e Unix command-line interface programs and shell builtins
File system	cat chattr chmod chown chgrp cksum cmp cp dd du df file fuser ln ls mkdir mv pax pwd rm rmdir split tee touch type umask
Processes	at bg crontab fg kill nice ps time
User environment	env exit logname mesg talk tput uname who write
Text processing	awk basename comm csplit cut diff dirname ed ex fold head iconv join m4 more nl paste patch printf read sed sort strings tail tr troff uniq vi wc xargs
Shell builtins	alias cd echo test unset wait
Searching	find grep
Documentation	man
Software development	ar ctags lex make nm strip yacc
Miscellaneous	bc cal expr lp od sleep true and false
Categories Standard Unix programs Unix SUS2008 utilities List

v t e Plan 9 command-line interface programs and shell builtins
File system	chmod chgrp cmp cp dd du file gzip ls mkdir pwd rm split tee touch
Processes	kill ps
User environment	passwd who
Text processing	awk basename comm diff ed eqn join sed sort spell strings tail tr troff uniq wc
Shell builtins	echo test
Networking	ip/ipconfig ip/ping netstat
Searching	grep
Software development	ar hoc lex nm strip yacc
Miscellaneous	bc cal fortune sleep
Category

v t e GNU Core Utilities command-line interface programs
File system	chcon chmod chown chgrp cksum cp dd df dir dircolors install ln ls mkdir mkfifo mknod mktemp mv realpath rm rmdir shred sync touch truncate vdir
Text utilities	b2sum base32 base64 cat cksum comm csplit cut expand fmt fold head join md5sum nl numfmt od paste ptx pr sha1sum shuf sort split sum tac tail tr tsort unexpand uniq wc
Shell utilities	arch basename chroot date dirname du echo env expr factor false groups hostid id link logname nice nohup nproc pathchk pinky printenv printf pwd readlink runcon seq sleep stat stdbuf stty tee test timeout true tty uname unlink uptime users who whoami yes