Indentation style


In computer programming, indentation style is a convention, a.k.a. style, governing the indentation of blocks of source code. An indentation style generally involves consistent width of whitespace (indentation size) before each line of a block, so that the lines of code appear to be related, and dictates whether to use space or tab characters for the indentation whitespace.


Overview

This article primarily addresses styles for free-form programming languages. As the name implies, code in such a language need not follow an indentation style. Indentation is a secondary notation, often intended to lower the cognitive load of understanding the structure of the code. It can clarify which statements are executed together under a given control-flow construct.

Some languages, such as Python and occam, use indentation to determine the structure instead of using braces or keywords; this is termed the off-side rule. In such languages, indentation is meaningful to the language processor (such as a compiler or interpreter). A programmer must conform to the language's indentation rules, although they may be free to choose the indentation size.

This article focuses on curly-bracket languages (that delimit blocks with curly brackets, a.k.a. curly braces, a.k.a. braces) and in particular C-family languages, but a convention used for one language can be adapted to another language. For example, a language that uses BEGIN and END keywords instead of braces can be adapted by treating BEGIN the same as the open brace and so on.
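The mapping between keywords and braces can be sketched in C itself. In the following minimal example, the macros BEGIN and END and the function sum_to are hypothetical names introduced only for illustration; they are not part of any standard or library. The layout is Allman-style, with each keyword placed where the corresponding brace would go.

```c
/* Hypothetical macros, for illustration only: they make the
   keyword-to-brace correspondence literal. */
#define BEGIN {
#define END   }

/* Sums the integers 1..n, laid out in Allman style with BEGIN/END
   standing exactly where the braces would be. */
int sum_to(int n)
BEGIN
    int total = 0;
    for (int i = 1; i <= n; i++)
    BEGIN
        total += i;
    END
    return total;
END
```

After preprocessing, the function is ordinary brace-delimited C, so any brace placement rule described in this article carries over to the keywords unchanged.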

Indentation style only applies to text-based languages. Visual programming languages have no indentation.

Research

Despite the ubiquitous use of indentation styles, little research has been conducted on their value. The first experiments, conducted by Weissman in 1974, did not show any effect. [1] In 2023, an experiment by Morzeck et al. [2] showed a significant positive effect for nested if statements, where non-indented code required on average 179% more time to read than indented code. A follow-up experiment by Hanenberg et al. [3] confirmed a large effect (although in that experiment non-indented code took only 113% more time to read) and revealed that the differences in reading times can be explained by the code that can be skipped when it is indented. In another experiment, on JSON objects, [4] non-indented code took 544% more time to read.

Notable styles

The table below includes code examples of various indentation styles. For consistency, indentation size for example code is 4 spaces even though this varies by coding convention.

Allman:

    while (x == y)
    {
        foo();
        bar();
    }

GNU:

    while (x == y)
      {
        foo();
        bar();
      }

Whitesmiths:

    while (x == y)
        {
        foo();
        bar();
        }

K&R:

    while (x == y) {
        foo();
        bar();
    }

Ratliff:

    while (x == y) {
        foo();
        bar();
        }

Horstmann:

    while (x == y)
    {   foo();
        bar();
    }

Pico:

    while (x == y)
    {   foo();
        bar(); }

Lisp:

    while (x == y)
      { foo();
        bar(); }

APL:

    #define W(c,b) {while(c){b}}
    W(x==y,f();b();)

C/C++ styles

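Coding styles for C, C++ and other curly-brace programming languages differ in attributes such as the placement of braces relative to other code, the use of tabs or spaces, and whether single-statement blocks are wrapped in braces.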

K&R

The Kernighan & Ritchie (K&R) style is commonly used for C and C++ code and is the basis for many derivative styles. It is used in the original Unix kernel, in Kernighan and Ritchie's book The C Programming Language, and in Kernighan and Plauger's book The Elements of Programming Style.

Although The C Programming Language does not explicitly define this style, it follows it consistently. From the book:

The position of braces is less important, although people hold passionate beliefs. We have chosen one of several popular styles. Pick a style that suits you, then use it consistently.

In this style, a function has its opening and closing braces on their own lines and with the same indentation as the declaration, while the statements in the body of the function are indented an additional level. A multi-statement block inside a function, however, has its opening brace on the same line as its control clause while the closing brace remains on its own line unless followed by a keyword such as else or while.

Example code:

    int main(int argc, char *argv[])
    {
        while (x == y) {
            do_something();
            do_something_else();
            if (some_error)
                fix_issue(); // single-statement block without braces
            else
                continue_as_usual();
        }
        final_thing();
    }

Egyptian braces

The non-aligned braces of the multi-line blocks are nicknamed "Egyptian braces" (or "Egyptian brackets") for their resemblance to arms in some fanciful poses of ancient Egyptians. [5] [6] [7]

Single statements

A single-statement block does not have braces, which is a cause of easy-to-miss bugs such as the goto fail bug.

One True Brace

The One True Brace Style [8] (abbreviated 1TBS or OTBS [9] ) is like the K&R style, but functions are formatted like multi-statement blocks with the opening brace on the same line as the declaration, and braces are not omitted for a single-statement block. [10]

    bool is_negative(int x) {
        if (x < 0) {
            return true;
        } else {
            return false;
        }
    }

Although not required by languages such as C/C++, using braces for single-statement blocks ensures that inserting a statement does not result in control flow that disagrees with indenting, as seen for example in Apple's infamous goto fail bug.
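The failure mode can be sketched as follows. This is a simplified illustration under stated assumptions, not Apple's actual code: the helper check and the functions verify_unbraced and verify_braced are hypothetical names. In the unbraced version a duplicated goto is indented as if it belonged to the if, but it is unconditional, so the second check is skipped while err still holds the success value from the first.

```c
#include <stdbool.h>

/* Hypothetical check: returns 0 on success, non-zero on failure. */
static int check(bool ok)
{
    return ok ? 0 : -1;
}

/* Unbraced: the second goto looks guarded by the if, but is not. */
int verify_unbraced(bool first_ok, bool second_ok)
{
    int err;

    if ((err = check(first_ok)) != 0)
        goto fail;
        goto fail;  /* always executes; err may still be 0 here */
    if ((err = check(second_ok)) != 0)  /* never reached */
        goto fail;
fail:
    return err;
}

/* Braced: every body is delimited, so a duplicated line inside the
   braces would stay conditional. */
int verify_braced(bool first_ok, bool second_ok)
{
    int err;

    if ((err = check(first_ok)) != 0) {
        goto fail;
    }
    if ((err = check(second_ok)) != 0) {
        goto fail;
    }
fail:
    return err;
}
```

With a passing first check and a failing second check, verify_unbraced reports success (0) while verify_braced reports the failure (-1). Some compilers can flag the unbraced layout; GCC, for example, has a -Wmisleading-indentation warning.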

Cited advantages include shorter code (than K&R) since the starting brace needs no extra line, that the ending brace lines up with the statement it conceptually belongs to, and the perceived stylistic consistency of using the same brace style in both function bodies and multi-line statement blocks. [11]

Sources disagree as to the meaning of One True Brace Style. Some say that it is the variation specified here, [10] while others say it is "hacker jargon" for K&R. [12]

Linux kernel

The Linux kernel source tree is styled in a variant of K&R. [13] Linus Torvalds advises contributors to follow it. Attributes include indentation with 8-column hard tabs and K&R brace placement, as in this example:

    int power(int x, int y)
    {
            int result;

            if (y < 0) {
                    result = 0;
            } else {
                    result = 1;
                    while (y-- > 0)
                            result *= x;
            }
            return result;
    }

Java

A significant body of Java code uses a variant of the K&R style in which the opening brace is on the same line not only for the blocks inside a function, but also for class or method declarations. This style is widespread largely because Sun Microsystems's original style guides [15] [16] [17] used this K&R variant, and as a result, most of the standard source code for the Java API is written in this style. It is also a popular indentation style for ActionScript and JavaScript, along with the Allman style.

Stroustrup

Bjarne Stroustrup adapted the K&R style for C++ in his books, such as Programming: Principles and Practice using C++ and The C++ Programming Language. [18]

Unlike the variants above, Stroustrup does not use a "cuddled else". Thus, Stroustrup would write [18]

    if (x < 0) {
        puts("Negative");
        negative(x);
    }
    else {
        puts("Non-negative");
        nonnegative(x);
    }

Stroustrup extends K&R style for classes, writing them as follows:

    class Vector {
    public:
        // construct a Vector
        Vector(int s) :elem(new double[s]), sz(s) { }
        // element access: subscripting
        double& operator[](int i) { return elem[i]; }
        int size() { return sz; }
    private:
        // pointer to the elements
        double* elem;
        // number of elements
        int sz;
    };

Stroustrup does not indent the labels public: and private:. Also, in this style, while the opening brace of a function starts on a new line, the opening brace of a class is on the same line as the class name.

Stroustrup allows writing short functions all on one line. Stroustrup style is a named indentation style available in the editor Emacs. Stroustrup encourages a K&R-derived style layout with C++ as stated in his modern C++ Core Guidelines. [19]

BSD KNF

The Berkeley Software Distribution (BSD) operating systems use a style that is sometimes termed kernel normal form (KNF). Although mostly intended for kernel code, it is also widely used in userland code. It is essentially a thoroughly documented variant of K&R style as used in the Bell Labs version 6 & 7 Unix source code. [20]

The SunOS kernel and userland use a similar indentation style. [20] Like KNF, this also was based on AT&T style documents and is sometimes termed Bill Joy Normal Form. [21] The SunOS guideline was published in 1996; ANSI C is discussed briefly. The correctness of the indentation of a list of source files can be verified by the cstyle program written by Bill Shannon. [20] [21] [22]

In this style, the hard tabulator (ts in vi) is kept at eight columns, while a soft tabulator is often defined as a helper also (sw in vi), and set at four. The hard tabulators are used to indent code blocks, while a soft tabulator (four spaces) of additional indentation is used for all continuing lines that must be split over multiple lines.

Moreover, function calls do not use a space before the parenthesis, although C-language native statements such as if, while, do, switch and return do (in the case where return is used with parens). Functions that declare no local variables in their top-level block should also leave an empty line after their opening block brace.

Examples:

    while (x == y) {
            something();
            something_else();
    }
    final_thing();

    if (data != NULL && res > 0) {
            if (JS_DefineProperty(cx, o, "data",
                STRING_TO_JSVAL(JS_NewStringCopyN(cx, data, res)),
                NULL, NULL, JSPROP_ENUMERATE) != 0) {
                    QUEUE_EXCEPTION("Internal error!");
                    goto err;
            }
            PQfreemem(data);
    } else {
            if (JS_DefineProperty(cx, o, "data", OBJECT_TO_JSVAL(NULL),
                NULL, NULL, JSPROP_ENUMERATE) != 0) {
                    QUEUE_EXCEPTION("Internal error!");
                    goto err;
            }
    }

    static JSBool
    pgresult_constructor(JSContext *cx, JSObject *obj, uintN argc,
        jsval *argv, jsval *rval)
    {

            QUEUE_EXCEPTION("PGresult class not user-instantiable");

            return (JS_FALSE);
    }

Allman

The Allman style is named after Eric Allman. It is also sometimes termed BSD style since Allman wrote many of the utilities for BSD Unix (although this should not be confused with the different "BSD KNF style"; see above).

This style puts the brace associated with a control statement on the next line, indented to the same level as the control statement. Statements within the braces are indented to the next level. [12]

    while (x == y)
    {
        something();
        something_else();
    }

    final_thing();

This style is similar to the standard indentation used by the Pascal languages and Transact-SQL, where the braces are equivalent to the keywords begin and end.

    (* Example Allman code indentation style in Pascal *)
    procedure dosomething(x, y: Integer);
    begin
        while x = y do
        begin
            something();
            something_else();
        end;
    end;

Consequences of this style are that the indented code is clearly set apart from the containing statement by lines that are almost all whitespace and the closing brace lines up in the same column as the opening brace. Some people feel this makes it easy to find matching braces. The blocking style also delineates the block of code from the associated control statement. Commenting out or removing a control statement or block of code, or code refactoring, are all less likely to introduce syntax errors via dangling or missing braces. Also, it is consistent with brace placement for the outer-function block.

For example, the following is still correct syntactically:

    // while (x == y)
    {
        something();
        something_else();
    }

As is this:

    // for (int i=0; i < x; i++)
    // while (x == y)
    if (x == y)
    {
        something();
        something_else();
    }

Even like this, with conditional compilation:

        int c;
    #ifdef HAS_GETCH
        while ((c = getch()) != EOF)
    #else
        while ((c = getchar()) != EOF)
    #endif
        {
            do_something(c);
        }

Variant: Allman-8

Allman-8 uses the 8-space indentation tabs and 80-column limit of the Linux Kernel variant of K&R. The style purportedly helps improve readability on projectors. Also, the indentation size and column restriction help create a visual cue for identifying excessive nesting of code blocks. These advantages combine to help provide newer developers and learners implicit guidance to manage code complexity.[ citation needed ]

Whitesmiths

The Whitesmiths style, also sometimes termed Wishart style, was originally used in the documentation for the first commercial C compiler, the Whitesmiths Compiler. It was also popular in the early days of Windows, since it was used in three influential Windows programming books, Programmer's Guide to Windows by Durant, Carlson & Yao, Programming Windows by Petzold, and Windows 3.0 Power Programming Techniques by Norton & Yao.

Whitesmiths, along with Allman, were claimed to have been the most common bracing styles in 1991 by the Jargon File, with roughly equal popularity at the time. [12] [23]

This style puts the brace associated with a control statement on the next line, indented. Statements within the braces are indented to the same level as the braces.

Like Ratliff style, the closing brace is indented the same as statements within the braces. [24]

    while (x == y)
        {
        something();
        something_else();
        }

    final_thing();

The advantages of this style are similar to those of the Allman style. Blocks are clearly set apart from control statements. The alignment of the braces with the block emphasizes that the full block is conceptually, and programmatically, one compound statement. Indenting the braces emphasizes that they are subordinate to the control statement. The ending brace no longer lines up with the statement, but instead with the opening brace.

An example:

    if (data != NULL && res > 0)
        {
        if (!JS_DefineProperty(cx, o, "data", STRING_TO_JSVAL(JS_NewStringCopyN(cx, data, res)), NULL, NULL, JSPROP_ENUMERATE))
            {
            QUEUE_EXCEPTION("Internal error!");
            goto err;
            }
        PQfreemem(data);
        }
    else if (!JS_DefineProperty(cx, o, "data", OBJECT_TO_JSVAL(NULL), NULL, NULL, JSPROP_ENUMERATE))
        {
        QUEUE_EXCEPTION("Internal error!");
        goto err;
        }

Here else if is treated as a single statement, much like the #elif preprocessor statement.

GNU

Like the Allman and Whitesmiths styles, GNU style puts braces on a line by themselves, indented by two spaces, except when opening a function definition, where they are not indented. [25] In either case, the contained code is indented by two spaces from the braces.

Popularised by Richard Stallman, the layout may be influenced by his background of writing Lisp code. [26] In Lisp, the equivalent to a block (a progn) is a first-class data entity, and giving it its own indentation level helps to emphasize that, whereas in C, a block is only syntax. This style can also be found in some ALGOL and XPL programming language textbooks from the 1960s and 1970s. [27] [28] [ discuss ]

Although not indentation per se, GNU coding style also includes a space after a function name before the left parenthesis of an argument list. [25]

    static char *
    concat (char *s1, char *s2)
    {
      while (x == y)
        {
          something ();
          something_else ();
        }
      final_thing ();
    }

This style combines the advantages of Allman and Whitesmiths, thereby removing the possible Whitesmiths disadvantage of braces not standing out from the block. One disadvantage is that the ending brace no longer lines up with the statement it conceptually belongs to. Another possible disadvantage is that it might waste space by using two visual levels of indents for one conceptual level, but in reality this is unlikely because, in systems with single-level indentation, each level is usually at least 4 spaces, same as 2 * 2 spaces in GNU style.

The GNU Coding Standards recommend this style, and nearly all maintainers of GNU project software use it.[ citation needed ]

The GNU Emacs text editor and the GNU systems' indent command will reformat code according to this style by default. [29] Those who do not use GNU Emacs, or similarly extensible/customisable editors, may find that the automatic indentation settings of their editor are unhelpful for this style. However, many editors defaulting to KNF style cope well with the GNU style when the tab width is set to two spaces; likewise, GNU Emacs adapts well to KNF style by simply setting the tab width to eight spaces. In both cases, automatic reformatting destroys the original spacing, but automatic line indenting will work properly.

Steve McConnell, in his book Code Complete, advises against using this style: he marks a code sample which uses it with a "Coding Horror" icon, symbolizing especially dangerous code, and states that it impedes readability. [24] The Linux kernel coding style documentation also recommends against this style, urging readers to burn a copy of the GNU coding standards as a "great symbolic gesture". [11]

Horstmann

The 1997 edition of Computing Concepts with C++ Essentials by Cay S. Horstmann adapts Allman by placing the first statement of a block on the same line as the opening brace. This style is also used in examples in Jensen and Wirth's Pascal User Manual and Report. [30]

    while (x == y)
    {   something();
        something_else();
        //...
        if (x < 0)
        {   printf("Negative");
            negative(x);
        }
        else
        {   printf("Non-negative");
            nonnegative(x);
        }
    }
    final_thing();

This style combines the advantages of Allman by keeping the vertical alignment of the braces for readability, and identifying blocks easily, with the saving of a line of the K&R style. However, the 2003 edition now uses Allman style throughout. [31]

Pico

This is the style used most commonly in the language Pico by its designers. Pico lacks return statements, and uses semicolons as statement separators instead of terminators. It yields this syntax: [32]

    stuff(n):
    { x: 3 * n;
      y: do_stuff(x);
      y + x }

The advantages and disadvantages are similar to those of saving screen real estate with K&R style. An added advantage is that the starting and closing braces are consistent in application (both share space with a line of code), relative to K&R style, where one brace shares space with a line of code and one brace has a line alone.

Ratliff

In the book Programmers at Work, [33] C. Wayne Ratliff, the original programmer behind the popular dBase-II and -III fourth-generation programming languages, discussed a style that is like 1TBS, except that the closing brace lines up with the indentation of the nested block. He indicated that the style was originally documented in material from Digital Research Inc. This style has sometimes been termed banner style, [34] possibly for the resemblance to a banner hanging from a pole. In this style, which is to Whitesmiths as K&R is to Allman, the closing control is indented the same as the last item in the list (and thus properly loses salience). [24] The style can make visual scanning easier for some, since the headers of any block are the only thing exdented at that level (the theory being that the closing control of the prior block interferes with the visual flow of the next block header in the K&R and Allman styles). Kernighan and Plauger use this style in the Ratfor code in Software Tools. [35]

    // In C
    for (i = 0; i < 10; i++) {
        if (i % 2 == 0) {
            do_something(i);
            }
        else {
            do_something_else(i);
            }
        }

C derived language styles

The following styles are common in various languages derived from C, which resemble C in some respects and differ from it in others. They can be adapted to C as well. They might be applied to C code written as part of a project mostly written in one of these other languages, where maintaining a consistent look and feel across the project's core code overrides considerations of using a more conventional C style.

Lisp style

While GNU style is sometimes characterized as C code indented by a Lisp programmer, one might even go so far as to insert closing braces together in the last line of a block. This style makes indentation the only way to distinguish blocks of code, but has the advantage of containing no uninformative lines. This could easily be called the Lisp style because this style is very common in Lisp code. In Lisp, the grouping of identical braces at the end of expression trees is meant to signify that it is not the user's job to visually track nesting levels, only to understand the structure of the tree.

The traditional Lisp variant of this style prefers extremely narrow levels of indentation (typically two spaces) because Lisp code usually nests very deeply since Lisp features only expressions, with no distinct class of statements; function arguments are mostly indented to the same level to illustrate their shared status within the enclosing expression. This is also because, braces aside, Lisp is conventionally a very terse language, omitting even common forms of simple boilerplate code as uninformative, such as the else keyword in an if : then | else block, instead rendering it uniformly as (if expr1 expr2 expr3).

    // C
    for (i = 0; i < 10; i++)
        {if (i % 2 == 0)
            {do_something(i);}
         else
            {do_something_else(i);
             do_third_thing(i);}}

    ;; Lisp
    (dotimes (i 10)
        (if (= (rem i 2) 0)
            (do-something i)
            (progn
                (do-something-else i)
                (do-third-thing i))))

Note: progn is a procedure for evaluating multiple sub-expressions sequentially for effects, while discarding all but the final (nth) return value. If all return values are desired, the values procedure would be used.

Haskell style

Haskell layout can make the placement of braces optional, although braces and semicolons are allowed in the language. [36] The two segments below are equally acceptable to the compiler:

    braceless = do
      text <- getContents
      let
        firstWord = head $ words text
        bigWord = map toUpper firstWord
      putStrLn bigWord

    braceful = do
      { text <- getContents
      ; let
          { firstWord = head $ words text
          ; bigWord = map toUpper firstWord
          }
      ; putStrLn bigWord
      }

In Haskell, layout can replace braces. Usually the braces and semicolons are omitted for procedural do sections and the program text in general, but the style is commonly used for lists, records and other syntactic elements made up of some pair of parentheses or braces, which are separated with commas or semicolons. [37] If code following the keywords where, let, or of omits braces and semicolons, then indentation is significant. [38]

APL style

For an example of how terse APL typically is, here is the implementation of the step function for the Game of Life:

    life ← {⊃ 1 ⍵ ∨.∧ 3 4 = +/ +⌿ ¯1 0 1 ∘.⊖ ¯1 0 1 ⌽¨ ⊂⍵}

APL-style C resembles the terse style of APL code and is commonly used in APL implementations. [39] This style was pioneered by Arthur Whitney, and is heavily used in the implementation of K, Whitney's own project. The J programming language is implemented in this style as well. Not all implementations of APL use this style of C, however; GNU APL and Dyalog APL, for example, do not.

In addition to the indentation, APL-style C typically shortens names to one or two characters, which reduces the amount of indentation and the number of expressions spanning multiple lines. [40]

Indentation size

Typically, programmers use the same width of whitespace to indent each block of code with commonly used widths varying from 1 to 4 spaces.

An experiment performed on Pascal code in 1983 found that indentation size significantly affected comprehensibility. Indentation sizes between 2 and 4 characters proved optimal. [41]

Although they both affect the general layout of code, indentation size is independent of the indentation style discussed here.

Tab vs. space

Typically, a programmer uses a text editor that provides tab stops at fixed intervals (a number of spaces) to assist in maintaining whitespace according to a style. The interval is called the tab width. The code may be stored either with one tab character for each press of the tab key, or with a sequence of spaces equal in number to the tab width.

Storing tab characters in code can cause visual misalignment when viewed in different contexts, which counters the value of the indentation style.
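The misalignment follows from how a tab is rendered: it advances to the next tab stop, so its width depends on the viewer's tab width. The following is a minimal sketch, assuming tab stops at every multiple of the tab width; the function name indent_columns is hypothetical and introduced only for this illustration.

```c
#include <stddef.h>

/* Visual width, in columns, of the leading whitespace of a line,
   assuming tab stops every tab_width columns. A tab_width of 0 is
   treated as "a tab occupies one column" (an assumption of this sketch). */
size_t indent_columns(const char *line, size_t tab_width)
{
    size_t col = 0;

    for (; *line == ' ' || *line == '\t'; line++) {
        if (*line == '\t' && tab_width > 0)
            col += tab_width - (col % tab_width); /* jump to next tab stop */
        else
            col++;
    }
    return col;
}
```

A line indented with one tab and a line indented with four spaces both measure 4 columns when the tab width is 4, but 8 and 4 columns respectively when the tab width is 8, which is how mixed files drift out of alignment between editors.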

Programmers lack consensus on storing tab characters. Proponents of storing tab characters cite ease of typing and smaller text files since a single tab character serves the purpose of multiple spaces. Opponents, such as Jamie Zawinski, state that using spaces instead increases cross-platform portability. [42] Others, such as the writers of the WordPress coding standards, state the opposite: that hard tabs increase portability. [43] A survey of the top 400,000 repositories on GitHub found that spaces are more common. [44]

Many text editors, including Notepad++, TextEdit, Emacs, vi, and nano, can be configured to either store tab characters when entered via the tab key or to convert them to spaces (based on the configured tab width) so that tab characters are not added to the file when the tab key is pressed. Some editors can convert tab to space characters and vice versa.

Some text file pagers, such as less, can be configured for a tab width. Some tools such as expand/unexpand can convert on the fly via filters.

Style automation

A tool can automate formatting code per an indentation style, for example the Unix indent command.

Emacs provides commands to modify indentation, including hitting Tab on a given line. M-x indent-region indents code.

Elastic tabstops is a tabulation style which requires support from the text editor, where entire blocks of text are kept automatically aligned when the length of one line in the block changes.

Losing track of blocks

In more complicated code, the programmer may lose track of block boundaries while reading the code. This is often experienced in large sections of code containing many compound statements nested to many levels of indentation. As the programmer scrolls to the bottom of a huge set of nested statements, they may lose track of context such as the control structure at the top of the block.

Long compound statements can be a code smell indicating excessive complexity, which can be addressed by refactoring.
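One rough way to spot such code is to measure how deeply its braces nest. The sketch below uses a hypothetical function name, max_brace_depth, and makes a simplifying assumption: it scans characters naively, so braces inside string literals, character constants, and comments are counted as well.

```c
#include <stddef.h>

/* Deepest level of brace nesting found in src.
   Naive scan: does not skip strings, character constants, or comments.
   An unmatched closing brace is ignored rather than driving depth negative. */
size_t max_brace_depth(const char *src)
{
    size_t depth = 0;
    size_t deepest = 0;

    for (; *src != '\0'; src++) {
        if (*src == '{') {
            if (++depth > deepest)
                deepest = depth;
        } else if (*src == '}' && depth > 0) {
            depth--;
        }
    }
    return deepest;
}
```

A function body containing an if block inside a loop inside the function's own braces would measure 3; what threshold counts as excessive is a matter of project convention.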

Programmers who rely on counting the opening braces may have difficulty with indentation styles such as K&R, where the starting brace is not visually separated from its control statement. Programmers who rely more on indentations will gain more from styles that are vertically compact, such as K&R, because the blocks are shorter.

To avoid losing track of control statements such as for, a large indentation can be used, such as an 8-unit-wide hard tab, along with breaking up large functions into smaller and more readable functions. The Linux kernel is written this way, while using the K&R style.

Some text editors allow the programmer to jump between the two corresponding braces of a block. For example, vi jumps to the brace enclosing the same block as the one under the cursor when pressing the % key. Since the text cursor's next key (viz., the n key) retained directional positioning information (whether the up or down key was formerly pressed), the dot macro (the . key) could then be used to place the text cursor on the next brace, [45] given a suitable coding style. Instead, inspecting the block boundaries using the % key can be used to enforce a coding standard.

Another way to maintain block awareness is to use comments after the closing brace. For example:

    for (int i = 0; i < total; i++) {
        foo();
    } //for (i)

    if (x < 0) {
        bar();
    } //if (x < 0)

A disadvantage is maintaining the same code in multiple locations above and below the block.

Some editors provide support for maintaining block awareness. A folding editor can hide (fold) and reveal (unfold) blocks by indentation level. Some editors highlight matching braces when the cursor is positioned next to one.
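The brace-matching feature can be sketched as a depth-counting scan. This is an illustration of the general technique, not the implementation of any particular editor; the function name match_brace is hypothetical, and the scan is naive in that it does not skip braces inside strings or comments.

```c
#include <stddef.h>
#include <string.h>

/* Index of the brace matching the one at src[pos], or -1 if src[pos]
   is not a brace or has no partner. Scans forward from an opening
   brace and backward from a closing brace, tracking nesting depth. */
long match_brace(const char *src, size_t pos)
{
    size_t len = strlen(src);
    long depth = 0;

    if (pos >= len)
        return -1;
    if (src[pos] == '{') {
        for (size_t i = pos; i < len; i++) {
            if (src[i] == '{')
                depth++;
            else if (src[i] == '}' && --depth == 0)
                return (long)i;
        }
    } else if (src[pos] == '}') {
        /* i runs pos, pos-1, ..., 0 without unsigned underflow in the body */
        for (size_t i = pos + 1; i-- > 0; ) {
            if (src[i] == '}')
                depth++;
            else if (src[i] == '{' && --depth == 0)
                return (long)i;
        }
    }
    return -1;
}
```

For the text a{b{c}d}e, the brace at index 1 pairs with the one at index 7, and the brace at index 3 pairs with the one at index 5, in either direction.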

Statement insertion

The K&R style prevents the common error caused by inserting a line of code after a control statement before the open brace. The inserted line causes the block to become disassociated from the control statement.

Given this starting code:

    for (int i = 0; i < 10; i++)
    {
        do_something();
    } //for (i)

do_something will be called 10 times. Then, it is modified by adding a new second line:

    for (int i = 0; i < 10; i++)
        do_something_else();
    {
        do_something(); // called once!
    } //for (i)

The original block (lines 3-5) is no longer the body of the for loop and executes only once. Further, the comment on line 5 becomes wrong.

K&R style avoids this problem by keeping the control statement and the opening brace on the same line.

Original:

    for (int i = 0; i < 10; i++) {
        do_something();
    } //for (i)

Adding a new second line does not affect how many times do_something is called or the validity of the end comment.

    for (int i = 0; i < 10; i++) {
        do_something_else();
        do_something();
    } //for (i)


Coding conventions are a set of guidelines for a specific programming language that recommend programming style, practices, and methods for each aspect of a program written in that language. These conventions usually cover file organization, indentation, comments, declarations, statements, white space, naming conventions, programming practices, programming principles, programming rules of thumb, architectural best practices, etc. These are guidelines for software structural quality. Software programmers are highly recommended to follow these guidelines to help improve the readability of their source code and make software maintenance easier. Coding conventions are only applicable to the human maintainers and peer reviewers of a software project. Conventions may be formalized in a documented set of rules that an entire team or company follows, or may be as informal as the habitual coding practices of an individual. Coding conventions are not enforced by compilers.

<span class="mw-page-title-main">Comment (computer programming)</span> Explanatory note in the source code of a computer program

In computer programming, a comment is a human-readable explanation or annotation in the source code of a computer program. They are added with the purpose of making the source code easier for humans to understand, and are generally ignored by compilers and interpreters. The syntax of comments in various programming languages varies considerably.

This article compares a large number of programming languages by tabulating their data types, their expression, statement, and declaration syntax, and some common operating-system interfaces.

<span class="mw-page-title-main">Goto</span> One-way control statement in computer programming

Goto is a statement found in many computer programming languages. It performs a one-way transfer of control to another line of code; in contrast a function call normally returns control. The jumped-to locations are usually identified using labels, though some languages use line numbers. At the machine code level, a goto is a form of branch or jump statement, in some cases combined with a stack adjustment. Many languages support the goto statement, and many do not.

References

  1. Weissman, Laurence Mark (1974). A Methodology For Studying The Psychological Complexity of Computer Programs. CSRG-37 (Technical report). Computer Systems Research Group, University of Toronto. OCLC 1085612768. technicalreportc37univ via Internet Archive.
  2. Morzeck, Johannes; Hanenberg, Stefan; Werger, Ole; Gruhn, Volker (2023). Indentation in Source Code: A Randomized Control Trial on the Readability of Control Flows in Java Code with Large Effects. Proceedings of the 18th International Conference on Software Technologies - ICSOFT. Rome, Italy. pp. 117–128. doi:10.5220/0012087500003538. ISBN 978-989-758-665-1 via Stefan Hanenberg on Google Drive (preprint).
  3. Hanenberg, Stefan; Morzeck, Johannes; Gruhn, Volker (9 August 2024). "Indentation and reading time: a randomized control trial on the differences between generated indented and non-indented if-statements". Empirical Software Engineering. 29 (5): 134. doi:10.1007/s10664-024-10531-y. ISSN 1573-7616.
  4. Hanenberg, Stefan; Morzeck, Johannes; Werger, Ole; Gries, Stefan; Gruhn, Volker (2024). "Indentation and Reading Time: A Controlled Experiment on the Differences Between Generated Indented and Non-indented JSON Objects". In Fill, Hans-Georg; Domínguez Mayo, Francisco José; van Sinderen, Marten; Maciaszek, Leszek A. (eds.). Software Technologies. Communications in Computer and Information Science. Vol. 2104. Cham: Springer Nature Switzerland. pp. 50–75. doi:10.1007/978-3-031-61753-9_4. ISBN 978-3-031-61753-9.
  5. "Java Style Guide". Archived from the original on 12 July 2018. Using either "Egyptian" curly braces or C-style curly braces is acceptable
  6. "Egyptian brackets". Foldoc. A humourous [sic] term for K&R indent style, referring to the "one hand up in front, one down behind" pose
  7. "Google JavaScript Style Guide". Braces follow the Kernighan and Ritchie style ("Egyptian brackets") for nonempty blocks and block-like constructs
  8. Darwin, Ian F. (1988). Checking C programs with Lint. California: O'Reilly and Associates. p. 51. ISBN 9780937175309.
  9. "1TBS".
  10. "Brace styles and JavaScript". 7 January 2013. Retrieved 8 November 2018.
  11. "Linux kernel coding style". Retrieved 1 January 2017.
  12. "The Jargon File". 4.4.7. 29 December 2003. Retrieved 18 August 2014.
  13. A detailed description of the style is given at kernel.org.
  14. Larabel, Michael. "The Linux Kernel Deprecates The 80 Character Line Coding Style". Phoronix. Phoronix Media. Retrieved 1 May 2022.
  15. Reddy, Achut (30 March 2000). "Java Coding Style Guide" (PDF). Sun Microsystems. Archived from the original (PDF) on 28 February 2006. Retrieved 30 May 2008.
  16. "Java Code Conventions" (PDF). Sun Microsystems. 12 September 1997. Archived from the original (PDF) on 13 May 2008. Retrieved 30 May 2008.
  17. "Code Conventions for the Java Programming Language". Sun Microsystems. 20 March 1997. Retrieved 30 May 2008.
  18. Stroustrup, Bjarne (September 2010). "PPP Style Guide" (PDF).
  19. Stroustrup, Bjarne. "C++ Core Guidelines". GitHub. Retrieved 3 November 2018.
  20. Shannon, Bill (19 August 1996). "C Style and Coding Standards for SunOS" (PDF). 1.8. Sun Microsystems, Inc. Retrieved 15 June 2019.
  21. Gregg, Brendan. "DTraceToolkit Style Guide". Retrieved 6 February 2015.
  22. Shannon, Bill (9 September 1998). "cstyle.pl". illumos-gate. 1.58. Sun Microsystems, Inc. Retrieved 6 February 2015.
  23. "The Jargon File (Version 2.4.3)". 2.4.3. 23 January 1991. Retrieved 14 May 2024.
  24. McConnell, Steve (2004). Code Complete: A practical handbook of software construction. Redmond, WA: Microsoft Press. pp. 746–747. ISBN 978-0-7356-1967-8.
  25. "Formatting Your Source Code". GNU Coding Standards. Retrieved 6 June 2016.
  26. Stallman, Richard (28 October 2002). "My Lisp Experiences and the Development of GNU Emacs (Transcript of speech at the International Lisp Conference)". Retrieved 6 June 2016.
  27. Baumann, Richard; Feliciano, Manuel; Bauer, Friedrich Ludwig; Samelson, Klaus (1964). Introduction to ALGOL – A primer for the non-specialist, emphasizing the practical uses of the algorithmic language. Series in Automatic Computation. Englewood Cliffs, New Jersey, USA: Prentice-Hall, Inc. ISBN 0-13-477828-6. LCCN 64-10740. ark:/13960/t6qz35p37. Retrieved 23 October 2022.
  28. W. M. McKeeman, J. J. Horning, and D. B. Wortman, A Compiler Generator, 1970, https://archive.org/details/compilergenerato00mcke
  29. Tested on the sample source code above on Ubuntu 18.04 with GNU indent 2.2.11 and GNU Emacs 25.2.2 started with emacs --no-init-file.
  30. Jensen, Kathleen; Wirth, Niklaus (1974). PASCAL User Manual and Report. Springer-Verlag.
  31. Horstmann Style Guide
  32. Ohno, Asako (2013). "A methodology to teach exemplary coding style considering students' coding style feature contains fluctuations". 2013 IEEE Frontiers in Education Conference (FIE). pp. 1908–1910. doi:10.1109/fie.2013.6685167. ISBN 9781467352611. S2CID 28385526.
  33. Lammers, Susan (1986). Programmers at Work. Microsoft Press. ISBN 978-0-914845-71-3.
  34. Pattee, Jim. "Artistic Style 2.05 Documentation". Artistic Style. Retrieved 24 April 2015.
  35. Kernighan, Brian W.; Plauger, P. J. (1976). Software Tools. Addison-Wesley. ISBN 9780201036695.
  36. "The Haskell 98 Report". haskell.org. Retrieved 3 March 2016.
  37. Lipovača, Miran. "Making Our Own Types and Typeclasses". learnyouahaskell.com. Retrieved 3 February 2016.
  38. Haskell Report 1.2 (1992), p.131 B.4 "Layout"
  39. "The J Incunabulum". jsoftware.com. Retrieved 19 May 2022.
  40. "The J source code". github.com. Retrieved 12 September 2024.
  41. Miara, Richard J.; Musselman, Joyce A.; Navarro, Juan A. & Shneiderman, Ben (November 1983). "Program Indentation and Comprehensibility" (PDF). Communications of the ACM. 26 (11): 861–867. doi:10.1145/182.358437. S2CID 11767796. Retrieved 3 August 2017.
  42. Zawinski, Jamie (2000). "Tabs versus Spaces: An Eternal Holy War". Retrieved 6 June 2016.
  43. "WordPress Coding Standards". Retrieved 6 June 2016.
  44. Hoffa, Felipe (26 July 2017). "400,000 GitHub repositories, 1 billion files, 14 terabytes of code: Spaces or Tabs?". Medium. Retrieved 9 July 2019.
  45. Lamb, Linda (1998). Learning the vi editor. O'Reilly. ISBN 9781565924260.

Tabs and spaces