Capitalization (American spelling; also British spelling in Oxford) or capitalisation (Commonwealth English; all other meanings) is writing a word with its first letter as a capital letter (uppercase letter) and the remaining letters in lower case, in writing systems with a case distinction. The term also may refer to the choice of the casing applied to text.
Conventional writing systems (orthographies) for different languages have different conventions for capitalization, for example, the capitalization of titles. Conventions also vary, to a lesser extent, between different style guides. In addition to the Latin script, capitalization also affects the Armenian, Cyrillic, Georgian and Greek alphabets.
The full rules of capitalization in English are complicated. The rules have also changed over time, generally to capitalize fewer words. The conventions used in an 18th-century document will be unfamiliar to a modern reader; for instance, many common nouns were capitalized.
The systematic use of capitalized and uncapitalized words in running text is called "mixed case".
Owing to the essentially arbitrary nature of orthographic classification and the existence of variant authorities and local house styles, questionable capitalization of words is not uncommon, even in respected newspapers and magazines. Most publishers require consistency, at least within the same document, in applying a specified standard: this is described as "house style".
Many languages distinguish between formal and informal 2nd-person pronouns.
This section needs additional citations for verification .(October 2015) |
The capitalization of geographic terms in English text generally depends on whether the author perceives the term as a proper noun, in which case it is capitalized, or as a combination of an established proper noun with a normal adjective or noun, in which case the latter are not capitalized. There are no universally agreed lists of English geographic terms which are considered as proper nouns. The following are examples of rules that some[ which? ] British and U.S. publishers have established in style guides for their authors:
Upper case: East Asia, Southeast Asia, Central Asia, Central America, North Korea, South Africa, the European Union, the Republic of Poland, the North Atlantic, the Middle East, the Arctic, The Gambia, The Bahamas, The Hague
Lower case: western China, southern Beijing, western Mongolia, eastern Africa, northern North Korea, the central Gobi, the lower Yangtze River.
Abbreviated
When a term is used as a name and then subsequently a shorter term is used, then that shorter term may be used generically. If that is the case do not capitalize. ("The Tatra National Park is a tourist destination in Poland. Watch out for bears when visiting the national park.") [14] [15]
The following names are given to systems of capitalization:
"The quick brown fox jumps over the lazy dog."
The standard case used in English prose. Generally equivalent to the baseline universal standard of formal English orthography mentioned above; that is, only the first word is capitalized, except for proper nouns and other words which are generally capitalized by a more specific rule.
A variation is mid-sentence case which is identical to sentence case except that the first word is not capitalized (unless it would be capitalized by another rule). This type of letter case is used for entries in dictionaries.
"The Quick Brown Fox Jumps Over the Lazy Dog."
Also known as headline case and capital case. All words capitalized, except for certain subsets defined by rules that are not universally standardized, often minor words such as "the" (as above), "of", or "and". Other commonly lowercase words are prepositions and coordinating conjunctions. [16] The standardization is only at the level of house styles and individual style manuals. (See Headings and publication titles.) A simplified variant is start case, where all words, including articles, prepositions, and conjunctions, start with a capital letter.
"THE QUICK BROWN FOX JUMPS OVER THE LAZY DOG."
Also written as all-caps. Capital letters only. This style can be used for headlines and book or chapter titles at the top of a book page. It is commonly used in transcribed speech to indicate that a person is shouting, or to indicate a hectoring and obnoxious speaker. [17] [18] For this reason, it is generally discouraged. Long spans of Latin-alphabet text in all uppercase are harder to read because of the absence of the ascenders and descenders found in lowercase letters, which can aid recognition. [19] [20] In professional documents, a commonly preferred alternative to all caps text is the use of small caps to emphasize key names or acronyms, or the use of italics or (more rarely) bold. [21] In addition, if all caps must be used, it is customary in headings of a few words to slightly widen the spacing between the letters, by around 10% of the point height. This practice is known as tracking or letterspacing. [22]
Compound names are nouns that are made up of more than one stem, or a stem and one or more affixes. [lower-alpha 1] Names that are made up of several affixes and one or more nouns are not compound names under this definition, but noun phrases, that are made up of one or more separable affixes, and one or more nouns. Examples of the separable affixes may be found in List of family name affixes. [lower-alpha 2] Noun phrases are in this context treated as if they were nouns. So the general rule that nouns-as-names are capitalized in principle applies to compound names and noun-phrases-as-names as well. There are, however, exceptions to this rule that differ by language community.
The Chicago Manual of Style recommends that the titles of English-language artistic works (plays, novels, essays, paintings, etc.) capitalize the first word and the last word in the title. [28] Additionally, most other words within a title are capitalized as well; articles and coordinating conjunctions are not capitalized. [28] Sources disagree on the details of capitalizing prepositions. [28] For example, the Chicago Manual of Style recommends rendering all prepositions in lowercase, [29] whereas the APA style guide instructs: Capitalize major words in titles of books and articles within the body of the paper. Conjunctions, articles, and short prepositions are not considered major words; however, capitalize all words of four letters or more. [30]
In other languages, such as the Romance languages, only the first word and proper names are capitalized.
Acronyms are usually capitalized, with a few exceptions:
In most languages that use diacritics, these are treated the same way in uppercase whether the text is capitalized or all-uppercase. They may be always preserved (as in German) or always omitted (as in Greek) or often omitted (as in French). [31] Some attribute this to the fact that diacritics on capital letters were not available earlier on typewriters, and it is now becoming more common to preserve them in French and Spanish (in both languages the rule is to preserve them, [32] although in France and Mexico, for instance, schoolchildren are often erroneously taught that they should not add diacritics on capital letters).
However, in the polytonic orthography used for Greek prior to 1982, accents were omitted in all-uppercase words, but kept as part of an uppercase initial (written before rather than above the letter). The latter situation is provided for by title-case characters in Unicode. When Greek is written with the present day monotonic orthography, where only the acute accent is used, the same rule is applied. The accent is omitted in all-uppercase words but it is kept as part of an uppercase initial (written before the letter rather than above it). The dialytika (diaeresis) should also always be used in all-uppercase words (even in cases where they are not needed when writing in lowercase, e.g. ΑΫΛΟΣ — άυλος).
Some languages treat certain digraphs as single letters for the purpose of collation. In general, where one such is formed as a ligature, the corresponding uppercase form is used in capitalization; where it is written as two separate characters, only the first will be capitalized. Thus Oedipus or Œdipus are both correct, but OEdipus is not. Examples with ligature include Ærøskøbing in Danish, where Æ/æ is a completely separate letter rather than merely a typographic ligature (the same applies in Icelandic); examples with separate characters are Llanelli in Welsh, where Ll is a single letter; and Ffrangeg in Welsh where Ff is equivalent to English F (whereas Welsh F corresponds to English V). [33] Presentation forms, however, can use doubled capitals, such as the logo of the National Library of Wales (Llyfrgell Genedlaethol Cymru). The position in Hungarian is similar to the latter.
In languages where inflected forms of a word may have extra letters at the start, the capitalized letter may be the initial of the root form rather than the inflected form. For example, in Irish, in the placename Sliabh na mBan, "(the) mountain of the women" (anglicized as Slievenamon), the word-form written mBan contains the genitive plural of the noun bean, "woman", mutated after the genitive plural definite article (i.e., "of the"). The written B is mute in this form.
Other languages may capitalize the initial letter of the orthographic word, even if it is not present in the base, as with definite nouns in Maltese that start with certain consonant clusters. For example, l-Istati Uniti (the United States) capitalize the epenthetic I, even though the base form of the word — without the definite article — is stati.
In English, there are a few capitonyms, which are words whose meaning (and sometimes pronunciation) varies with capitalization. For example, the month August versus the adjective august. Or the verb polish versus the adjective Polish.
Camel case is the practice of writing phrases without spaces or punctuation and with capitalized words. The format indicates the first word starting with either case, then the following words having an initial uppercase letter. Common examples include YouTube, PowerPoint, HarperCollins, FedEx, iPhone, eBay, and LaGuardia. Camel case is often used as a naming convention in computer programming. It is also sometimes used in online usernames such as JohnSmith, and to make multi-word domain names more legible, for example in promoting EasyWidgetCompany.com. In fact, WikiWikiWeb, an ancestor of Wikipedia, is written in camel case.
The apostrophe is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for three basic purposes:
A proper noun is a noun that identifies a single entity and is used to refer to that entity as distinguished from a common noun, which is a noun that refers to a class of entities and may be used when referring to instances of a specific class. Some proper nouns occur in plural form, and then they refer to groups of entities considered as unique. Proper nouns can also occur in secondary applications, for example modifying nouns, or in the role of common nouns. The detailed definition of the term is problematic and, to an extent, governed by convention.
German orthography is the orthography used in writing the German language, which is largely phonemic. However, it shows many instances of spellings that are historic or analogous to other spellings rather than phonemic. The pronunciation of almost every word can be derived from its spelling once the spelling rules are known, but the opposite is not generally the case.
Letter case is the distinction between the letters that are in larger uppercase or capitals and smaller lowercase in the written representation of certain languages. The writing systems that distinguish between the upper- and lowercase have two parallel sets of letters: each in the majuscule set has a counterpart in the minuscule set. Some counterpart letters have the same shape, and differ only in size, but for others the shapes are different. The two case variants are alternative representations of the same letter: they have the same name and pronunciation and are typically treated identically when sorting in alphabetical order.
Tagalog grammar are the rules that describe the structure of expressions in the Tagalog language, one of the languages in the Philippines.
Title case or headline case is a style of capitalization used for rendering the titles of published works or works of art in English. When using title case, all words are capitalized, except for minor words that are not the first or last word of the title. There are different rules for which words are major, hence capitalized. As an example, a headline might be written like this: "The Quick Brown Fox Jumps over the Lazy Dog".
The iota subscript is a diacritic mark in the Greek alphabet shaped like a small vertical stroke or miniature iota ⟨ι⟩ placed below the letter. It can occur with the vowel letters eta ⟨η⟩, omega ⟨ω⟩, and alpha ⟨α⟩. It represents the former presence of an offglide after the vowel, forming a so‐called "long diphthong". Such diphthongs —phonologically distinct from the corresponding normal or "short" diphthongs —were a feature of ancient Greek in the pre-classical and classical eras.
Spanish orthography is the orthography used in the Spanish language. The alphabet uses the Latin script. The spelling is fairly phonemic, especially in comparison to more opaque orthographies like English, having a relatively consistent mapping of graphemes to phonemes; in other words, the pronunciation of a given Spanish-language word can largely be predicted from its spelling and to a slightly lesser extent vice versa. Spanish punctuation uniquely includes the use of inverted question and exclamation marks: ⟨¿⟩⟨¡⟩.
Orthographic conventions have varied over time, and vary by publishers, authors, and regional preferences, on whether and when Internet should be capitalized. When the Internet first came into common use, most publications treated Internet as a capitalized proper noun, but this has become less common. This reflects the tendency in English to capitalize new terms and move them to lowercase as they become familiar. The word is sometimes still capitalized to distinguish the global IP-based internet from internets that are smaller or not IP-based, though many publications, including the AP Stylebook since 2016, recommend the lowercase form in every case. In 2016, the Oxford English Dictionary found that, based on a study of around 2.5 billion printed and online sources, "Internet" was capitalized in 54% of cases, with Internet being preferred in the United States and internet being preferred in the United Kingdom.
The nouns of the German language have several properties, some unique. As in many related Indo-European languages, German nouns possess a grammatical gender; the three genders are masculine, feminine, and neuter. Words for objects without obvious masculine or feminine characteristics like 'bridge' or 'rock' can be masculine or feminine. German nouns are declined depending on their grammatical case and whether they are singular or plural. German has four cases: nominative, accusative, dative and genitive.
Czech orthography is a system of rules for proper formal writing (orthography) in Czech. The earliest form of separate Latin script specifically designed to suit Czech was devised by Czech theologian and church reformist Jan Hus, the namesake of the Hussite movement, in one of his seminal works, De orthographia bohemica.
In English orthography, the term proper adjective is used to mean adjectives that take initial capital letters, and common adjective to mean those that do not. For example, a person from India is Indian—Indian is a proper adjective.
In the Latvian language, nouns, adjectives, pronouns and numerals are inflected in six declensions. There are seven cases:
The morphology of the Welsh language shows many characteristics perhaps unfamiliar to speakers of English or continental European languages like French or German, but has much in common with the other modern Insular Celtic languages: Irish, Scottish Gaelic, Manx, Cornish, and Breton. Welsh is a moderately inflected language. Verbs conjugate for person, tense and mood with affirmative, interrogative and negative conjugations of some verbs. A majority of prepositions inflect for person and number. There are few case inflections in Literary Welsh, being confined to certain pronouns.
Truecasing, also called capitalization recovery, capitalization correction, or case restoration, is the problem in natural language processing (NLP) of determining the proper capitalization of words where such information is unavailable. This commonly comes up due to the standard practice of automatically capitalizing the first word of a sentence. It can also arise in badly cased or noncased text.
Capitalization or capitalisation in English grammar is the use of a capital letter at the start of a word. English usage varies from capitalization in other languages.
Noon is a Cangin language of Senegal spoken in the Thiès region. There is an estimated population of 10,000- 50,000 speakers worldwide, rendering this language to be vulnerable. Ethnologue reports that it is 84% cognate with Lehar, essentially a divergent dialect, and 68% cognate with the other Cangin languages.
Malay grammar is the body of rules that describe the structure of expressions in the Malay language and Indonesian. This includes the structure of words, phrases, clauses and sentences. In Malay and Indonesian, there are four basic parts of speech: nouns, verbs, adjectives, and grammatical function words (particles). Nouns and verbs may be basic roots, but frequently they are derived from other words by means of prefixes and suffixes.
Reverential capitalization is the practice of capitalizing religious words that refer to deities or divine beings in cases where the words would not otherwise have been capitalized. Pronouns are also particularly included in reverential capitalization:
and God calleth to the light 'Day,' and to the darkness He hath called 'Night;' and there is an evening, and there is a morning — day one.
Jason Santa Maria, creative director of Happy Cog Studios, which designs Web sites, detected a basic breach of netiquette. "Hillary's text is all caps, like shouting," he said.
Capitalization Rules