William Whitaker's Words

Last updated

William Whitaker's Words is a computer program that parses the inflection or conjugation of a given Latin word entered by the user, and also translates the root into English. Conversely, given a basic English word, the program can output a Latin translation, generally with several possible Latin alternatives, although the database of translatable English words is not comprehensive. The software, written in Ada, is free for download but can be used online through several different hosts as well.

Contents

This program has gained popularity among Latinists because of its simple interface, high coverage of the Latin lexicon and mostly accurate results[ citation needed ]. Nevertheless, the user has to check the results, since WORDS uses a set of rules based on natural prefixation, suffixation, declension and conjugation to determine the possibility of an entry. As a consequence of this approach of analysing the structure of words, there is no guarantee that these words were ever used in Latin literature or speech, even if the program finds a possible meaning to a given word.

A few years after the original author's death, the software became the subject of digital preservation efforts.

Coverage

The dictionary consists of about 39,000 entries, which would result in hundreds of thousands of variations, counting declensions and conjugations. [1]

Additionally, the dictionary contains prefixes and suffixes. [2]

In comparison, the Oxford Latin Dictionary, considered to be the most complete Latin lexicon published in the English language, has about 34,000 entries, excluding proper names. [3] The Oxford Latin Dictionary has fewer entries because it only contains entries from Classical Latin, whereas WORDS contains words from many time periods.

Parsing process

For instance, given the Latin verb form amābantur, WORDS analyzes it as:

amābantur = am + (ā + ba + nt + ur), where

So amābantur is the passive, 3rd person, plural, imperfect, indicative form of the verb "to love", which would be translated "they were being loved".

About William Whitaker

William A. Whitaker (1936–2010) was a colonel in the United States Air Force. [4] While at the Defense Advanced Research Projects Agency (DARPA), he chaired the High Order Language Working Group that recommended development of the computer language Ada, in which WORDS is written. An accomplished Latinist, he created the translation software called WORDS after his retirement from the forces.

See also

Notes

  1. https://mk270.github.io/whitakers-words/index.html Words homepage
  2. "Whitaker's Words: Overview".
  3. ND.edu
  4. Death Notice

Related Research Articles

In linguistics, declension is the changing of the form of a word, generally to express its syntactic function in the sentence, by way of some inflection. Declensions may apply to nouns, pronouns, adjectives, adverbs, and determiners to indicate number, case, gender, and a number of other grammatical categories. Meanwhile, the inflectional change of verbs is called conjugation.

In linguistics and grammar, conjugation has two basic meanings. One meaning is the creation of derived forms of a verb from basic forms, or principal parts.

In grammar, a frequentative form of a word is one that indicates repeated action but is not to be confused with iterative aspect. The frequentative form can be considered a separate but not completely independent word called a frequentative. The frequentative is no longer productive in English, unlike in some language groups, such as Finno-Ugric, Balto-Slavic, and Turkic.

The subjunctive is a grammatical mood, a feature of an utterance that indicates the speaker's attitude toward it. Subjunctive forms of verbs are typically used to express various states of unreality such as wish, emotion, possibility, judgment, opinion, obligation, or action that has not yet occurred; the precise situations in which they are used vary from language to language. The subjunctive is one of the irrealis moods, which refer to what is not necessarily real. It is often contrasted with the indicative, a realis mood which principally indicates that something is a statement of fact.

Russian grammar employs an Indo-European inflexional structure, with considerable adaptation.

Verbs in the Finnish language can be divided into six main groups depending on the stem type, both for formal analysis and for teaching the language to non-native speakers. All six types have the same set of personal endings, but the stems assume different suffixes and undergo (slightly) different changes when inflected.

Lithuanian grammar retains many archaic features from Proto-Balto-Slavic that have been lost in other Balto-Slavic languages.

Georgian grammar has many distinctive and extremely complex features, such as split ergativity and a polypersonal verb agreement system.

Argobba is an Ethiopian Semitic language spoken in several districts of Afar, Amhara, and Oromia regions of Ethiopia by the Argobba people. It belongs to the South Ethiopic languages subgroup, and is closely related to Amharic.

This article describes the grammar of the standard Tajik language as spoken and written in Tajikistan. In general, the grammar of the Tajik language fits the analytical type. Little remains of the case system, and grammatical relationships are primarily expressed via clitics, word order and other analytical constructions. Like other modern varieties of Persian, Tajik grammar is almost identical to the classic Persian grammar, although there are differences in some verb tenses.

Ukrainian grammar is complex and characterised by a high degree of inflection; moreover, it has a relatively free word order, although the dominant arrangement is subject–verb–object (SVO). Ukrainian grammar describes its phonological, morphological, and syntactic rules. Ukrainian has seven grammatical cases and two numbers for its nominal declension and two aspects, three tenses, three moods, and two voices for its verbal conjugation. Adjectives agree in number, gender, and case with their nouns.

Sanskrit has inherited from its parent, the Proto-Indo-European language, an elaborate system of verbal morphology, much of which has been preserved in Sanskrit as a whole, unlike in other kindred languages, such as Ancient Greek or Latin. Sanskrit verbs thus have an inflection system for different combinations of tense, aspect, mood, voice, number, and person. Non-finite forms such as participles are also extensively used.

Icelandic grammar is the set of structural rules that describe the use of the Icelandic language.

Arbore is an Afro-Asiatic language spoken by the Arbore people in southern Ethiopia in a few settlements of Hamer woreda near Lake Chew Bahir.

<span class="mw-page-title-main">Inflection</span> Process of word formation

In linguistic morphology, inflection is a process of word formation in which a word is modified to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, mood, animacy, and definiteness. The inflection of verbs is called conjugation, while the inflection of nouns, adjectives, adverbs, etc. can be called declension.

This article deals with the grammar of the Udmurt language.

Old Norse has three categories of verbs and two categories of nouns. Conjugation and declension are carried out by a mix of inflection and two nonconcatenative morphological processes: umlaut, a backness-based alteration to the root vowel; and ablaut, a replacement of the root vowel, in verbs.

Serbo-Croatian is a South Slavic language that, like most other Slavic languages, has an extensive system of inflection. This article describes exclusively the grammar of the Shtokavian dialect, which is a part of the South Slavic dialect continuum and the basis for the Bosnian, Croatian, Montenegrin, and Serbian standard variants of Serbo-Croatian. "An examination of all the major 'levels' of language shows that BCS is clearly a single language with a single grammatical system."

This article concerns the morphology of the Albanian language, including the declension of nouns and adjectives, and the conjugation of verbs. It refers to the Tosk-based Albanian standard regulated by the Academy of Sciences of Albania.

Levantine Arabic grammar is the set of rules by which Levantine Arabic creates statements, questions and commands. In many respects, it is quite similar to that of the other vernacular Arabic varieties.