International Computer Archive of Modern and Medieval English

Last updated August 15, 2025

The International Computer Archive of Modern and Medieval English (ICAME) is an international group of linguists and data scientists working in corpus linguistics to digitise English texts.^[1] The organisation was founded in Oslo, Norway in 1977 as the International Computer Archive of Modern English, before being renamed to its current title.^[2]

Its primary objectives were:^[3]

collecting and distributing information on
- English language material available for computer processing; and
- linguistic research completed or in progress on this material;
compiling an archive of corpora to be located at the University of Bergen, from where copies of the material can be obtained at cost.

The portal to their materials is hosted at the University of Bergen, where they have set out the aim of the organization to "collect and distribute information on English language material available for computer processing and on linguistic research to compile an archive of English text corpora in machine-readable form, and to make material available to research institutions."^[4] Creating computer corpora, i.e. collections of texts in machine-readable form, is the most accessible way to study both transcribed spoken language and various genres of written texts for modern scholars, including both "descriptive and more theoretically-minded linguists".^[5]

The ICAME group hosts academic conferences that focus on corpus linguistic studies of historical changes and contemporary grammatical descriptions of English, and makes corpora of different varieties of English available to scholars, starting with editions of the 1960s Brown Corpus. Their first academic conference was held in Bergen, Norway in 1979, and scholars who were interested in corpus linguistics continued to meet each spring in different European and English-speaking countries. At these meetings, the compilation and distribution of corpora they enabled played a key role in the creation of the field of corpus linguistics in the 20th century, a precursor to current big data analytics. In summarizing the field, Kennedy's Introduction to Corpus Linguistics notes that "for corpus linguists with an interest in the description of English, the International Computer Archive of Modern and Medieval English has been the major resource".^[6] The influence of ICAME on the field has also be laid out in Facchinetti's history, Corpus Linguistics Twenty-five Years On.^[7]

One influential resource that ICAME made available was a CD of 20 different corpora, including those covering different regional Englishes (such as the Australian Corpus of English, the Wellington Corpus of Spoken New Zealand English, the Kolhapur Corpus of Indian English, the Bergen Corpus of London Teenage Language (COLT), the Helsinki Corpus of Older Scots, and the International Corpus of English—East-African component), as well as versions of the Brown Corpus and the Lancaster-Bergen-Oslo (LOB) corpus tagged for part of speech.^[8]

ICAME also published an annual journal, the ICAME Journal, formerly ICAME News,^[9] that contains articles, conference reports, reviews and notices related to corpus linguistics.^[10] The current editors of the ICAME Journal are Merja Kytö and Anna-Brita Stenström.^[11]

I am wearing a tie clip in the shape of a monkey wrench... The story behind this peculiar piece of jewelry goes back to the early 60s when I was assembling the notorious Brown Corpus and others were using computers to make concordances of William Butler Yeats and other poets. One of my colleagues, a specialist in modem Irish literature, was heard to remark that anyone who would use a computer on good literature was nothing but a plumber. Some of my students responded by forming a linguistic plumber's union, the symbol of which was, of course, a monkey wrench.

— W. Nelson Francis, Dinner speech given at the 5th ICAME Conference on Computers in English Language Research, Windermere, England, 21 May 1984, ICAME news, issue 10 (1985).

References

↑ Corpus Linguistics and Beyond: Proceedings of the Seventh International Conference on English Language Research on Computerized Corpora. Vol. 59. Rodopi. 1987. p. vi. ISBN 978-9-062-03569-4.
↑ Kennedy, Graeme (19 September 2014). An Introduction to Corpus Linguistics. Routledge. p. 85. ISBN 978-1-317-89258-8.
↑ https://icame.info/icame_static/archives/No_10_ICAME_News_index.pdf#page=10 ^{[ bare URL ]}
↑ "ICAME" . Retrieved March 28, 2015.
↑ Johansson, Stig (1994). "ICAME-Quo Vadis? Reflections on the use of computer corpora in linguistics". Computers and the Humanities. 28 (4–5): 243–252. doi:10.1007/BF01830271. S2CID 20568137.
↑ Kennedy, Graeme (2014). Introduction to Corpus Linguistics. Routledge. pp. ch. 2.
↑ Facchinetti, Roberta (2007). Corpus Linguistics Twenty-Five Years On. Brill / Rodopi.
↑ Hofland, K.; et al. (1999). ICAME collection of English language corpora [CD].
↑ "degruyter ICAME supplement" (PDF). Retrieved March 28, 2015.
↑ "The LinguistList--ICAME Journal". Archived from the original on October 26, 2007. Retrieved March 28, 2015.
↑ "ICAME Journal" . Retrieved March 28, 2015.

Authority control databases
International	ISNI VIAF
National	United States Israel
Other	IdRef Yale LUX

International Computer Archive of Modern and Medieval English

References

Further reading