TIMIT

Last updated January 25, 2026

TIMIT is a corpus of phonemically and lexically transcribed speech of American English speakers of different sexes and dialects. Each transcribed element has been delineated in time.

TIMIT was designed to further acoustic-phonetic knowledge and automatic speech recognition systems. It was commissioned by DARPA and corpus design was a joint effort between the Massachusetts Institute of Technology, SRI International, and Texas Instruments (TI). The speech was recorded at TI, transcribed at MIT, and verified and prepared for publishing by the National Institute of Standards and Technology (NIST).^[1] There is also a telephone bandwidth version called NTIMIT (Network TIMIT).

TIMIT and NTIMIT are not freely available — either membership of the Linguistic Data Consortium, or a monetary payment, is required for access to the dataset.

Data

TIMIT contains ~5 hours of speech, of 10 sentences spoken by each of 630 speakers. The sentences were randomly sampled from a corpus of 2342 sentences. The speakers were native speakers of American English, classified under 8 major dialect regions: New England, Northern, North Midland, South Midland, Southern, New York City, Western, Army Brat (moved around). The speakers were 70% male and 30% female.^[2]

Recordings were made in a noise-isolated recording booth at Texas Instrument, using a semi-automatic computer system (STEROIDS) to control the presentation of prompts to the speaker and the recording. Two-channel recordings were made using a Sennheiser HMD 414 headset-mounted microphone and a Brüel & Kjær 1/2" far-field pressure microphone (#4165). The speech was digitized at a sample rate of 20 kHz then and downsampled to 16 kHz.^[2]

History

The TIMIT telephone corpus was an early attempt to create a database with speech samples.^[3] It was published in the year 1988 on CD-ROM and consists of only 10 sentences per speaker. Two 'dialect' sentences were read by each speaker, as well as another 8 sentences selected from a larger set ^[4] Each sentence averages 3 seconds long and is spoken by 630 different speakers.^[5] It was the first notable attempt in creating and distributing a speech corpus and the overall project has produced costs of 1.5 million US$.^[6]

An update was released in October 1990. It included^[2]

full 630-speaker corpus;
checked and corrected transcriptions;
word-alignment transcriptions;
NIST SPHERE-headered waveform files and header manipulation software;
phonemic dictionary;
new test and training subsets balanced for dialectal and phonetic coverage;
more extensive documentation.

The full name of the project is DARPA-TIMIT Acoustic-Phonetic Continuous Speech Corpus^[7] and the acronym TIMIT stands for Texas Instruments/Massachusetts Institute of Technology. The main reason why a corpus of telephone speech was created was to train speech recognition software. In the Blizzard challenge, different software has the obligation to convert audio recordings into textual data and the TIMIT corpus was used as a standardized baseline.^[8]

References

↑ Fisher, William M.; Doddington, George R.; Goudie-Marshall, Kathleen M. (1986). "The DARPA Speech Recognition Research Database: Specifications and Status". Proceedings of DARPA Workshop on Speech Recognition. pp. 93–99.
1 2 3 Garofolo, John S.; Lamel, L. F.; Fisher, W. M.; Fiscus, Jonathan G.; Pallett, D. S.; Dahlgren, Nancy L. (1993-02-01). "DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM. NIST speech disc 1-1.1" (PDF). NIST Interagency/Internal Report. 93 (4930): 27403. Bibcode:1993STIN...9327403G. This article incorporates text from this source, which is in the public domain .
↑ Morales, Nicolas and Tejedor, Javier and Garrido, Javier and Colas, Jose and Toledano, Doroteo T (2008). "STC-TIMIT Generation of a single-channel telephone corpus". Proceedings of the Sixth International Language Resources and Evaluation (LREC'08): 391–395.{{cite journal}}: CS1 maint: multiple names: authors list (link)
↑ Lori F Lamel and Robert H. Kassel and Stephanie Seneff (1986). Speech Database Development: Design and Analysis of the Acoustic-Phonetic Corpus (Technical report). DARPA (SAIC-86/1546).
↑ John S Garofolo and Lori F Lamel and William M Fisher and Jonathan G Fiscus and David S Pallett and Nancy L Dahlgren (1993). DARPA TIMIT (Technical report). National Institute of Standards and Technology. doi:10.6028/nist.ir.4930.
↑ Nattanun Chanchaochai and Christopher Cieri and Japhet Debrah and Hongwei Ding and Yue Jiang and Sishi Liao and Mark Liberman and Jonathan Wright and Jiahong Yuan and Juhong Zhan and Yuqing Zhan (2018). GlobalTIMIT: Acoustic-Phonetic Datasets for the World's Languages. Interspeech 2018. ISCA. doi:10.21437/interspeech.2018-1185.
↑ Bauer, Patrick and Scheler, David and Fingscheidt, Tim (2010). WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network. LREC.{{cite conference}}: CS1 maint: multiple names: authors list (link)
↑ Sawada, Kei and Asai, Chiaki and Hashimoto, Kei and Oura, Keiichiro and Tokuda, Keiichi (2016). The NITech text-to-speech system for the Blizzard Challenge 2016. Blizzard Challenge 2016 Workshop.{{cite conference}}: CS1 maint: multiple names: authors list (link)

External links

TIMIT Acoustic-Phonetic Continuous Speech Corpus

This text corpus or speech corpus-related article is a stub. You can help Wikipedia by adding missing information.

This article about the English language is a stub. You can help Wikipedia by adding missing information.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Fisher, William M.; Doddington, George R.; Goudie-Marshall, Kathleen M. (1986). "The DARPA Speech Recognition Research Database: Specifications and Status". Proceedings of DARPA Workshop on Speech Recognition. pp. 93–99.

[:0-2] 1 2 3 Garofolo, John S.; Lamel, L. F.; Fisher, W. M.; Fiscus, Jonathan G.; Pallett, D. S.; Dahlgren, Nancy L. (1993-02-01). "DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM. NIST speech disc 1-1.1" (PDF). NIST Interagency/Internal Report. 93 (4930): 27403. Bibcode:1993STIN...9327403G. This article incorporates text from this source, which is in the public domain .

[3] Morales, Nicolas and Tejedor, Javier and Garrido, Javier and Colas, Jose and Toledano, Doroteo T (2008). "STC-TIMIT Generation of a single-channel telephone corpus". Proceedings of the Sixth International Language Resources and Evaluation (LREC'08): 391–395.{{cite journal}}: CS1 maint: multiple names: authors list (link)

[4] Lori F Lamel and Robert H. Kassel and Stephanie Seneff (1986). Speech Database Development: Design and Analysis of the Acoustic-Phonetic Corpus (Technical report). DARPA (SAIC-86/1546).

[5] John S Garofolo and Lori F Lamel and William M Fisher and Jonathan G Fiscus and David S Pallett and Nancy L Dahlgren (1993). DARPA TIMIT (Technical report). National Institute of Standards and Technology. doi:10.6028/nist.ir.4930.

[6] Nattanun Chanchaochai and Christopher Cieri and Japhet Debrah and Hongwei Ding and Yue Jiang and Sishi Liao and Mark Liberman and Jonathan Wright and Jiahong Yuan and Juhong Zhan and Yuqing Zhan (2018). GlobalTIMIT: Acoustic-Phonetic Datasets for the World's Languages. Interspeech 2018. ISCA. doi:10.21437/interspeech.2018-1185.

[7] Bauer, Patrick and Scheler, David and Fingscheidt, Tim (2010). WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network. LREC.{{cite conference}}: CS1 maint: multiple names: authors list (link)

[8] Sawada, Kei and Asai, Chiaki and Hashimoto, Kei and Oura, Keiichiro and Tokuda, Keiichi (2016). The NITech text-to-speech system for the Blizzard Challenge 2016. Blizzard Challenge 2016 Workshop.{{cite conference}}: CS1 maint: multiple names: authors list (link)

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

v t e Corpus linguistics
Text corpora, English	American National Corpus Bank of English Bergen Corpus of London Teenage Language British National Corpus Brown Corpus Buckeye Corpus Cambridge English Corpus Corpus of Contemporary American English Enron Corpus EnTenTen International Corpus of English Lancaster-Oslo-Bergen Corpus Oxford English Corpus PropBank Spoken English Corpus Switchboard Telephone Speech Corpus TIMIT VerbNet Wellington Corpus of Spoken New Zealand English
Text corpora, non-English	Bijankhan Corpus CHILDES CorCenCC National Corpus of Contemporary Welsh Croatian Language Corpus Croatian National Corpus Czech National Corpus Europarl Corpus German Reference Corpus Hamshahri Corpus National Corpus of Polish Neo-Assyrian Text Corpus Project Persian Speech Corpus Quranic Arabic Corpus Russian National Corpus Somali Corpus Scottish Corpus of Texts and Speech Slovenian National Corpus TalkBank Tatoeba Tekstaro de Esperanto TenTen Corpus Family Thesaurus Linguae Graecae
Organizations	BNC consortium COBUILD Sketch Engine

TIMIT

Contents

Data

History

See also

References

External links