Switchboard Telephone Speech Corpus

Last updated

The Switchboard Telephone Speech Corpus is a corpus of spoken English language consisted of almost 260 hours of speech. It was created in 1990 by Texas Instruments via a DARPA grant, and released in 1992 by NIST. The corpus contains 2,400 telephone conversations among 543 US speakers (302 male, 241 female). [1] [2] [3] Participants did not know each other, and conversations were held on topics from a predetermined list. [4]

Switchboard-2 Phase II was collected in 1999 and includes "4,472 five-minute telephone conversations involving 679 participants". [5]

The corpus was used for development of speech recognition algorithms. [6]

Text example: [7]

A: All right um well [laughter-uh] let's see i'm twenty
B: How old are you Lisa. Okay that i'm older
A: Yeah how old are you. Older [laughter]
B: Older than you [laughter-are]
A: [laughter-okay]
B: Okay we are supposed to talk about places we like to go so i'm gonna and where are you from where are you calling from?
A: I'm calling from uh Provo Utah but I'm from Plano Texas
B: Oh you are from Plano my sister lives in Plano yes her husband is the new Director of Admissions at uh University of Texas at Dallas
A: Oh really. Oh wow my dad used to work at UTD also
B: Yeah so I [vocalized-noise]. Anyway so where's your favorite place to go?
A: Um. Generally we just go on family vacations to Arizona my grandparents live there that's generally our usual summer vacation

Further reading

Related Research Articles

<span class="mw-page-title-main">Telephone switchboard</span> Device used to connect telephone circuits to establish calls between users

A telephone switchboard was a device used to connect circuits of telephones to establish telephone calls between users or other switchboards, throughout the 20th century. The switchboard was an essential component of a manual telephone exchange, and was operated by switchboard operators who used electrical cords or switches to establish the connections.

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

Telephony is the field of technology involving the development, application, and deployment of telecommunication services for the purpose of electronic transmission of voice, fax, or data, between distant parties. The history of telephony is intimately linked to the invention and development of the telephone.

An interjection is a word or expression that occurs as an utterance on its own and expresses a spontaneous feeling or reaction. It is a diverse category, encompassing many different parts of speech, such as exclamations (ouch!, wow!), curses (damn!), greetings, response particles, hesitation markers, and other words. Due to its diverse nature, the category of interjections partly overlaps with a few other categories like profanities, discourse markers, and fillers. The use and linguistic discussion of interjections can be traced historically through the Greek and Latin Modistae over many centuries.

<span class="mw-page-title-main">Betty Ong</span> American flight attendant (1956–2001)

Betty Ann Ong was an American flight attendant who worked for American Airlines and boarded Flight 11, the first airplane hijacked during the September 11 attacks. Ong was the first person to alert authorities to the hijackings taking place that day. Shortly after the hijacking, Ong notified the American Airlines ground crew of the hijacking, staying on the radiophone for 23 minutes to relay vital information that led to the closing of airspace by the FAA, a first in United States history. For this, the 9/11 Commission declared Ong a hero.

<span class="mw-page-title-main">Switchboard operator</span> Former telephony occupation

In the early days of telephony, companies used manual telephone switchboards, and switchboard operators connected calls by inserting a pair of phone plugs into the appropriate jacks. They were gradually phased out and replaced by automated systems, first those allowing direct dialing within a local area, then for long-distance and international direct dialing.

<span class="mw-page-title-main">Life's Been Good</span> 1978 single by Joe Walsh

"Life's Been Good" is a song by American singer-songwriter and multi-instrumentalist Joe Walsh that first appeared on the soundtrack to the 1978 film FM. The original eight-minute version was released on Walsh's fourth studio album But Seriously, Folks... (1978), and an edited four-minute single version peaked at No. 12 on the US Billboard Hot 100, remaining his biggest solo hit.

<i>Stop the Bleeding</i> (Tourniquet album) 1990 studio album by Tourniquet

Stop the Bleeding is the debut studio album by the American Christian metal band Tourniquet. It was originally released on Intense Records in 1990. A remastered version was released independently on Pathogenic Records in 2001, which was later re-released in 2011. Retroactive Records released a Collector's Edition remaster on June 26, 2020. The remasters include updated artwork, expanded album booklets, and bonus tracks.

<span class="mw-page-title-main">Plano Senior High School</span> Public high school in Plano, Texas, United States

Plano Senior High School is a public secondary school in Plano, Texas, serving students in grades 11–12. The school is part of the Plano Independent School District, with admission based primarily on the locations of students' homes. Plano is a two-time Blue Ribbon School and a Texas Exemplary School. Students at Plano Senior typically attended one of two feeder high schools: Clark or Vines.

<span class="mw-page-title-main">Boomhauer</span> Fictional character

Jeffrey Dexter Boomhauer III, most commonly referred to as Boomhauer, is a fictional character in the Fox animated series King of the Hill, voiced by series creator Mike Judge, known for his fast-paced and nearly-incomprehensible speech.

In linguistics, a filler, filled pause, hesitation marker or planner is a sound or word that participants in a conversation use to signal that they are pausing to think but are not finished speaking. These are not to be confused with placeholder names, such as thingamajig. Fillers fall into the category of formulaic language, and different languages have different characteristic filler sounds. The term filler also has a separate use in the syntactic description of wh-movement constructions.

Herbert Herb Clark is a psycholinguist currently serving as Professor of Psychology at Stanford University. His focuses include cognitive and social processes in language use; interactive processes in conversation, from low-level disfluencies through acts of speaking and understanding to the emergence of discourse; and word meaning and word use. Clark is known for his theory of "common ground": individuals engaged in conversation must share knowledge in order to be understood and have a meaningful conversation. Together with Deanna Wilkes-Gibbs (1986), he also developed the collaborative model, a theory for explaining how people in conversation coordinate with one another to determine definite references. Clark's books include Semantics and Comprehension, Psychology and Language: An Introduction to Psycholinguistics, Arenas of Language Use and Using Language.

<span class="mw-page-title-main">History of the telephone</span> 19th-century development of the modern telephone

This history of the telephone chronicles the development of the electrical telephone, and includes a brief overview of its predecessors. The first telephone patent was granted to Alexander Graham Bell in 1869.

<span class="mw-page-title-main">Alicia Warrington</span> American drummer (born 1980)

Alicia Warrington is an American drummer and professional wrestling ring announcer. She has played drums for Kate Nash, Kelly Osbourne, Lillix, Hannah Montana, Uh Huh Her, Gore Gore Girls, Dawn Robinson, The All-Girl Boys Choir, The Dollyrots, The Bruises, Tracy Chapman, Selena Gomez, Colton Dixon, Chris Rene and others.

Linguistic categories include

<i>Avengers</i> (album) 1983 compilation album by Avengers

Avengers is a compilation album by the American punk group Avengers. It was released on vinyl in 1983 by CD Presents. It is the closest thing to a studio album the band has, although it was compiled by drummer Danny Furious from various recordings the band did in their three years of existence.

In linguistics, a backchanneling during a conversation occurs when one participant is speaking and another participant interjects responses to the speaker. A backchannel response can be verbal, non-verbal, or both. Backchannel responses are often phatic expressions, primarily serving a social or meta-conversational purpose, such as signifying the listener's attention, understanding, sympathy, or agreement, rather than conveying significant information. Examples of backchanneling in English include such expressions as "yeah", "OK", "uh-huh", "hmm", "right", and "I see".

<span class="mw-page-title-main">Air Illinois Flight 710</span> 1983 aviation accident

Air Illinois Flight 710 was a scheduled passenger flight from Chicago to Carbondale, Illinois, United States. On the night of October 11, 1983, the Hawker Siddeley HS 748 operating the flight crashed near Pinckneyville, Illinois due to the flightcrew's mismanagement of electrical generator and distribution problems. All 10 passengers and crew were killed in the accident.

Claire Hardaker is a British linguist. She is senior lecturer at the Department of Linguistics and English Language of Lancaster University, United Kingdom. Her research involves forensic linguistics and corpus linguistics. Her research focuses on deceptive, manipulative, and aggressive language in a range of online data. She has investigated behaviours ranging from trolling and disinformation to human trafficking and online scams. Her research typically uses corpus linguistic methods to approach forensic linguistic analyses.

<span class="mw-page-title-main">English interjections</span> Interjections in the English language

English interjections are a category of English words – such as yeah, ouch, Jesus, oh, mercy, yuck, etc. – whose defining features are the infrequency with which they combine with other words to form phrases, their loose connection to other elements in clauses, and their tendency to express emotive meaning. These features separate English interjections from the language's other lexical categories, such as nouns and verbs. Though English interjections, like interjections in general, are often overlooked in descriptions of the language, English grammars do offer minimal descriptions of the category.

References

  1. "Switchboard-1 Release 2 - Linguistic Data Consortium". catalog.ldc.upenn.edu. Retrieved 26 January 2024.
  2. "Papers with Code - Switchboard-1 Corpus Dataset". paperswithcode.com. Retrieved 26 January 2024.
  3. Godfrey, John J.; Holliman, Edward C.; McDaniel, Jane (23 March 1992). "SWITCHBOARD: Telephone speech corpus for research and development". [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE Computer Society. pp. 517–520. doi:10.1109/ICASSP.1992.225858. ISBN   0-7803-0532-9. S2CID   61412708 . Retrieved 26 January 2024.
  4. "NXT Swbd Overview". groups.inf.ed.ac.uk. Retrieved 26 January 2024.
  5. "Switchboard-2 Phase II - Linguistic Data Consortium". catalog.ldc.upenn.edu. Retrieved 26 January 2024.
  6. "Switchboard Transcription System". www1.icsi.berkeley.edu. Retrieved 26 January 2024.
  7. Soni, Mayank; Spillane, Brendan; Gilmartin, Emer; Saam, Christian; Cowan, Benjamin R.; Wade, Vincent (2021). "An Empirical Study of Topic Transition in Dialogue". arXiv: 2111.14188 [cs.CL].