Speech disfluency

Last updated

A speech disfluency, also spelled speech dysfluency, is any of various breaks, irregularities, or non-lexical vocables which occur within the flow of otherwise fluent speech. These include "false starts", i.e. words and sentences that are cut off mid-utterance; phrases that are restarted or repeated, and repeated syllables; "fillers", i.e. grunts, and non-lexical or semiarticulate utterances such as huh, uh, erm, um, and hmm, and, in English, well, so, I mean, and like; and "repaired" utterances, i.e. instances of speakers correcting their own slips of the tongue or mispronunciations (before anyone else gets a chance to). Huh is claimed to be a universal syllable. [1]

Contents

Definition

A disfluence or nonfluence is a non-pathological hesitance when speaking, the use of fillers (“like” or “uh”), or the repetition of a word or phrase. This needs to be distinguished from a fluency disorder like stuttering with an interruption of fluency of speech, accompanied by "excessive tension, speaking avoidance, struggle behaviors, and secondary mannerism". [2]

Fillers

Fillers are parts of speech which are not generally recognized as purposeful or containing formal meaning, usually expressed as pauses such as uh, like and er, but also extending to repairs ("He was wearing a blackuh, I mean a blue, a blue shirt"), and articulation problems such as stuttering. Use is normally frowned upon in mass media such as news reports or films, but they occur regularly in everyday conversation, sometimes representing upwards of 20% of "words" in conversation. [3] Fillers can also be used as a pause for thought ("I arrived at, um—3 o'clock"), and when used in this function are called hesitation markers or planners. [4]

Language-dependence

Research in computational linguistics has revealed a correlation between native language and patterns of disfluencies in spontaneously uttered speech. [5] Besides that research, there are other subjective accounts reported by individuals.

According to one commentator,[ who? ] Americans use pauses such as um or em, the Irish commonly use the pause em, [6] the British say uh or eh, the French use euh, the Germans say äh (pronounced eh or er), the Dutch use eh, Japanese use ああ ā, あのう anō or ええと ēto, the Spanish say ehhh (also used in Hebrew) and como (normally meaning 'like'), and Latin Americans but not the Spanish use este (normally meaning 'this'). Besides er and uh, the Portuguese use or é.

In Mandarin, 那个; nà gè and 这个; zhè ge are used, meaning 'that' or 'this', respectively. Arabic speakers say يعني, the pronunciation of which is close to yaa'ni, [jæʕni] or [jaʕni], (literally 'he means'; there is no grammatical gender-neutral third person) and Turkish say şey in addition to yani (without the [ ʕ ] found in Arabic) and ııı.[ citation needed ]

Despite the differences between languages, pause fillers in different languages often sound similar because they tend to be the easiest and most neutral vowel sounds to make (such as the schwa), i.e the sounds that can be pronounced with a relaxed tongue or jaw. [7]

Research

Recent[ when? ] linguistic research has suggested that non-pathological disfluencies may contain a variety of meaning; the frequency of uh and um in English is often reflective of a speaker's alertness or emotional state. Some have hypothesized that the time of an uh or um is used for the planning of future words; [8] other researchers have suggested that they are actually to be understood as full-fledged function words rather than accidents, indicating a delay of variable time in which the speaker wishes to pause without voluntarily yielding control of the dialogue. There is some debate as to whether to consider them a form of noise or as a meaning-filled part of language, but disfluency can improve language understanding by signalling that the speaker may be about to say something new or complex. [9] [10]

Hmm

Hmm is an exclamation (an emphatic interjection) typically used to express reflection, uncertainty, thoughtful absorption, or hesitation. [11] Hmm is technically categorized as an interjection, like um, huh, ouch, erm, and wow. The first h-sound is a mimic for breathing out, and the second m-sound, since the mouth is closed, is representing that the person is not currently sure what to say (erm and um are used similarly). The pause filler indicates that the person is temporarily speechless, but still engaged in thought. The variety of tones, pitches, and lengths used add nuances in meaning. [12]

A "hmm" emoji Twemoji2 1f914.svg
A "hmm" emoji

Etymology

The expression is used in many different languages; however, the origin of hmm is difficult to find, mainly because "the word is so natural that it may have arisen at any time", as highlighted by Anatoly Liberman, a linguist at the University of Minnesota and an expert on word origins. It is possible Neanderthals might have used hmm. Nicholas Christenfeld, a psychologist at the University of California, San Diego, and an expert on filled pauses, attests hmm is popular largely since it is such a neutral sound and that "it's easier to say than anything else". [12] The earliest attestations of hmm are from Shakespeare, "I cried hum ... But markt him not a word" (1598 Shakespeare Henry IV, Pt. 1 iii. i. 154). It may be a vocable that grew out of lexicalized throat-clearing. [13]

Use as a filler word

Hmm is a "filler" word, like um and er. Typically, hmm is uttered when the person is being especially conscious about whom they are talking with, and as a result are thinking deeply about what to say. Moreover, the use of hmm is often interactional and cognitive. The interactional function is to do with politeness: if someone is invited to a party and responds "no" without a filled pause, they might appear rude, but a reply of "Hmm, sorry, no" might appear much more polite, as it seems the speaker is giving the offer some thought, rather than abruptly declining. [14]

Thoughtful absorption

The use of hmm is typically used during "thoughtful absorption", which is when one is engrossed [15] in their flow of ideas and associations, that lead to a reality-oriented conclusion. [16] The utterance of hmm is key for listeners to understand that the speaker is currently engaged in thought; if the speaker thought silently instead, listeners may be unsure if the speaker had finished their utterance. Um and er are also used during thoughtful absorption; however, typically the extent of the absorption of thought is more limited since um and er are usually spoken mid-sentence [17] and for shorter periods of time than hmm. For this reason, thoughtful absorption is typically associated with the utterance of hmm. [18]

Huh – the universal syllable

A 2013 study suggested that the word/syllable huh is perhaps the most recognized syllable throughout the world. [19] It is an interrogative which crosses geography, language, cultures and nationalities. [20]

See also

Related Research Articles

<span class="mw-page-title-main">Utterance</span> Smallest unit of speech

In spoken language analysis, an utterance is a continuous piece of speech, by one person, before or after which there is silence on the part of the person. In the case of oral languages, it is generally, but not always, bounded by silence. Utterances do not exist in written language; only their representations do. They can be represented and delineated in written language in many ways.

An interjection is a word or expression that occurs as an utterance on its own and expresses a spontaneous feeling or reaction. It is a diverse category, encompassing many different parts of speech, such as exclamations (ouch!, wow!), curses (damn!), greetings, response particles, hesitation markers, and other words. Due to its diverse nature, the category of interjections partly overlaps with a few other categories like profanities, discourse markers, and fillers. The use and linguistic discussion of interjections can be traced historically through the Greek and Latin Modistae over many centuries.

In English, the word like has a very flexible range of uses, ranging from conventional to non-standard. It can be used as a noun, verb, adverb, adjective, preposition, particle, conjunction, hedge, filler, quotative, and semi-suffix.

Paralanguage, also known as vocalics, is a component of meta-communication that may modify meaning, give nuanced meaning, or convey emotion, by using techniques such as prosody, pitch, volume, intonation, etc. It is sometimes defined as relating to nonphonemic properties only. Paralanguage may be expressed consciously or unconsciously.

In the broadest sense of the word, a vocable is any identifiable utterance or writing, such as a word or term, that is fixed by their language and culture. The use of the term for words in the broad sense is archaic and the term is instead used for utterances which are not considered words, such as the English interjections of assent and denial, uh-huh and uh-uh, or the interjection of error, uh-oh.

In linguistics, prosody is the study of elements of speech that are not individual phonetic segments but which are properties of syllables and larger units of speech, including linguistic functions such as intonation, stress, and rhythm. Such elements are known as suprasegmentals.

<span class="mw-page-title-main">Word</span> Basic element of language

A word is a basic element of language that carries meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguists on its definition and numerous attempts to find specific criteria of the concept remain controversial. Different standards have been proposed, depending on the theoretical background and descriptive context; these do not converge on a single definition. Some specific definitions of the term "word" are employed to convey its different meanings at different levels of description, for example based on phonological, grammatical or orthographic basis. Others suggest that the concept is simply a convention used in everyday situations.

Cluttering is a speech and communication disorder characterized by a rapid rate of speech, erratic rhythm, and poor syntax or grammar, making speech difficult to understand.

A filled pause is a non-silent pause in an otherwise fluent speech, where instead of a silent pause there is a filler. The filler can be non-lexical or semiarticulate utterances such as huh, uh, erm, um, or hmm. Fillers may also include words such as well, so, I mean, and like, when used in ways that don't change the meaning of the surrounding speech.

Formulaic language is a linguistic term for verbal expressions that are fixed in form, often non-literal in meaning with attitudinal nuances, and closely related to communicative-pragmatic context. Along with idioms, expletives and proverbs, formulaic language includes pause fillers and conversational speech formulas.

A speech error, commonly referred to as a slip of the tongue or misspeaking, is a deviation from the apparently intended form of an utterance. They can be subdivided into spontaneously and inadvertently produced speech errors and intentionally produced word-plays or puns. Another distinction can be drawn between production and comprehension errors. Errors in speech production and perception are also called performance errors. Some examples of speech error include sound exchange or sound anticipation errors. In sound exchange errors, the order of two individual morphemes is reversed, while in sound anticipation errors a sound from a later syllable replaces one from an earlier syllable. Slips of the tongue are a normal and common occurrence. One study shows that most people can make up to as much as 22 slips of the tongue per day.

Speech segmentation is the process of identifying the boundaries between words, syllables, or phonemes in spoken natural languages. The term applies both to the mental processes used by humans, and to artificial processes of natural language processing.

In linguistics, a filler, filled pause, hesitation marker or planner is a sound or word that participants in a conversation use to signal that they are pausing to think but are not finished speaking. These are not to be confused with placeholder names, such as thingamajig. Fillers fall into the category of formulaic language, and different languages have different characteristic filler sounds. The term filler also has a separate use in the syntactic description of wh-movement constructions.

A discourse marker is a word or a phrase that plays a role in managing the flow and structure of discourse. Since their main function is at the level of discourse rather than at the level of utterances or sentences, discourse markers are relatively syntax-independent and usually do not change the truth conditional meaning of the sentence. They can also indicate what a speaker is doing on a variety of different planes. Examples of discourse markers include the particles oh, well, now, then, you know, and I mean, and the discourse connectives so, because, and, but, and or. The term discourse marker was popularized by Deborah Schiffrin in her 1987 book Discourse Markers.

Herbert Herb Clark is a psycholinguist currently serving as Professor of Psychology at Stanford University. His focuses include cognitive and social processes in language use; interactive processes in conversation, from low-level disfluencies through acts of speaking and understanding to the emergence of discourse; and word meaning and word use. Clark is known for his theory of "common ground": individuals engaged in conversation must share knowledge in order to be understood and have a meaningful conversation. Together with Deanna Wilkes-Gibbs (1986), he also developed the collaborative model, a theory for explaining how people in conversation coordinate with one another to determine definite references. Clark's books include Semantics and Comprehension, Psychology and Language: An Introduction to Psycholinguistics, Arenas of Language Use and Using Language.

Jean E. Fox Tree is a professor in the Department of Psychology at the University of California at Santa Cruz. Fox Tree studies collateral signals that people use in spontaneous speech, such as fillers, prosodic information, fillers, and speech disfluencies.

Paraphasia is a type of language output error commonly associated with aphasia, and characterized by the production of unintended syllables, words, or phrases during the effort to speak. Paraphasic errors are most common in patients with fluent forms of aphasia, and come in three forms: phonemic or literal, neologistic, and verbal. Paraphasias can affect metrical information, segmental information, number of syllables, or both. Some paraphasias preserve the meter without segmentation, and some do the opposite. However, most paraphasias affect both partially.

Speech production is the process by which thoughts are translated into speech. This includes the selection of words, the organization of relevant grammatical forms, and then the articulation of the resulting sounds by the motor system using the vocal apparatus. Speech production can be spontaneous such as when a person creates the words of a conversation, reactive such as when they name a picture or read aloud a written word, or imitative, such as in speech repetition. Speech production is not the same as language production since language can also be produced manually by signs.

In linguistics, a backchanneling during a conversation occurs when one participant is speaking and another participant interjects responses to the speaker. A backchannel response can be verbal, non-verbal, or both. Backchannel responses are often phatic expressions, primarily serving a social or meta-conversational purpose, such as signifying the listener's attention, understanding, sympathy, or agreement, rather than conveying significant information. Examples of backchanneling in English include such expressions as "yeah", "OK", "uh-huh", "hmm", "right", and "I see".

<span class="mw-page-title-main">English interjections</span> Interjections in the English language

English interjections are a category of English words – such as yeah, ouch, Jesus, oh, mercy, yuck, etc. – whose defining features are the infrequency with which they combine with other words to form phrases, their loose connection to other elements in clauses, and their tendency to express emotive meaning. These features separate English interjections from the language's other lexical categories, such as nouns and verbs. Though English interjections, like interjections in general, are often overlooked in descriptions of the language, English grammars do offer minimal descriptions of the category.

References

  1. Dingemanse, Mark; Torreira, Francisco; Enfield, N. J. (2013). "Is "Huh?" a Universal Word? Conversational Infrastructure and the Convergent Evolution of Linguistic Items". PLOS ONE. 8 (11): e78273. Bibcode:2013PLoSO...878273D. doi: 10.1371/journal.pone.0078273 . PMC   3832628 . PMID   24260108.
  2. "Fluency Disorders". American Speech-Language-Hearing Association. Retrieved 2023-03-11.
  3. Fox Tree, J. E. (1995). "The effects of false starts and repetitions on the processing of subsequent words in spontaneous speech". Journal of Memory and Language. 34 (6): 709–738. doi:10.1006/jmla.1995.1032.
  4. Tottie, Gunnel (2016). "Planning what to say: Uh and um among the pragmatic markers". In Kaltenbock, Gunther; Keizer, Evelien; Lohmann, Arne (eds.). Outside the Clause: Form and Function of Extra-Clausal Constituents. pp. 97–122.
  5. Lamel, L.; Adda-Deckes, M.; Gauvain, J.L.; Adda, G. (1996). "Spoken language processing in a multilingual context". Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96. Vol. 4. pp. 2203–2206. CiteSeerX   10.1.1.16.6488 . doi:10.1109/ICSLP.1996.607242. ISBN   978-0-7803-3555-4. S2CID   8736842.
  6. "Guide to speaking with an Irish accent". 16 June 2020.
  7. Erard, M. (2007). Um...: Slips, Stumbles, and Verbal Blunders, and What They Mean. New York: Pantheon Books.
  8. Kowal, Sabine; Wiese, Richard; O'Connell, Daniel C. (1983). "The use of time in story telling". Language and Speech. 26 (4): 377–392. doi:10.1177/002383098302600405. S2CID   142712380.
  9. Arnold, J.; Tanenhaus, M. K; Altmann, R.; Fagnano, M. (2004). "The Old and Thee, uh, New". Psychological Science. 15 (9): 578–582. doi:10.1111/j.0956-7976.2004.00723.x. PMID   15327627.
  10. Arnold, J.E.; Hudson Kam, C.; Tanenhaus, M.K. (2007). "If you say thee uh- you're describing something hard: the on-line attribution of disfluency during reference comprehension". Journal of Experimental Psychology: Learning, Memory, and Cognition. 33 (5): 914–930. doi:10.1037/0278-7393.33.5.914. PMID   17723069.
  11. Online Dictionary Definitions of "hmm"
  12. 1 2 Wolchover, Natalie (8 June 2012). "Why do We Say 'Hmm' when Thinking?". Live Science .
  13. "HMM | Origin and meaning of HMM by Online Etymology Dictionary".
  14. "Why you say 'um' 'like' and 'you know?' so much" . Independent.co.uk . 2017-04-04. Archived from the original on 2022-05-24.
  15. "Absorption | Definition of absorption in English by Oxford Dictionaries". Archived from the original on September 25, 2016.
  16. Marić, Jovan (2005). Klinicka psihijatrija. Belgrade: Naša knjiga. p. 22. ISBN   978-86-901559-1-0.
  17. "Fill in the Gaps: 15+ Common English Filler Words You Should Know | FluentU English". 19 June 2023.
  18. Online Contrasting Dictionary Definitions of "hmm", "um", and "er"
  19. Schuessler, Jennifer (November 9, 2013). "The Syllable that Everyone Understands". The New York Times . Retrieved November 9, 2013.
  20. Dingemanse, Mark; Torreira, Francisco; Enfield, N. J. (2013). "Is "Huh?" a Universal Word? Conversational Infrastructure and the Convergent Evolution of Linguistic Items PLoS ONE 8(11): e78273". PLOS ONE. 8 (11): e78273. Bibcode:2013PLoSO...878273D. doi: 10.1371/journal.pone.0078273 . PMC   3832628 . PMID   24260108.

Further reading