Word frequency effect

The word frequency effect is a psychological phenomenon in which words encountered more frequently are recognized faster than words encountered less frequently. [1] The size of the effect depends on the individual's familiarity with the tested language. [2] The phenomenon also extends to the individual characters of a word in non-alphabetic languages such as Chinese. [3]

A word is considered high frequency if it is commonly used in daily speech, such as the word "the". A word is considered low frequency if it is not commonly used, such as the word "strait". [4] Some languages, such as Chinese, have multiple levels of daily speech that affect the frequency of words: frequency can be measured at the character level or at the word level. [3] There is also an effect of frequency at the orthographic level. [5] Lower frequency words benefit more from a single repetition than higher frequency words. [6]

Examples

Word        Ranking
The         1st [7]
At          20th
So          50th
Did         70th
Got         100th
Mind        300th
Chaos       5,000th
Falkland    20,000th
Marche      45,000th
Tisane      85,000th
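
Rankings like those above come from counting how often each word occurs in a large corpus and sorting the counts. The following is a minimal sketch of that idea, not the method used by the cited source: the function name `frequency_ranks`, the tokenizing pattern, and the sample sentence are all assumptions made for illustration.

```python
import re
from collections import Counter


def frequency_ranks(text):
    """Return {word: rank}, where rank 1 is the most frequent word."""
    # Assumed tokenization: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    # Sort by descending count; break ties alphabetically so ranks are stable.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return {word: rank for rank, (word, _) in enumerate(ordered, start=1)}


# Hypothetical toy corpus; real rankings need millions of words.
sample = "the cat sat on the mat and the dog sat at the door"
ranks = frequency_ranks(sample)
print(ranks["the"], ranks["sat"])
```

With a corpus this small most words are tied, which is why the tie-breaking rule matters here; in a large corpus the counts themselves separate most ranks.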

Methods for measuring the word frequency effect

Most studies of the word frequency effect use eye tracking data. Readers fixate on higher frequency words for shorter amounts of time. [8] In one study, participants' eye movements were recorded with an Eyelink eye tracker as they either read single-sentence stimuli for comprehension or scanned them for topic-relevant words. [9] Reading for comprehension took longer than scanning for certain words, because average fixation durations on the text increased. [9]

A second method used to measure the word frequency effect is electroencephalography (EEG). [8] The results collected using EEG data vary depending on the context of the word. Expected or high frequency words exhibit a reduced N400 response at the beginning of the sentence. [8] One study that co-registered eye movements and EEG found that predictable words showed a lower N400 amplitude, but did not find a significant effect of frequency. [8] More research is needed to see how frequency affects EEG data.

A third method of measuring the word frequency effect is reaction time. Reaction time is used particularly when reading aloud. Participants pronounce words differing in frequency as fast as they can. Higher frequency words are read faster than the low frequency words. [10] [11] [12]
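
As a rough illustration of the reaction time measure, the effect can be summarized as the difference between the mean latency for low frequency words and the mean latency for high frequency words. The sketch below uses invented latencies and hypothetical names (`trials`, `mean_rt`, `frequency_effect`); it does not reproduce the data or analysis of any cited study.

```python
from statistics import mean

# Hypothetical naming trials: (word, frequency class, latency in ms).
trials = [
    ("the", "high", 480),
    ("mind", "high", 510),
    ("got", "high", 495),
    ("tisane", "low", 640),
    ("marche", "low", 655),
    ("strait", "low", 610),
]


def mean_rt(trials, freq_class):
    """Mean latency in ms for one frequency class."""
    return mean(rt for _, cls, rt in trials if cls == freq_class)


def frequency_effect(trials):
    """Low-frequency mean minus high-frequency mean.

    A positive value means high frequency words were named faster.
    """
    return mean_rt(trials, "low") - mean_rt(trials, "high")


print(frequency_effect(trials))
```

Published analyses typically go further, for example by modelling frequency as a continuous variable and accounting for differences between participants and items.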

A fourth method of measuring the word frequency effect is accuracy. Accuracy is used for recognition memory and free recall. Participants study items differing in frequency and are then asked to recognise or recall them. In recognition memory, higher-frequency words are more error prone than lower-frequency items. [13] In free recall, higher-frequency words are less error prone than lower-frequency items. [13]

Cognitive influences

The word frequency effect changes how the brain encodes information. When spelling words from dictation, people begin spelling higher frequency words faster than lower frequency words. [5] The length of a saccade varies depending on the frequency of the target word and on the validity of the previous (preview) word in predicting it. For higher frequency target words, the saccades made as the reader approaches the word are longer when there is a valid preview word in front of it than they are for lower frequency words. When the preview word is invalid, there is no difference in saccade length between high and low frequency words. [14] Fixations follow the opposite pattern, with longer fixations on low frequency words. [5] Research has also found that high frequency words are skipped more often during reading than low frequency words, and that gaze duration is shorter on high frequency words than on low frequency words. [14] Connections between modules are strengthened as words increase in frequency, which helps explain differences in brain processing. [6]
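
Skipping rate and gaze duration can both be derived from a record of first-pass fixations. The sketch below assumes a simplified, hypothetical data layout in which each word encounter is stored with its frequency class and a list of first-pass fixation durations, an empty list meaning the word was skipped. Gaze duration is taken as the sum of first-pass fixations on a word. The values are invented for illustration.

```python
from statistics import mean

# Hypothetical records: (frequency class, first-pass fixation durations in ms).
# An empty list means the word was skipped on first pass.
records = [
    ("high", []),
    ("high", [210]),
    ("high", [200]),
    ("high", []),
    ("low", [240, 180]),
    ("low", [260]),
    ("low", []),
    ("low", [250, 150]),
]


def summarize(records, freq_class):
    """Return skipping rate and mean gaze duration for one frequency class."""
    passes = [fixations for cls, fixations in records if cls == freq_class]
    skipped = sum(1 for fixations in passes if not fixations)
    # Gaze duration: total first-pass fixation time on each fixated word.
    gaze = [sum(fixations) for fixations in passes if fixations]
    return {"skip_rate": skipped / len(passes), "mean_gaze_ms": mean(gaze)}


print(summarize(records, "high"))
print(summarize(records, "low"))
```

In this toy data the high frequency words are skipped more often and receive shorter gaze durations, the pattern described in the paragraph above.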

Real world applications

Written words

Leading character effect (LCF)

In many languages, certain characters are used more frequently than others. Examples of more frequent characters in English are the vowels and consonants such as m, r, s and t. In other languages such as Chinese, characters are morphemes that can be individual words. [3] More than 100,000 words in Chinese are made of the same 5,000 characters. [3] As people process the first character of a word, they make a mental prediction of what the word is before reading the rest of the characters. If the character and other preprocessing information indicate that the word is short and familiar, the reader is more likely to skip the entire word. [15]

Character frequency may be more important in reading than the frequency of the word as a whole. In a study examining the Chinese language, reaction times for target words whose first character was high frequency were shorter than for those whose first character was low frequency when participants simply named the Chinese word. In a lexical decision task, target words with higher LCF took longer to respond to than words with low LCF. An example of a high frequency character in Chinese is the character for family (家), which appears before many other characters. [3] These effects were moderated by the predictability of the next words as well as the predictability of the target word given the previous word. [14] When the surrounding words are also high frequency, reaction times are faster, particularly when the target word is high frequency rather than low frequency. [3]

Test-taking

The quick recognition of a word would potentially be important during a timed written assessment. With a strict limit on time available to complete a test, the presence of higher frequency words on the assessment would be more beneficial to the test-taker than low frequency words, as the high frequency words would be recognized faster and thus time could be utilized on other areas of the assessment.

Bilingualism

With more people becoming fluent in multiple languages, the word frequency effect could present differently in a first language than a second language. One study examined differences in reading across participants who were bilingual in Spanish and English. The participants had varying levels of competence in the second language with more fluent participants demonstrating a stronger word frequency effect. [2] As the word frequency effect increased in both languages, total reading time decreased. In L1 (first language) there were higher skipping rates than in L2 (second language). This suggests that lower frequency words in L2 were harder to process than both high and low frequency words in L1. Familiarity of the language plays a large role in reacting to the frequency of words. [2] Reaction rates of bilingual adults could also be impacted by age. Older adults were significantly slower to respond to lower frequency words but were faster to process higher frequency words. [2]

Spoken words

In several studies, participants read a list of high or low frequency words along with nonwords (or pseudowords). They were tasked with pronouncing the words or nonwords as fast as possible. [10] High frequency words were read aloud faster than low frequency words. [10] Participants read the nonwords the slowest. [10] [11] More errors were made when pronouncing the low frequency words than the high frequency words. [10]

Physical activities

Driving

Quick recognition of a word could also be important when reading road signs while driving. As a vehicle moves and passes signs on the side of the road, there is only a short amount of time available to read them. The presence of higher frequency words on a road sign would allow for faster recognition and processing of its meaning, which could be critical in such a time-sensitive situation.

Criticisms

Daniel Voyer raised some criticisms of the word frequency effect in 2003 after experiments on laterality effects in lexical decisions. [16] His experiments demonstrated two findings:

(1) The word frequency effect was significant only for left visual field presentations.
(2) In a case-altered condition, the word frequency effect was meaningful for right visual field presentations.

Voyer further posits that hemispheric asymmetries may play a role in the word frequency effect.

Future directions

Psycholinguists believe that future study of the word frequency effect needs to consider the role of heuristics to determine the difference in eye movements between high and low frequency words. [14]

References

  1. Daniel Smilek; Scott Sinnett; Alan Kingstone. "Cognition". Oxford University Press Canada. Retrieved 7 May 2014.
  2. Whitford, Veronica (2017). "The effects of word frequency and word predictability during first- and second-language paragraph reading in bilingual older and younger adults". Psychology and Aging. 32 (2): 158–177. doi:10.1037/pag0000151. PMID 28287786. S2CID 25819451.
  3. Li, Meng-Feng; Gao, Xin-Yu; Chou, Tai-Li; Wu, Jei-Tun (2017-02-01). "Neighborhood Frequency Effect in Chinese Word Recognition: Evidence from Naming and Lexical Decision". Journal of Psycholinguistic Research. 46 (1): 227–245. doi:10.1007/s10936-016-9431-5. ISSN 0090-6905. PMID 27119658. S2CID 24411450.
  4. "Word Frequency Effect". Oxford University Press. Retrieved 21 October 2014.
  5. Bonin, Patrick; Laroche, Betty; Perret, Cyril (2016). "Locus of word frequency effects in spelling to dictation: Still at the orthographic level!". Journal of Experimental Psychology: Learning, Memory, and Cognition. 42 (11): 1814–1820. doi:10.1037/xlm0000278. PMID 27088496.
  6. Besner, Derek; Risko, Evan F. (2016). "Thinking outside the box when reading aloud: Between (localist) module connection strength as a source of word frequency effects". Psychological Review. 123 (5): 592–599. doi:10.1037/rev0000041. PMID 27657439.
  7. Harris, Jonathan. "Wordcount". Retrieved 4 November 2014.
  8. Kretzschmar, Franziska; Schlesewsky, Matthias; Staub, Adrian (2015). "Dissociating word frequency and predictability effects in reading: Evidence from coregistration of eye movements and EEG" (PDF). Journal of Experimental Psychology: Learning, Memory, and Cognition. 41 (6): 1648–1662. doi:10.1037/xlm0000128. PMID 26010829.
  9. White, Sarah J.; Warrington, Kayleigh L.; McGowan, Victoria A.; Paterson, Kevin B. (2015). "Eye movements during reading and topic scanning: Effects of word frequency" (PDF). Journal of Experimental Psychology: Human Perception and Performance. 41 (1): 233–248. doi:10.1037/xhp0000020. hdl:2381/31659. PMID 25528014.
  10. O'Malley, Shannon; Besner, Derek (2008). "Reading aloud: Qualitative differences in the relation between stimulus quality and word frequency as a function of context" (PDF). Journal of Experimental Psychology: Learning, Memory, and Cognition. 34 (6): 1400–1411. doi:10.1037/a0013084. hdl:10012/3853. PMID 18980404.
  11. White, Darcy; Besner, Derek (2017). "Reading aloud: On the determinants of the joint effects of stimulus quality and word frequency". Journal of Experimental Psychology: Learning, Memory, and Cognition. 43 (5): 749–756. doi:10.1037/xlm0000344. PMID 27936847. S2CID 3560733.
  12. Elsherif, M. M.; Preece, E.; Catling, J. C. (2023-01-09). "Age-of-acquisition effects: A literature review". Journal of Experimental Psychology: Learning, Memory, and Cognition. 49 (5): 812–847. doi:10.1037/xlm0001215. ISSN 1939-1285. PMID 36622701. S2CID 255545091.
  13. Lau, Mabel C; Goh, Winston D; Yap, Melvin J (October 2018). "An item-level analysis of lexical-semantic effects in free recall and recognition memory using the megastudy approach". Quarterly Journal of Experimental Psychology. 71 (10): 2207–2222. doi:10.1177/1747021817739834. ISSN 1747-0218. PMID 30226433. S2CID 52292000.
  14. Liu, Yanping; Reichle, Erik D.; Li, Xingshan (2016). "The effect of word frequency and parafoveal preview on saccade length during the reading of Chinese". Journal of Experimental Psychology: Human Perception and Performance. 42 (7): 1008–1025. doi:10.1037/xhp0000190. PMC 4925191. PMID 27045319.
  15. Angele, Bernhard; Laishley, Abby E.; Rayner, Keith; Liversedge, Simon P. (2014). "The effect of high- and low-frequency previews and sentential fit on word skipping during reading". Journal of Experimental Psychology: Learning, Memory, and Cognition. 40 (4): 1181–1203. doi:10.1037/a0036396. PMC 4100595. PMID 24707791.
  16. Voyer, Daniel (2003). "Word frequency and laterality effects in lexical decision: Right hemisphere mechanisms". Brain and Language. 87 (3): 421–431. doi:10.1016/s0093-934x(03)00143-3. PMID 14642544. S2CID 38332364.