IQ classification

Last updated

Score distribution chart for sample of 905 children tested on 1916 Stanford-Binet Test Terman1916Fig2IQDistribution.png
Score distribution chart for sample of 905 children tested on 1916 Stanford–Binet Test

IQ classification is the practice of categorizing human intelligence, as measured by intelligence quotient (IQ) tests, into categories such as "superior" or "average". [1] [2] [3] [4]

Contents

In the current IQ scoring method, an IQ score of 100 means that the test-taker's performance on the test is of average performance in the sample of test-takers of about the same age as was used to norm the test. An IQ score of 115 means performance one standard deviation above the mean, while a score of 85 means performance one standard deviation below the mean, and so on. [5] This "deviation IQ" method is now used for standard scoring of all IQ tests in large part because they allow a consistent definition of IQ for both children and adults. By the current "deviation IQ" definition of IQ test standard scores, about two-thirds of all test-takers obtain scores from 85 to 115, and about 5 percent of the population scores above 125 (i.e. normal distribution). [6]

When IQ testing was first created, Lewis Terman and other early developers of IQ tests noticed that most child IQ scores come out to approximately the same number regardless of testing procedure. Variability in scores can occur when the same individual takes the same test more than once. [7] [8] Further, a minor divergence in scores can be observed when an individual takes tests provided by different publishers at the same age. [9] There is no standard naming or definition scheme employed universally by all test publishers for IQ score classifications.

Even before IQ tests were invented, there were attempts to classify people into intelligence categories by observing their behavior in daily life. [10] [11] Those other forms of behavioral observation were historically important for validating classifications based primarily on IQ test scores. Some early intelligence classifications by IQ testing depended on the definition of "intelligence" used in a particular case. Current IQ test publishers take into account reliability and error of estimation in the classification procedure.

Differences in individual IQ classification

IQ scores can differ to some degree for the same person on different IQ tests, so a person does not always belong to the same IQ score range each time the person is tested (IQ score table data and pupil pseudonyms adapted from description of KABC-II norming study cited in Kaufman 2009). [12] [13]
PupilKABC-IIWISC-IIIWJ-III
Asher9095111
Brianna125110105
Colin10093101
Danica116127118
Elpha9310593
Fritz106105105
Georgi9510090
Hector112113103
Imelda1049697
Jose1019986
Keoku817875
Leo116124102

IQ tests generally are reliable enough that most people 10 years of age and older have similar IQ scores throughout life. [14] Both the WAIS-IV and the Stanford-Binet Intelligence Scale, for example, have a reliability of 0.970.98 for IQ across all age groups. [15]

IQ test publishers use large and "representative samples, use items that measure their intended constructs well, and produce unbiased scores. Thus, these instruments tend to provide scores that reliably and validly measure the constructs they intend to measure." [16] Still, some individuals score very differently when taking the same test at different times or when taking more than one kind of IQ test at the same age. [17] About 42% of children change their score by 5 or more points when re-tested. [18]

For example, many children in the famous longitudinal Genetic Studies of Genius begun in 1921 by Lewis Terman showed declines in IQ as they grew up. Terman recruited school pupils based on referrals from teachers, and gave them his Stanford–Binet IQ test. Children with an IQ above 140 by that test were included in the study. There were 643 children in the main study group. When the students who could be contacted again (503 students) were retested at high school age, they were found to have dropped 9 IQ points on average in Stanford–Binet IQ. Some children dropped by 15 IQ points and or by 25 points or more. Yet parents of those children thought that the children were still as bright as ever, or even brighter. [19]

Modern tests, however, have subsequently improved in reliability. The WAIS-IV test-retest correlation is .96. [20]

Because all IQ tests have error of measurement in the test-taker's IQ score, a test-giver should always inform the test-taker of the confidence interval around the score obtained on a given occasion of taking each test. [21] IQ scores are ordinal scores and are not expressed in an interval measurement unit. [22] [23] [24] [25] [26] Besides the reported error interval around IQ test scores, an IQ score could be misleading if a test-giver failed to follow standardized administration and scoring procedures. In cases of test-giver mistakes, the usual result is that tests are scored too leniently, giving the test-taker a higher IQ score than the test-taker's performance justifies. On the other hand, some test-givers err by showing a "halo effect", with low-IQ individuals receiving IQ scores even lower than if standardized procedures were followed, while high-IQ individuals receive inflated IQ scores. [27]

The categories of IQ vary between IQ test publishers as the category labels for IQ score ranges are specific to each brand of test. The test publishers do not have a uniform practice of labeling IQ score ranges, nor do they have a consistent practice of dividing up IQ score ranges into categories of the same size or with the same boundary scores. [28] Thus psychologists should specify which test was given when reporting a test-taker's IQ category if not reporting the raw IQ score. [29] Psychologists and IQ test authors recommend that psychologists adopt the terminology of each test publisher when reporting IQ score ranges. [30] [31]

Although intelligence is important in modern life as it predicts success in many areas, [32] IQ classifications from IQ testing are not the last word on how a test-taker will do in life, nor are they the only information to be considered for placement in school or job-training programs. There is still a dearth of information about how behavior differs between people with differing IQ scores. [33] For placement in school programs, for medical diagnosis, and for career advising, factors other than IQ can be part of an individual assessment as well.

The lesson here is that classification systems are necessarily arbitrary and change at the whim of test authors, government bodies, or professional organizations. They are statistical concepts and do not correspond in any real sense to the specific capabilities of any particular person with a given IQ. The classification systems provide descriptive labels that may be useful for communication purposes in a case report or conference, and nothing more. [34]

Alan S. Kaufman and Elizabeth O. Lichtenberger, Assessing Adolescent and Adult Intelligence (2006)

IQ classification tables for current tests

There are a variety of individually administered IQ tests in use. [35] [36] Not all report test results as "IQ", but most now report a standard score with a mean score level of 100. When a test-taker scores higher or lower than the median score, the score is indicated as 15 standard score points higher or lower for each standard deviation difference higher or lower in the test-taker's performance on the test item content.

Wechsler Intelligence Scales

The Wechsler intelligence scales were originally developed from earlier intelligence scales by David Wechsler. David Wechsler, using the clinical and statistical skills he gained under Charles Spearman and as a World War I psychology examiner, crafted a series of intelligence tests. These eventually surpassed other such measures, becoming the most widely used and popular intelligence assessment tools for many years. The first Wechsler test published was the Wechsler–Bellevue Scale in 1939. [37] The Wechsler IQ tests for children and for adults are the most frequently used individual IQ tests in the English-speaking world [38] and in their translated versions are perhaps the most widely used IQ tests worldwide. [39] The Wechsler tests have long been regarded as the "gold standard" in IQ testing. [40] The Wechsler Adult Intelligence Scale—Fourth Edition (WAIS–IV) was published in 2008 by The Psychological Corporation. [35] The Wechsler Intelligence Scale for Children—Fifth Edition (WISC–V) was published in 2014 by The Psychological Corporation, and the Wechsler Preschool and Primary Scale of Intelligence—Fourth Edition (WPPSI–IV) was published in 2012 by The Psychological Corporation. Like all current IQ tests, the Wechsler tests report a "deviation IQ" as the standard score for the full-scale IQ, with the norming sample mean raw score defined as IQ 100 and a score one standard deviation higher defined as IQ 115 (and one deviation lower defined as IQ 85).

During the First World War in 1917, adult intelligence testing gained prominence as an instrument for assessing drafted soldiers in the United States. Robert Yerkes, an American psychologist, was assigned to devise psychometric tools to allocate recruits to different levels of military service, leading to the development of the Army Alpha and Army Beta group-based tests. The collective efforts of Binet, Simon, Terman, and Yerkes laid the groundwork for modern intelligence test series. [15]

Current Wechsler (WAIS–IV, WPPSI–IV) IQ classification
IQ Range ("deviation IQ")IQ Classification [41] [42]
130 and aboveVery Superior
120–129Superior
110–119High Average
90–109Average
80–89Low Average
70–79Borderline
69 and belowExtremely Low
Wechsler Intelligence Scale for Children–Fifth Edition (WISC-V) IQ classification
IQ Range ("deviation IQ")IQ Classification [43]
130 and aboveExtremely High
120–129Very High
110–119High Average
90–109Average
80–89Low Average
70–79Very Low
69 and belowExtremely Low

Psychologists have proposed alternative language for Wechsler IQ classifications. [44] [45] The term "borderline", which implies being very close to being intellectually disabled (defined as IQ under 70), is replaced in the alternative system by a term that doesn't imply a medical diagnosis.

Alternate Wechsler IQ Classifications (after Groth-Marnat 2009) [46]
Corresponding IQ RangeClassificationsMore value-neutral terms
130+Very superiorUpper extreme
120–129SuperiorWell above average
110–119High averageHigh average
90–109AverageAverage
80–89Low averageLow average
70–79BorderlineWell below average
69 and belowExtremely lowLower extreme

Stanford–Binet Intelligence Scale Fifth Edition

The current fifth edition of the Stanford–Binet scales (SB5) was developed by Gale H. Roid and published in 2003 by Riverside Publishing. [35] Unlike scoring on previous versions of the Stanford–Binet test, SB5 IQ scoring is deviation scoring in which each standard deviation up or down from the norming sample median score is 15 points from the median score, IQ 100, just like the standard scoring on the Wechsler tests. The standardized SB5 was established after five years of development and analysis to "address possible biases, such as racial/ethnic, gender, cultural, and religious discriminations. Roughly 500 examiners from all 50 states were trained in administering this test. There were 4,800 subjects in the average sampling group that ranged from 2 to 85+ years of age. Based on the year 2000 U.S. Census data, the sample was nationally representative of all demographic factors, including age, geographic region, race/ethnicity, and socio‐economic level." [47]

Stanford–Binet Fifth Edition (SB5) classification [42] [48]
IQ Range ("deviation IQ")IQ Classification
140+Very gifted or highly advanced
130–140Gifted or very advanced
120–129Superior
110–119High average
90–109Average
80–89Low average
70–79Borderline impaired or delayed
55–69Mildly impaired or delayed
40–54Moderately impaired or delayed

Woodcock–Johnson Test of Cognitive Abilities

The Woodcock–Johnson a III NU Tests of Cognitive Abilities (WJ III NU) was developed by Richard W. Woodcock, Kevin S. McGrew and Nancy Mather and published in 2007 by Riverside. [35] The WJ III classification terms are not applied.

Woodcock–Johnson R
IQ ScoreWJ III Classification [49]
131 and aboveVery superior
121 to 130Superior
111 to 120High Average
90 to 110Average
80 to 89Low Average
70 to 79Low
69 and belowVery Low

Kaufman Tests

The Kaufman Adolescent and Adult Intelligence Test was developed by Alan S. Kaufman and Nadeen L. Kaufman and published in 1993 by American Guidance Service. [35] Kaufman test scores "are classified in a symmetrical, nonevaluative fashion", [50] in other words the score ranges for classification are just as wide above the mean as below the mean, and the classification labels do not purport to assess individuals.

KAIT 1993 IQ classification
130 and aboveUpper Extreme
120–129Well Above Average
110–119Above average
90–109Average
80–89Below Average
70–79Well Below Average
69 and belowLower Extreme

The Kaufman Assessment Battery for Children, Second Edition was developed by Alan S. Kaufman and Nadeen L. Kaufman and published in 2004 by American Guidance Service. [35]

KABC-II 2004 Descriptive Categories [51] [52]
Range of Standard ScoresName of Category
131–160Upper Extreme
116–130Above Average
85–115Average Range
70–84Below Average
40–69Lower Extreme

Cognitive Assessment System

The Das-Naglieri Cognitive Assessment System test was developed by Jack Naglieri and J. P. Das and published in 1997 by Riverside. [35]

Cognitive Assessment System 1997 full scale score classification [53]
Standard ScoresClassification
130 and aboveVery Superior
120–129Superior
110–119High Average
90–109Average
80–89Low Average
70–79Below Average
69 and belowWell Below Average

Differential Ability Scales

The Differential Ability Scales Second Edition (DAS–II) was developed by Colin D. Elliott and published in 2007 by Psychological Corporation. [35] The DAS-II is a test battery given individually to children, normed for children from ages two years and six months through seventeen years and eleven months. [54] It was normed on 3,480 noninstitutionalized, English-speaking children in that age range. [55] The DAS-II yields a General Conceptual Ability (GCA) score scaled like an IQ score with the mean standard score set at 100 and 15 standard score points for each standard deviation up or down from the mean. The lowest possible GCA score on DAS–II is 30, and the highest is 170. [56]

DAS-II 2007 GCA classification [42] [57]
GCAGeneral Conceptual Ability Classification
≥ 130Very high
120–129High
110–119Above average
90–109Average
80–89Below average
70–79Low
≤ 69Very low

Reynolds Intellectual Ability Scales

Reynolds Intellectual Ability Scales (RIAS) were developed by Cecil Reynolds and Randy Kamphaus. The RIAS was published in 2003 by Psychological Assessment Resources. [35]

RIAS 2003 Scheme of Verbal Descriptors of Intelligence Test Performance [58]
Intelligence test score rangeVerbal descriptor
≥ 130Significantly above average
120–129Moderately above average
110–119Above average
90–109Average
80–89Below average
70–79Moderately below average
≤ 69Significantly below average

Historical IQ classification tables

Reproduction of an item from the 1908 Binet-Simon intelligence scale, showing three pairs of pictures, about which the tested child was asked, "Which of these two faces is the prettier?" Reproduced from the article "A Practical Guide for Administering the Binet-Simon Scale for Measuring Intelligence" by J. E. Wallace Wallin in the March 1911 issue of the journal The Psychological Clinic (volume 5 number 1), public domain. Simon-Binet Ugly Face Item from 1911 journal.png
Reproduction of an item from the 1908 Binet–Simon intelligence scale, showing three pairs of pictures, about which the tested child was asked, "Which of these two faces is the prettier?" Reproduced from the article "A Practical Guide for Administering the Binet–Simon Scale for Measuring Intelligence" by J. E. Wallace Wallin in the March 1911 issue of the journal The Psychological Clinic (volume 5 number 1), public domain.

Lewis Terman, developer of the Stanford–Binet Intelligence Scales, based his English-language Stanford–Binet IQ test on the French-language Binet–Simon test developed by Alfred Binet. Terman believed his test measured the "general intelligence" construct advocated by Charles Spearman (1904). [59] [60] Terman differed from Binet in reporting scores on his test in the form of intelligence quotient ("mental age" divided by chronological age) scores after the 1912 suggestion of German psychologist William Stern. Terman chose the category names for score levels on the Stanford–Binet test. When he first chose classification for score levels, he relied partly on the usage of earlier authors who wrote, before the existence of IQ tests, on topics such as individuals unable to care for themselves in independent adult life. Terman's first version of the Stanford–Binet was based on norming samples that included only white, American-born subjects, mostly from California, Nevada, and Oregon. [61]

Terman's Stanford–Binet original (1916) classification [62] [63]
IQ Range ("ratio IQ")IQ Classification
Above 140"Near" genius or genius
120–140Very superior intelligence
110–120Superior intelligence
90–110Normal, or average, intelligence
80–90Dullness, rarely classifiable as feeble-mindedness
70–80Border-line deficiency, sometimes classifiable as dullness, often as feeble-mindedness
Below 70Definite feeble-mindedness

Rudolph Pintner proposed a set of classification terms in his 1923 book Intelligence Testing: Methods and Results. [4] Pintner commented that psychologists of his era, including Terman, went about "the measurement of an individual's general ability without waiting for an adequate psychological definition." [64] Pintner retained these terms in the 1931 second edition of his book. [65]

Pintner 1923 IQ classification [4]
IQ Range ("ratio IQ")IQ Classification
130 and aboveVery Superior
120–129Very Bright
110–119Bright
90–109Normal
80–89Backward
70–79Borderline

Albert Julius Levine and Louis Marks proposed a broader set of categories in their 1928 book Testing Intelligence and Achievement. [66] [67] Some of the entries came from contemporary terms for people with intellectual disability.

Levine and Marks 1928 IQ classification [66] [67]
IQ Range ("ratio IQ")IQ Classification
175 and overPrecocious
150–174Very superior
125–149Superior
115–124Very bright
105–114Bright
95–104Average
85–94Dull
75–84Borderline
50–74Morons
25–49Imbeciles
0–24Idiots

The second revision (1937) of the Stanford–Binet test retained "quotient IQ" scoring, despite earlier criticism of that method of reporting IQ test standard scores. [68] The term "genius" was no longer used for any IQ score range. [69] The second revision was normed only on children and adolescents (no adults), and only "American-born white children". [70]

Terman's Stanford–Binet Second Revision (1937) classification [69]
IQ Range ("ratio IQ")IQ Classification
140 and overVery superior
120–139Superior
110–119High average
90–109Normal or average
80–89Low average
70–79Borderline defective
Below 70Mentally defective

A data table published later as part of the manual for the 1960 Third Revision (Form L-M) of the Stanford–Binet test reported score distributions from the 1937 second revision standardization group.

Score Distribution of Stanford–Binet 1937 Standardization Group [69]
IQ Range ("ratio IQ")Percent of Group
160–1690.03
150–1590.2
140–1491.1
130–1393.1
120–1298.2
110–11918.1
100–10923.5
90–9923.0
80–8914.5
70–795.6
60–692.0
50–590.4
40–490.2
30–390.03

David Wechsler, developer of the Wechsler–Bellevue Scale of 1939 (which was later developed into the Wechsler Adult Intelligence Scale) popularized the use of "deviation IQs" as standard scores of IQ tests rather than the "quotient IQs" ("mental age" divided by "chronological age") then used for the Stanford–Binet test. [71] He devoted a whole chapter in his book The Measurement of Adult Intelligence to the topic of IQ classification and proposed different category names from those used by Lewis Terman. Wechsler also criticized the practice of earlier authors who published IQ classification tables without specifying which IQ test was used to obtain the scores reported in the tables. [72]

Wechsler–Bellevue 1939 IQ classification
IQ Range ("deviation IQ")IQ ClassificationPercent Included
128 and overVery Superior2.2
120–127Superior6.7
111–119Bright Normal16.1
91–110Average50.0
80–90Dull normal16.1
66–79Borderline6.7
65 and belowDefective2.2

In 1958, Wechsler published another edition of his book Measurement and Appraisal of Adult Intelligence. He revised his chapter on the topic of IQ classification and commented that "mental age" scores were not a more valid way to score intelligence tests than IQ scores. [73] He continued to use the same classification terms.

Wechsler Adult Intelligence Scales 1958 Classification [74]
IQ Range ("deviation IQ")IQ Classification(Theoretical) Percent Included
128 and overVery Superior2.2
120–127Superior6.7
111–119Bright Normal16.1
91–110Average50.0
80–90Dull normal16.1
66–79Borderline6.7
65 and belowDefective2.2

The third revision (Form L-M) in 1960 of the Stanford–Binet IQ test used the deviation scoring pioneered by David Wechsler. For rough comparability of scores between the second and third revision of the Stanford–Binet test, scoring table author Samuel Pinneau set 100 for the median standard score level and 16 standard score points for each standard deviation above or below that level. The highest score obtainable by direct look-up from the standard scoring tables (based on norms from the 1930s) was IQ 171 at various chronological ages from three years six months (with a test raw score "mental age" of six years and two months) up to age six years and three months (with a test raw score "mental age" of ten years and three months). [75] The classification for Stanford–Binet L-M scores does not include terms such as "exceptionally gifted" and "profoundly gifted" in the test manual itself. David Freides, reviewing the Stanford–Binet Third Revision in 1970 for the Buros Seventh Mental Measurements Yearbook (published in 1972), commented that the test was obsolete by that year. [76]

Terman's Stanford–Binet Third Revision (Form L-M) classification [48]
IQ Range ("deviation IQ")IQ Classification
140 and overVery superior
120–139Superior
110–119High average
90–109Normal or average
80–89Low average
70–79Borderline defective
Below 70Mentally defective

The first edition of the Woodcock–Johnson Tests of Cognitive Abilities was published by Riverside in 1977. The classifications used by the WJ-R Cog were "modern in that they describe levels of performance as opposed to offering a diagnosis." [49]

Woodcock–Johnson R
IQ ScoreWJ-R Cog 1977 Classification [49]
131 and aboveVery superior
121 to 130Superior
111 to 120High Average
90 to 110Average
80 to 89Low Average
70 to 79Low
69 and belowVery Low

The revised version of the Wechsler Adult Intelligence Scale (the WAIS-R) was developed by David Wechsler and published by Psychological Corporation in 1981. Wechsler changed a few of the boundaries for classification categories and a few of their names compared to the 1958 version of the test. The test's manual included information about how the actual percentage of people in the norming sample scoring at various levels compared to theoretical expectations.

Wechsler Adult Intelligence Scales 1981 Classification [77]
IQ Range ("deviation IQ")IQ ClassificationActual Percent IncludedTheoretical Percent Included
130+Very Superior2.62.2
120–129Superior6.96.7
110–119High Average16.616.1
90–109Average49.150.0
80–89Low Average16.116.1
70–79Borderline6.46.7
below 70Mentally Retarded2.32.2

The Kaufman Assessment Battery for Children (K-ABC) was developed by Alan S. Kaufman and Nadeen L. Kaufman and published in 1983 by American Guidance Service.

K-ABC 1983 Ability Classifications [77]
Range of Standard ScoresName of CategoryPercent of Norm SampleTheoretical Percent Included
130+Upper Extreme2.32.2
120–129Well Above Average7.46.7
110–119Above Average16.716.1
90–109Average49.550.0
80–89Below Average16.116.1
70–79Well Below Average6.16.7
below 70Lower Extreme2.12.2

The fourth revision of the Stanford–Binet scales (S-B IV) was developed by Thorndike, Hagen, and Sattler and published by Riverside Publishing in 1986. It retained the deviation scoring of the third revision with each standard deviation from the mean being defined as a 16 IQ point difference. The S-B IV adopted new classification terminology. After this test was published, psychologist Nathan Brody lamented that IQ tests had still not caught up with advances in research on human intelligence during the twentieth century. [78]

Stanford–Binet Intelligence Scale, Fourth Edition (S-B IV) 1986 classification [79] [80]
IQ Range ("deviation IQ")IQ Classification
132 and aboveVery superior
121–131Superior
111–120High average
89–110Average
79–88Low average
68–78Slow learner
67 or belowMentally retarded

The third edition of the Wechsler Adult Intelligence Scale (WAIS-III) used different classification terminology from the earliest versions of Wechsler tests.

Wechsler (WAIS–III) 1997 IQ test classification
IQ Range ("deviation IQ")IQ Classification
130 and aboveVery superior
120–129Superior
110–119High average
90–109Average
80–89Low average
70–79Borderline
69 and belowExtremely low

Classification of low IQ

The earliest terms for classifying individuals of low intelligence were medical or legal terms that preceded the development of IQ testing. [10] [11] The legal system recognized a concept of some individuals being so cognitively impaired that they were not responsible for criminal behavior. Medical doctors sometimes encountered adult patients who could not live independently, being unable to take care of their own daily living needs. Various terms were used to attempt to classify individuals with varying degrees of intellectual disability. Many of the earliest terms are now considered extremely offensive.

In current medical diagnosis, IQ scores alone are not conclusive for a finding of intellectual disability. Recently adopted diagnostic standards place the major emphasis on the adaptive behavior of each individual, with IQ score a factor in diagnosis in addition to adaptive behavior scales. Some advocate for no category of intellectual disability to be defined primarily by IQ scores. [81] Psychologists point out that evidence from IQ testing should always be used with other assessment evidence in mind: "In the end, any and all interpretations of test performance gain diagnostic meaning when they are corroborated by other data sources and when they are empirically or logically related to the area or areas of difficulty specified in the referral." [82]

In the United States, the Supreme Court ruled in the case Atkins v. Virginia , 536 U.S. 304 (2002) that states could not impose capital punishment on people with "mental retardation", defined in subsequent cases as people with IQ scores below 70.[ citation needed ] This legal standard continues to be actively litigated in capital cases. [83]

Historical

Historically, terms for intellectual disability eventually became perceived as an insult, in a process commonly known as the euphemism treadmill. [84] [85] [86] The terms mental retardation and mentally retarded became popular in the middle of the 20th century to replace the previous set of terms, which included "imbecile", "idiot", "feeble-minded", and "moron", [87] among others. By the end of the 20th century, retardation and retard became widely seen as disparaging and politically incorrect, although they are still used in some clinical contexts. [88]

The American Association for the Study of the Feeble-minded divided adults with intellectual deficits into three categories. Idiot indicated the greatest degree of intellectual disability in which a person's mental age is below three years. Imbecile indicated an intellectual disability less severe than idiocy and a mental age between three and seven years. Moron was defined as someone a mental age between eight and twelve. [89] Alternative definitions of these terms based on IQ were also used.[ citation needed ]

The term cretin dates to 1770–80 and comes from a dialectal French word for Christian. [90] The implication was that people with significant intellectual or developmental disabilities were "still human" (or "still Christian") and deserved to be treated with basic human dignity. Although cretin is no longer in use, the term cretinism is still used to refer to the mental and physical disability resulting from untreated congenital hypothyroidism. [90]

Mongolism and Mongoloid idiot were terms used to identify someone with Down syndrome, as the doctor who first described the syndrome, John Langdon Down, believed that children with Down syndrome shared facial similarities with the now-obsolete category of "Mongolian race". The Mongolian People's Republic requested that the medical community cease the use of the term; in 1960, the World Health Organization agreed the term should cease being used. [91]

Retarded comes from the Latin retardare, 'to make slow, delay, keep back, or hinder', so mental retardation meant the same as mentally delayed. The first record of retarded in relation to being mentally slow was in 1895. The term mentally retarded was used to replace terms like idiot, moron, and imbecile because retarded was not then a derogatory term. By the 1960s, however, the term had taken on a partially derogatory meaning. The noun retard is particularly seen as pejorative; a BBC survey in 2003 ranked it as the most offensive disability-related word. [92] The terms mentally retarded and mental retardation are still fairly common, but organizations such as the Special Olympics and Best Buddies are striving to eliminate their use and often refer to retard and its variants as the "r-word". These efforts resulted in U.S. federal legislation, known as Rosa's Law, which replaced the term mentally retarded with the term intellectual disability in federal law. [93] [94]

Classification of high IQ

Genius

Galton in his later years Francis Galton2.jpg
Galton in his later years

Francis Galton (1822–1911) was a pioneer in investigating both eminent human achievement and mental testing. In his book Hereditary Genius, written before the development of IQ testing, he proposed that hereditary influences on eminent achievement are strong, and that eminence is rare in the general population. Lewis Terman chose "'near' genius or genius" as the classification label for the highest classification on his 1916 version of the Stanford–Binet test. [62] By 1926, Terman began publishing about a longitudinal study of California schoolchildren who were referred for IQ testing by their schoolteachers, called Genetic Studies of Genius, which he conducted for the rest of his life. Catherine M. Cox, a colleague of Terman's, wrote a whole book, The Early Mental Traits of 300 Geniuses, published as volume 2 of The Genetic Studies of Genius book series, in which she analyzed biographical data about historic geniuses. Although her estimates of childhood IQ scores of historical figures who never took IQ tests have been criticized on methodological grounds, [95] [96] [97] Cox's study was thorough in finding out what else matters besides IQ in becoming a genius. [98] By the 1937 second revision of the Stanford–Binet test, Terman no longer used the term "genius" as an IQ classification, nor has any subsequent IQ test. [69] [99] In 1939, Wechsler wrote "we are rather hesitant about calling a person a genius on the basis of a single intelligence test score." [100]

The Terman longitudinal study in California eventually provided historical evidence on how genius is related to IQ scores. [101] Many California pupils were recommended for the study by schoolteachers. Two pupils who were tested but rejected for inclusion in the study because of IQ scores too low for the study grew up to be Nobel Prize winners in physics: William Shockley [102] [103] and Luis Walter Alvarez. [104] [105] Based on the historical findings of the Terman study and on biographical examples such as Richard Feynman, who had an IQ of 125 and went on to win the Nobel Prize in physics and become widely known as a genius, [106] [107] the current view of psychologists and other scholars of genius is that a minimum IQ, about 125, is strictly necessary for genius, but that IQ is sufficient for the development of genius only when combined with the other influences identified by Cox's biographical study: an opportunity for talent development along with the characteristics of drive and persistence. Charles Spearman, bearing in mind the influential theory that he originated—that intelligence comprises both a "general factor" and "special factors" more specific to particular mental tasks—wrote in 1927, "Every normal man, woman, and child is, then, a genius at something, as well as an idiot at something." [108]

Giftedness

A major point of consensus among all scholars of intellectual giftedness is that there is no generally agreed upon definition of giftedness. [109] Although there is no scholarly agreement about identifying gifted learners, there is a de facto reliance on IQ scores for identifying participants in school gifted education programs. In practice, many school districts in the United States use an IQ score of 130, including roughly the upper 2 to 3 percent of the national population as a cut-off score for inclusion in school gifted programs. [110]

Five levels of giftedness have been suggested to differentiate the vast difference in abilities that exists between children on varying ends of the gifted spectrum. [111] Although there is no strong consensus on the validity of these quantifiers, they are accepted by many experts of gifted children.

Levels of Giftedness (M.U. Gross) [111]
ClassificationIQ RangeσPrevalence
Mildly gifted115–129+1.00–+1.991:6–1:44
Moderately gifted130–144+2.00–+2.991:44–1:1,000
Highly gifted145–159+3.00–+3.991:1,000–1:10,000
Exceptionally gifted160–179+4.00–+5.331:10,000–1:1,000,000
Profoundly gifted180–+5.33–< 1:1,000,000

As long ago as 1937, Lewis Terman pointed out that error of estimation in IQ scoring increases as IQ score increases, so that there is less and less certainty about assigning a test-taker to one band of scores or another as one looks at higher bands. [112] Current IQ tests also have large error bands for high IQ scores. [113] As an underlying reality, such distinctions as those between "exceptionally gifted" and "profoundly gifted" have never been well established. All longitudinal studies of IQ have shown that test-takers can bounce up and down in score, and thus switch up and down in rank order as compared to one another, over the course of childhood. IQ classification categories such as "profoundly gifted" are those are based on the obsolete Stanford–Binet Third Revision (Form L-M) test. [114] The highest reported standard score for most IQ tests is IQ 160, approximately the 99.997th percentile. [115] IQ scores above this level have wider error ranges as there are fewer normative cases at this level of intelligence. [116] [117] Moreover, there has never been any validation of the Stanford–Binet L-M on adult populations, and there is no trace of such terminology in the writings of Lewis Terman. Although two current tests attempt to provide "extended norms" that allow for classification of different levels of giftedness, those norms are not based on well validated data. [118]

See also

Related Research Articles

Genius is a characteristic of original and exceptional insight in the performance of some art or endeavor that surpasses expectations, sets new standards for the future, establishes better methods of operation, or remains outside the capabilities of competitors. Genius is associated with intellectual ability and creative productivity. The term genius can also be used to refer to people characterised by genius, and/or to polymaths who excel across many subjects.

<span class="mw-page-title-main">Intelligence quotient</span> Score from a test designed to assess intelligence

An intelligence quotient (IQ) is a total score derived from a set of standardised tests or subtests designed to assess human intelligence. The abbreviation "IQ" was coined by the psychologist William Stern for the German term Intelligenzquotient, his term for a scoring method for intelligence tests at University of Breslau he advocated in a 1912 book.

The Stanford–Binet Intelligence Scales is an individually administered intelligence test that was revised from the original Binet–Simon Scale by Alfred Binet and Théodore Simon. It is in its fifth edition (SB5), which was released in 2003.

Intellectual giftedness is an intellectual ability significantly higher than average. It is a characteristic of children, variously defined, that motivates differences in school programming. It is thought to persist as a trait into adult life, with various consequences studied in longitudinal studies of giftedness over the last century. There is no generally agreed definition of giftedness for either children or adults, but most school placement decisions and most longitudinal studies over the course of individual lives have followed people with IQs in the top 2.5 percent of the population—that is, IQs above 130. Definitions of giftedness also vary across cultures.

<span class="mw-page-title-main">David Wechsler</span> Romanian-American psychologist (1896–1981)

David Wechsler was a Romanian-American psychologist. He developed well-known intelligence scales, such as the Wechsler Adult Intelligence Scale (WAIS) and the Wechsler Intelligence Scale for Children (WISC) to get to know his patients at Bellevue Hospital. A Review of General Psychology survey, published in 2002, ranked Wechsler as the 51st most cited psychologist of the 20th century.

The Wechsler Adult Intelligence Scale (WAIS) is an IQ test designed to measure intelligence and cognitive ability in adults and older adolescents.

<span class="mw-page-title-main">Triple Nine Society</span> High IQ society

The Triple Nine Society (TNS) is an international high-IQ society for adults whose score on a standardized test demonstrates an IQ at or above the 99.9th percentile of the human population. The society recognizes scores from over 20 intelligence and academic aptitude tests. TNS was founded in 1978 and, since 2010, is a non-profit 501(c)(7) organization incorporated in Virginia, USA. It is the second-largest high-IQ society after Mensa. As of February 2024, TNS reports a member base of over 1,900 adults in 50 countries.

<span class="mw-page-title-main">Lewis Terman</span> American educational psychologist, academic, and eugenicist (1877–1956)

Lewis Madison Terman was an American psychologist, academic, and proponent of eugenics. He was noted as a pioneer in educational psychology in the early 20th century at the Stanford School of Education. Terman is best known for his revision of the Stanford–Binet Intelligence Scales and for initiating the longitudinal study of children with high IQs called the Genetic Studies of Genius. As a prominent eugenicist, he was a member of the Human Betterment Foundation, the American Eugenics Society, and the Eugenics Research Association. He also served as president of the American Psychological Association. A Review of General Psychology survey, published in 2002, ranked Terman as the 72nd most cited psychologist of the 20th century, in a tie with G. Stanley Hall.

Cognitive tests are assessments of the cognitive capabilities of humans and other animals. Tests administered to humans include various forms of IQ tests; those administered to animals include the mirror test and the T maze test. Such testing is used in psychology and psychometrics, as well as other fields studying human and animal intelligence.

The Wechsler Intelligence Scale for Children (WISC) is an individually administered intelligence test for children between the ages of 6 and 16. The Fifth Edition is the most recent version.

Catharine Morris Cox Miles was an American psychologist known for her work on intelligence and genius. Born in San Jose, CA, to Lydia Shipley Bean and Charles Elwood Cox. In 1927 married psychologist Walter Richard Miles. Her sister was classics scholar and Quaker administrator Anna Cox Brinton.

A high-IQ society is an organization that limits its membership to people who have attained a specified score on an IQ test, usually in the top two percent of the population or above. These may also be referred to as genius societies. The largest and oldest such society is Mensa International, which was founded by Roland Berrill and Lancelot Ware in 1946.

The Kaufman Assessment Battery for Children (KABC) is a clinical instrument for assessing cognitive development. Its construction incorporates several recent developments in both psychological theory and statistical methodology. The test was developed by Alan S. Kaufman and Nadeen L. Kaufman in 1983 and revised in 2004. The test has been translated and adopted for many countries, such as the Japanese version of the K-ABC by the Japanese psychologists Tatsuya Matsubara, Kazuhiro Fujita, Hisao Maekawa, and Toshinori Ishikuma.

The Culture Fair Intelligence Test (CFIT) was created by Raymond Cattell in 1949 as an attempt to measure cognitive abilities devoid of sociocultural and environmental influences. Scholars have subsequently concluded that the attempt to construct measures of cognitive abilities devoid of the influences of experiential and cultural conditioning is a challenging one. Cattell proposed that general intelligence (g) comprises both fluid intelligence (Gf) and crystallized intelligence (Gc). Whereas Gf is biologically and constitutionally based, Gc is the actual level of a person's cognitive functioning, based on the augmentation of Gf through sociocultural and experiential learning.

<span class="mw-page-title-main">Mental age</span> Concept relating to intelligence

Mental age is a concept related to intelligence. It looks at how a specific individual, at a specific age, performs intellectually, compared to average intellectual performance for that individual's actual chronological age (i.e. time elapsed since birth). The intellectual performance is based on performance in tests and live assessments by a psychologist. The score achieved by the individual is compared to the median average scores at various ages, and the mental age (x, say) is derived such that the individual's score equates to the average score at age x.

In statistics, a floor effect arises when a data-gathering instrument has a lower limit to the data values it can reliably specify. This lower limit is known as the "floor". The "floor effect" is one type of scale attenuation effect; the other scale attenuation effect is the "ceiling effect". Floor effects are occasionally encountered in psychological testing, when a test designed to estimate some psychological trait has a minimum standard score that may not distinguish some test-takers who differ in their responses on the test item content. Giving preschool children an IQ test designed for adults would likely show many of the test-takers with scores near the lowest standard score for adult test-takers. To indicate differences in current intellectual functioning among young children, IQ tests specifically for young children are developed, on which many test-takers can score well above the floor score. An IQ test designed to help assess intellectually disabled persons might intentionally be designed with easier item content and a lower floor score to better distinguish among individuals taking the test as part of an assessment process.

The Cognitive Abilities Test (CogAT) is a group-administered K–12 assessment published by Riverside Insights and intended to estimate students' learned reasoning and problem solving abilities through a battery of verbal, quantitative, and nonverbal test items. The test purports to assess students' acquired reasoning abilities while also predicting achievement scores when administered with the co-normed Iowa Tests. The test was originally published in 1954 as the Lorge-Thorndike Intelligence Test, after the psychologists who authored the first version of it, Irving Lorge and Robert L. Thorndike. The CogAT is one of several tests used in the United States to help teachers or other school staff make student placement decisions for gifted education programs, and is accepted for admission to Intertel, a high IQ society for those who score at or above the 99th percentile on a test of intelligence.

The Reynolds Intellectual Assessment Scales (RIAS) is an individually administered test of intelligence that includes a co-normed, supplemental measure of memory. It is appropriate for individuals ages 3–94.

The Genetic Studies of Genius, later known as the Terman Study of the Gifted, is currently the oldest and longest-running longitudinal study in the field of psychology. It was begun by Lewis Terman at Stanford University in 1921 to examine the development and characteristics of gifted children into adulthood.

The following outline is provided as an overview of and topical guide to human intelligence:

References

  1. Wechsler 1958, Chapter 3: The Classification of Intelligence
  2. Matarazzo 1972, Chapter 5: The Classification of Intelligence
  3. Gregory 1995, entry "Classification of Intelligence"
  4. 1 2 3 Kamphaus 2005, pp. 518–20 section "Score Classification Schemes"
  5. Gottfredson 2009, pp. 31–32
  6. Hunt 2011, p. 5 "As mental testing expanded to the evaluation of adolescents and adults, however, there was a need for a measure of intelligence that did not depend upon mental age. Accordingly the intelligence quotient (IQ) was developed. ... The narrow definition of IQ is a score on an intelligence test ... where 'average' intelligence, that is the median level of performance on an intelligence test, receives a score of 100, and other scores are assigned so that the scores are distributed normally about 100, with a standard deviation of 15. Some of the implications are that: 1. Approximately two-thirds of all scores lie between 85 and 115. 2. Five percent (1/20) of all scores are above 125, and one percent (1/100) are above 135. Similarly, five percent are below 75 and one percent below 65."
  7. Aiken 1979, p. 139
  8. Anastasi & Urbina 1997, p. 326 "Correlation studies of test scores provide actuarial data, applicable to group predictions. ... Studies of individuals, on the other hand, may reveal large upward or downward shifts in test scores."
  9. Kaufman 2009, pp. 151–153 "Thus, even for tests that measure similar CHC constructs and that represent the most sophisticated, high–quality IQ tests ever available at any point in time, IQs differ."
  10. 1 2 Terman 1916, p.  79 "What do the above IQ's imply in such terms as feeble-mindedness, border-line intelligence, dullness, normality, superior intelligence, genius, etc.? When we use these terms two facts must be born in mind: (1) That the boundary lines between such groups are absolutely arbitrary, a matter of definition only; and (2) that the individuals comprising one of the groups do not make up a homogeneous type."
  11. 1 2 Wechsler 1939, p. 37 "The earliest classifications of intelligence were very rough ones. To a large extent they were practical attempts to define various patterns of behavior in medical-legal terms."
  12. Kaufman 2009, Figure 5.1 IQs earned by preadolescents (ages 12–13) who were given three different IQ tests in the early 2000s
  13. Kaufman 2013, Figure 3.1 "Source: A. S. Kaufman. IQ Testing 101 (New York: Springer, 2009). Adapted with permission."
  14. Mackintosh 2011, p. 169 "after the age of 8–10, IQ scores remain relatively stable: the correlation between IQ scores from age 8 to 18 and IQ at age 40 is over 0.70."
  15. 1 2 Carducci, Bernardo J.; Nave, Christopher S.; Fabio, Annamaria; Saklofske, Donald H.; Stough, Con, eds. (2020-09-18). The Wiley Encyclopedia of Personality and Individual Differences. doi:10.1002/9781119547174. ISBN   9781119057536.
  16. "Front Matter". The Wiley Encyclopedia of Personality and Individual Differences: 447. 2020-09-18. doi:10.1002/9781119547174.fmatter. ISBN   9781119547174.
  17. Uzieblo et al. 2012, p. 34 "Despite the increasing disparity between total test scores across intelligence batteries—as the expanding factor structures cover an increasing amount of cognitive abilities (Flanagan, et al., 2010)—Floyd et al. (2008) noted that still 25% of assessed individuals will obtain a 10-point IQ score difference with another IQ battery. Even though not all studies indicate significant discrepancies between intelligence batteries at the group level (e.g., Thompson et al., 1997), the absence of differences at the individual level cannot be automatically assumed."
  18. Ryan, Joseph J.; Glass, Laura A.; Bartels, Jared M. (2010-02-10). "Stability of the WISC-IV in a Sample of Elementary and Middle School Children". Applied Neuropsychology. 17 (1): 68–72. doi:10.1080/09084280903297933. ISSN   0908-4282. PMID   20146124. S2CID   205615200.
  19. Shurkin 1992, pp. 89–90 (citing Burks, Jensen & Terman, The Promise of Youth: Follow–up Studies of a Thousand Gifted Children 1930) "Twelve even dropped below the minimum for the Terman study, and one girl fell below 104, barely above average for the general population. ... Interestingly, while his tests measured decreases in test scores, the parents of the children noted no changes at all. Of all the parents who filled out the home questionnaire, 45 percent perceived no change in their children, 54 percent thought their children were getting brighter, including the children whose scores actually dropped."
  20. Carducci, Bernardo J.; Nave, Christopher S.; Fabio, Annamaria; Saklofske, Donald H.; Stough, Con, eds. (2020-09-18). The Wiley Encyclopedia of Personality and Individual Differences. p. 461. doi:10.1002/9781119547174. ISBN   9781119057536.
  21. Sattler 2008, p. 121 "Whenever you report an overall standard score (e.g., a Full Scale IQ or a similar standard score), accompany it with a confidence interval (see Chapter 4). The confidence interval is a function of both the standard error of measurement and the confidence level: the greater the confidence level (e.g., 99% > 95% > 90% > 85% > 68%) or the lower the reliablility of the test (rxx = .80 <rxx = .85 <rxx = .90), the wider the confidence interval. Psychologists usually use a confidence interval of 95%."
  22. Matarazzo 1972, p. 121 "The psychologist's effort at classifying intelligence utilizes, at present, an ordinal scale, and is akin to what a layman does when he tries to distinguish colors of the rainbow." (emphasis in original)
  23. Gottfredson 2009, pp. 32–33 "We cannot be sure that IQ tests provide interval–level measurement rather than just ordinal–level (i.e., rank–order) measurement. ... we really do not know whether a 10–point difference measures the same intellectual difference at all ranges of IQ."
  24. Mackintosh 2011, pp. 33–34 "Although many psychometricians have argued otherwise (e.g., Jensen 1980), it is not immediately obvious that IQ is even an interval scale, that is, one where, say, the ten–point difference between IQ scores of 110 and 100 is the same as the ten–point difference between IQs of 160 and 150. The most conservative view would be that IQ is simply an ordinal scale: to say that someone has an IQ of 130 is simply to say that their test score lies within the top 2.5% of a representative sample of people the same age."
  25. Jensen 2011, p. 172 "The problem with IQ tests and virtually all other scales of mental ability in popular use is that the scores they yield are only ordinal (i.e., rank-order) scales; they lack properties of true ratio scales, which are essential to the interpretation of the obtained measures."
  26. Flynn 2012, p. 160 (quoting Jensen, 2011)
  27. Kaufman & Lichtenberger 2006, pp. 198–202 (section "Scoring Errors") "Bias errors were in the direction of leniency for all subtests, with Comprehension producing the strongest halo effect."
  28. Reynolds & Horton 2012, Table 4.1 Descriptions for Standard Score Performances Across Selected Pediatric Neuropsychology Tests
  29. Aiken 1979, p. 158
  30. Sattler 1988, p. 736
  31. Sattler 2001, p. 698 "Tests usually provide some system by which to classify scores. Follow the specified classification system strictly, labeling scores according to what is recommended in the test manual. If you believe that a classification does not accurately reflect the examinee's status, state your concern in the report when you discuss the reliability and validity of the findings."
  32. Carducci, Bernardo J.; Nave, Christopher S.; Fabio, Annamaria; Saklofske, Donald H.; Stough, Con, eds. (2020-09-18). The Wiley Encyclopedia of Personality and Individual Differences. p. 449. doi:10.1002/9781119547174. ISBN   9781119057536.
  33. Gottfredson 2009, p. 32 "One searches in vain, for instance, for a good accounting of the capabilities that 10-year-olds, 15-year-olds, or adults of 110 usually possess but similarly aged individuals of IQ 90 do not ... IQ tests are not intended to isolate and measure highly specific skills and knowledge. This is the job of suitably designed achievement tests."
  34. Kaufman & Lichtenberger 2006, p. 89
  35. 1 2 3 4 5 6 7 8 9 Urbina 2011, Table 2.1 Major Examples of Current Intelligence Tests
  36. Flanagan & Harrison 2012, chapters 8-13, 15-16 (discussing Wechsler, Stanford–Binet, Kaufman, Woodcock–Johnson, DAS, CAS, and RIAS tests)
  37. Mackintosh 2011, p. 32 "The most widely used individual IQ tests today are the Wechsler tests, first published in 1939 as the Wechsler–Bellevue Scale."
  38. Saklofske et al. 2003, p. 3 "To this day, the Wechsler tests remain the most often used individually administered, standardized measures for assessing intelligence in children and adults" (citing Camara, Nathan & Puente, 2000; Prifitera, Weiss & Saklofske, 1998)
  39. Georgas et al. 2003, p. xxv "The Wechsler tests are perhaps the most widely used intelligence tests in the world"
  40. Meyer & Weaver 2005, p. 219 Campbell 2006, p. 66 Strauss, Sherman & Spreen 2006, p. 283 Foote 2007, p. 468 Kaufman & Lichtenberger 2006, p. 7 Hunt 2011, p. 12
  41. Weiss et al. 2006, Table 5 Qualitative Descriptions of Composite Scores
  42. 1 2 3 Sattler 2008, inside back cover
  43. Kaufman, Alan S.; Engi Raiford, Susan; Coalson, Diane L. (2016). Intelligent Testing With the WISC-V. Hoboken, New Jersey: John Wiley & Sons. p. 237. ISBN   978-1-118-58923-6.
  44. Kamphaus 2005, p. 519 "Although the Wechsler classification system for intelligence test scores is by far the most popular, it may not be the most appropriate (Reynolds & Kaufman 1990)."
  45. Groth-Marnat 2009, p. 136
  46. Groth-Marnat 2009, Table 5.5
  47. Carducci, Bernardo J.; Nave, Christopher S.; Fabio, Annamaria; Saklofske, Donald H.; Stough, Con, eds. (2020-09-18). The Wiley Encyclopedia of Personality and Individual Differences. p. 452. doi:10.1002/9781119547174. ISBN   9781119057536.
  48. 1 2 Kaufman 2009, p. 112
  49. 1 2 3 Kamphaus 2005, p. 337
  50. Kamphaus 2005, pp. 367–68
  51. Kaufman et al. 2005, Table 3.1 Descriptive Category System
  52. Gallagher & Sullivan 2011, p. 347
  53. Naglieri 1999, Table 4.1 Descriptive Categories of PASS and Full Scale Standard Scores
  54. Dumont, Willis & Elliot 2009, p. 11
  55. Dumont, Willis & Elliot 2009, p. 20
  56. Dumont & Willis 2013, "Range of DAS Subtest Scaled Scores" (Web resource)
  57. Dumont, Willis & Elliot 2009, Table Rapid Reference 5.1 DAS-II Classification Schema
  58. Reynolds & Kamphaus 2003, p. 30 (Table 3.2 RIAS Scheme of Verbal Descriptors of Intelligence Test Performance)
  59. Spearman 1904
  60. Wasserman 2012, pp. 19–20 "The scale does not pretend to measure the entire mentality of the subject, but only general intelligence. (citing Terman, 1916, p. 48, emphasis in original)
  61. Wasserman 2012, p. 19 "No foreign-born or minority children were included. ... The overall sample was predominantly white, urban, and middle-class"
  62. 1 2 Terman 1916, p.  79
  63. Kaufman 2009, p. 110
  64. Naglieri 1999, p. 7 "The concept of general intelligence was assumed to exist, and psychologists went about 'the measurement of an individual's general ability without waiting for an adequate psychological definition.' (Pintner, 1923, p. 52)."
  65. Pintner 1931, p. 117
  66. 1 2 Levine & Marks 1928, p. 131
  67. 1 2 Kamphaus et al. 2012, pp. 57–58 (citing Levine and Marks, page 131)
  68. Wasserman 2012, p. 35 "Inexplicably, Terman and Merrill made the mistake of retaining a ratio IQ (i.e., mental age/chronological age) on the 1937 Stanford–Binet, even though the method had long been recognized as producing distorted IQ estimates for adolescents and adults (e.g., Otis, 1917). Terman and Merrill (1937, pp. 27–28) justified their decision on the dubious ground that it would have been too difficult to reeducate teachers and other test users familiar with ratio IQ."
  69. 1 2 3 4 Terman & Merrill 1960, p. 18
  70. Terman & Merrill 1937, p. 20
  71. Wasserman 2012, p. 35 "The 1939 test battery (and all subsequent Wechsler intelligence scales) also offered a deviation IQ, the index of intelligence based on statistical difference from the normative mean in standardized units, as Arthur Otis (1917) had proposed. Wechsler deserves credit for popularizing the deviation IQ, although the Otis Self-Administering Tests and the Otis Group Intelligence Scale had already used similar deviation-based composite scores in the 1920s."
  72. Wechsler 1939, pp. 39–40 "We have seen equivalent Binet I.Q. ratings reported for nearly every intelligence test now in use. In most cases the reporters proceeded to interpret the I.Q.'s obtained as if the tests measured the same thing as the Binet, and the indices calculated were equivalent to those obtained on the Stanford–Binet. ... The examiners were seemingly unaware of the fact that identical I.Q.'s on the different tests might well represent very different orders of intelligence."
  73. Wechsler 1958, pp. 42–43 "In brief, mental age is no more an absolute measure of intelligence than any other test score."
  74. Wechsler 1958, p. 42 Table 3 Intelligence classification of WAIS IQ's
  75. Terman & Merrill 1960, pp. 276–296 (scoring tables for 1960 Stanford–Binet)
  76. Freides 1972, pp. 772–773 "My comments in 1970 [published in 1972] are not very different from those made by F. L. Wells 32 years ago in The 1938 Mental Measurements Yearbook. The Binet scales have been around for a long time and their faults are well known."
  77. 1 2 Gregory 1995, Table 4 Ability classifications, IQ ranges, and percent of norm sample for contemporary tests
  78. Naglieri 1999, p.  7 "In fact, the stagnation of intelligence tests is apparent in Brody's (1992) statement: 'I do not believe that our intellectual progress has had a major impact on the development of tests of intelligence' (p. 355)."
  79. Sattler 1988, Table BC-2 Classification Ratings on Stanford–Binet: Fourth Edition, Wechsler Scales, and McCarthy Scales
  80. Kaufman 2009, p. 122
  81. American Psychiatric Association 2013, pp. 33–37 Intellectual Disability (Intellectual Development Disorder): Specifiers "The various levels of severity are defined on the basis of adaptive functioning, and not IQ scores, because it is adaptive functioning that determines the level of supports required. Moreover, IQ measures are less valid in the lower end of the IQ range."
  82. Flanagan & Kaufman 2009, p. 134 (emphasis in original)
  83. Flynn 2012, Chapter 4: Death, Memory, and Politics
  84. Shaw, Steven R.; Anna M.; Jankowska (2018). Pediatric Intellectual Disabilities at School. Brooklyn, New York: Springer. p. 5. ISBN   978-3-030-02990-6.
  85. Gernsbacher, Morton Ann; Raimond, Adam R.; Balinghasay, M. Theresa; Boston, Jilana S. (2016-12-19). ""Special needs" is an ineffective euphemism". Cognitive Research: Principles and Implications. 1 (1): 29. doi: 10.1186/s41235-016-0025-4 . ISSN   2365-7464. PMC   5256467 . PMID   28133625.
  86. Nash, Chris; Hawkins, Ann; Kawchuk, Janet; Shea, Sarah E (February 2012). "What's in a name? Attitudes surrounding the use of the term 'mental retardation'". Paediatrics & Child Health. 17 (2): 71–74. doi:10.1093/pch/17.2.71. ISSN   1205-7088. PMC   3299349 . PMID   23372396.
  87. Rafter, Nicole Hahn (1998). Creating Born Criminals. University of Illinois Press, ISBN   978-0-252-06741-9
  88. Cummings NA, Wright RH (2005). "Chapter 1, Psychology's surrender to political correctness". Destructive trends in mental health: the well-intentioned path to harm. New York: Routledge. ISBN   978-0-415-95086-2.
  89. Treadway, Walter L. (1916). "The Feeble-Minded: Their Prevalence and Needs in the School Population of Arkansas". Public Health Reports. 31 (47): 3231–3247. doi:10.2307/4574285. hdl: 2027/loc.ark:/13960/t4hm5zr5h . ISSN   0094-6214. JSTOR   4574285. S2CID   68261373.
  90. 1 2 "cretin". The American Heritage Dictionary of the English Language, Fourth Edition. Houghton Mifflin Company. 2006. Archived from the original on 2008-09-14. Retrieved 2008-08-04.
  91. Howard-Jones N (January 1979). "On the diagnostic term "Down's disease"". Medical History. 23 (1): 102–4. doi:10.1017/s0025727300051048. PMC   1082401 . PMID   153994.
  92. "Worst Word Vote". Ouch. BBC. 2003. Archived from the original on 2007-03-20. Retrieved 2007-08-17.
  93. Rosa's Law, Pub. L. 111-256, 124 Stat. 2643 (2010).
  94. "SpecialOlympics.org". SpecialOlympics.org. Archived from the original on 2010-07-30. Retrieved 2010-06-29.
  95. Pintner 1931, pp. 356–357 "From a study of these boyhood records, estimates of the probable I.Q.s of these men in childhood have been made. ... It is of course obvious that much error may creep into an experiment of this sort, and the I.Q. assigned to any one individual is merely a rough estimate, depending to some extent upon how much information about his boyhood years has come down to us."
  96. Shurkin 1992, pp. 70–71 "She, of course, was not measuring IQ, she was measuring the length of biographies in a book. Generally, the more information, the higher the IQ. Subjects were dragged down if there was little information about their early lives."
  97. Eysenck 1995, p. 59 "Cox might well have been advised to reject a few of her geniuses for lack of evidence." Eysenck 1998, p. 126 "Cox found that the more was known about a person's youthful accomplishments, that is, what he had done before he was engaged in doing the things that made him known as a genius, the higher was his IQ ... So she proceeded to make a statistical correction in each case for lack of knowledge; this bumped up the figure considerably for the geniuses about whom little was in fact known. ... I am rather doubtful about the justification for making the correction."
  98. Cox 1926, pp. 215–219, 218 (Chapter XIII: Conclusions) "3. That all equally intelligent children do not as adults achieve equal eminence is in part accounted for by our last conclusion: youths who achieve eminence are characterized not only by high intellectual traits, but also by persistence of motive and effort, confidence in their abilities, and great strength or force of character." (emphasis in original)
  99. Kaufman 2009, p. 117 "Terman (1916), as I indicated, used near genius or genius for IQs above 140, but mostly very superior has been the label of choice" (emphasis in original)
  100. Wechsler 1939, p. 45
  101. Eysenck 1998, pp. 127–128 "Terman, who originated those 'Genetic Studies of Genius', as he called them, selected ... children on the basis of their high IQs, the mean was 151 for both sexes. Seventy–seven who were tested with the newly translated and standardized Binet test had IQs of 170 or higher–well at or above the level of Cox's geniuses. What happened to these potential geniuses–did they revolutionize society? ... The answer in brief is that they did very well in terms of achievement, but none reached the Nobel Prize level, let alone that of genius. ... It seems clear that these data powerfully confirm the suspicion that intelligence is not a sufficient trait for truly creative achievement of the highest grade."
  102. Simonton 1999, p.  4 "When Terman first used the IQ test to select a sample of child geniuses, he unknowingly excluded a special child whose IQ did not make the grade. Yet a few decades later that talent received the Nobel Prize in physics: William Shockley, the cocreator of the transistor. Ironically, not one of the more than 1,500 children who qualified according to his IQ criterion received so high an honor as adults."
  103. Shurkin 2006, p.  13 (See also "The Truth About the 'Termites'"; Kaufman, S. B. 2009)
  104. Leslie 2000. "We also know that two children who were tested but didn't make the cut -- William Shockley and Luis Alvarez -- went on to win the Nobel Prize in Physics. According to Hastorf, none of the Terman kids ever won a Nobel or Pulitzer."
  105. Park, Lubinski & Benbow 2010. "There were two young boys, Luis Alvarez and William Shockley, who were among the many who took Terman's tests but missed the cutoff score. Despite their exclusion from a study of young 'geniuses,' both went on to study physics, earn PhDs, and win the Nobel prize."
  106. Gleick 2011, p.  32 "Still, his score on the school IQ test was a merely respectable 125."
  107. Robinson 2011, p.  47 "After all, the American physicist Richard Feynman is generally considered an almost archetypal late 20th-century genius, not just in the United States but wherever physics is studied. Yet, Feynman's school-measured IQ, reported by him as 125, was not especially high"
  108. Spearman 1927, p. 221
  109. Sternberg, Jarvin & Grigorenko 2010, Chapter 2: Theories of Giftedness
  110. McIntosh, Dixon & Pierson 2012, pp. 636–637
  111. 1 2 Gross 2000, pp. 3–9
  112. Terman & Merrill 1937, p. 44 "The reader should not lose sight of the fact that a test with even a high reliability yields scores which have an appreciable probable error. The probable error in terms of mental age is of course larger with older than with young children because of the increasing spread of mental age as we go from younger to older groups. For this reason it has been customary to express the P.E. [probable error] of a Binet score in terms of I.Q., since the spread of Binet I.Q.'s is fairly constant from age to age. However, when our correlation arrays [between Form L and Form M] were plotted for separate age groups they were all discovered to be distinctly fan-shaped. Figure 3 is typical of the arrays at every age level. From Figure 3 it becomes clear that the probable error of an I.Q. score is not a constant amount, but a variable which increases as I.Q. increases. It has frequently been noted in the literature that gifted subjects show greater I.Q. fluctuation than do clinical cases with low I.Q.'s ... we now see that this trend is inherent in the I.Q. technique itself, and might have been predicted on logical grounds."
  113. Lohman & Foley Nicpon 2012, Section "Conditional SEMs" "The concerns associated with SEMs [standard errors of measurement] are actually substantially worse for scores at the extremes of the distribution, especially when scores approach the maximum possible on a test ... when students answer most of the items correctly. In these cases, errors of measurement for scale scores will increase substantially at the extremes of the distribution. Commonly the SEM is from two to four times larger for very high scores than for scores near the mean (Lord, 1980)."
  114. Lohman & Foley Nicpon 2012, Section "Scaling Issues" "The spreading out of scores for young children at the extremes of the ratio IQ scale is viewed as a positive attribute of the SB-LM by clinicians who want to distinguish among the highly and profoundly gifted (Silverman, 2009). Although spreading out the test scores in this way may be helpful, the corresponding normative scores (i.e., IQs) cannot be trusted both because they are based on out-of-date norms and because the spread of IQ scores is a necessary consequence of the way ratio IQs are constructed, not a fact of nature."
  115. Hunt 2011, p. 8
  116. Perleth, Schatz & Mönks 2000, p.  301 "Norm tables that provide you with such extreme values are constructed on the basis of random extrapolation and smoothing but not on the basis of empirical data of representative samples."
  117. Urbina 2011, Chapter 2: Tests of Intelligence. "[Curve-fitting] is just one of the reasons to be suspicious of reported IQ scores much higher than 160"
  118. Lohman & Foley Nicpon 2012, Section "Scaling Issues" "Modern tests do not produce such high scores, in spite of heroic efforts to provide extended norms for both the Stanford Binet, Fifth Edition (SB-5) and the WISC-IV (Roid, 2003; Zhu, Clayton, Weiss, & Gabel, 2008)."

Bibliography