High-stakes testing

Last updated
A driving test is a high-stakes test: Without passing the test, the test taker cannot obtain a driver's license. 2009-03-11 Student driver SB on N Gregson St in Durham.jpg
A driving test is a high-stakes test: Without passing the test, the test taker cannot obtain a driver's license.

A high-stakes test is a test with important consequences for the test taker. [1] Passing has important benefits, such as a high school diploma, a scholarship, or a license to practice a profession. Failing has important disadvantages, such as being forced to take remedial classes until the test can be passed, not being allowed to drive a car, or difficulty finding employment.

Contents

The use and misuse of high-stakes tests is a controversial topic in public education, especially in the United States and U.K., where they have become especially popular in recent years, used not only to assess school-age students but in attempts to increase teacher accountability. [2]

Definitions

This dance audition is a high-stakes test. If the dancers don't pass the test, they won't get a part in the upcoming show. Asia Dance Audition Round.1.jpg
This dance audition is a high-stakes test. If the dancers don't pass the test, they won't get a part in the upcoming show.

In common usage, a high-stakes test is any test that has major consequences or is the basis of a major decision. [1] [3] [4]

Under a more precise definition, a high-stakes test is any test that:

For example, exit examinations for high school graduation are often high-stakes tests: there is a single, defined test (the student must pass this test; no other test can be substituted); some scores are high enough to pass and others are not; and failing has the direct consequence of preventing graduation. Similarly, driving tests are often high-stakes, as they also meet the same three criteria.

High-stakes testing is not synonymous with high-pressure testing. An American high school student might feel pressure to perform well on the SAT-I college aptitude exam. However, SAT scores do not directly determine admission to any college or university, and there is no clear line drawn between those who pass and those who fail, so it is not formally considered a high-stakes test. [6] [7] On the other hand, because the SAT-I scores are given significant weight in the admissions process at some schools, many people believe that it has consequences for doing well or poorly, and it could therefore be considered a high-stakes test under the simpler, common definition. [8] [9]

A high-stakes test can be contrasted with a medium-stakes test or a low-stakes test. [7] A medium-stakes test might provide access to a desirable but less necessary benefit, such as an award, or it is only one component of a decision-making process, such as an admissions program that looks at the test results plus other factors. A low-stakes test has no significant consequences to the test taker.

The stakes

High stakes are not a characteristic of the test itself, but rather of the consequences placed on the outcome. For example, no matter what type of test is used—written essays, computer-based multiple choice, oral examination, performance test, or anything else—a medical licensing test must be passed to practice medicine.

The perception of the stakes may vary. For example, college students who wish to skip an introductory-level course are often given exams to see whether they have already mastered the material and can be passed to the next level. Passing the exam can reduce tuition costs and time spent at university. A student who is anxious to have these benefits may consider the test to be a high-stakes exam. Another student, who places no importance on the outcome, so long as he is placed in a class that is appropriate to his skill level, may consider the same exam to be a low-stakes test. [5]

The phrase "high stakes" is derived directly from a gambling term. In gambling, a stake is the quantity of money or other goods that is risked on the outcome of some specific event. A high-stakes game is one in which, in the player's personal opinion, a large quantity of money is being risked. The term is meant to imply that implementing such a system introduces uncertainty and potential losses for test takers,[ citation needed ] who must pass the exam to "win," instead of being able to obtain the goal through other means.[ citation needed ]

Examples

Examples of high-stakes tests and their "stakes" include:

Stakeholders

A high-stakes system may be intended to benefit people other than the test-taker. For professional certification and licensure examinations, the purpose of the test is to protect the general public from incompetent practitioners. The individual stakes of the medical student and the medical school are, hopefully, balanced against the social stakes of possibly allowing an incompetent doctor to practice medicine. [10]

A test may be "high-stakes" based on consequences for others beyond the individual test-taker. [4] For example, an individual medical student who fails a licensing exam cannot practice his or her profession. However, if enough students at the same school fail the exam, the school's reputation and accreditation may be jeopardized. Similarly, testing under the U.S.'s No Child Left Behind Act had no direct negative consequences for failing students, [11] but potentially serious consequences for their schools, including loss of accreditation, funding, teacher pay, teacher employment, or changes to the school's management. [12] The stakes were therefore high for the school, but low for the individual test-takers.

Assessments used

Any form of assessment can be used as a high-stakes test. Many times, an inexpensive multiple-choice test is chosen for convenience. A high-stakes assessment may also involve answering open-ended questions or a practical, hands-on section. For example, a typical high-stakes licensing exam for a medical nurse determines whether the nurse can insert an I.V. line by watching the nurse actually do this task. These assessments are called authentic assessments or performance tests. [5]

Some high-stakes tests may be standardized tests (in which all examinees take the same test under reasonably equal conditions), with the expectation that standardization affords all examinees a fair and equal opportunity to pass. [5] Some high-stakes tests are non-standardized, such as a theater audition.

As with other tests, high-stakes tests may be criterion-referenced or norm-referenced. [5] For example, a written driver's license examination typically is criterion-referenced, with an unlimited number of potential drivers able to pass if they correctly answer a certain percentage of questions. On the other hand, essay portions of some bar exams are often norm-referenced, with the worst essays failed and the best essays passed, without regard for the overall quality of the essays.

The "clear line" between passing and failing on an exam may be achieved through use of a cut score: for example, test takers correctly answering 75% or more of the questions pass the test; test takers correctly answering 74% or fewer fail, or don't "make the cut". In large-scale high-stakes testing, rigorous and expensive standard-setting studies may be employed to determine the ideal cut score or to keep the test results consistent between groups taking the test at different times.

Criticisms

High-stakes tests, despite their extensive usage for determination of academic and non-academic proficiency, are subject to criticism for various reasons. Example concerns include the following:

Advantages

In addition to the criticisms, high-stakes testing retains some advantages:

Related Research Articles

<span class="mw-page-title-main">SAT</span> Standardized test widely used for college admissions in the United States

The SAT is a standardized test widely used for college admissions in the United States. Since its debut in 1926, its name and scoring have changed several times. For much of its history, it was called the Scholastic Aptitude Test and had two components, Verbal and Mathematical, each of which was scored on a range from 200 to 800. Later it was called the Scholastic Assessment Test, then the SAT I: Reasoning Test, then the SAT Reasoning Test, then simply the SAT.

<span class="mw-page-title-main">Standardized test</span> Test administered and scored in a predetermined, standard manner

A standardized test is a test that is administered and scored in a consistent, or "standard", manner. Standardized tests are designed in such a way that the questions and interpretations are consistent and are administered and scored in a predetermined, standard manner.

<span class="mw-page-title-main">Victorian Certificate of Education</span> School qualification offered in Victoria, Australia

The Victorian Certificate of Education is one credential available to secondary school students who successfully complete year 11 and 12 in the Australian state of Victoria as well as in some international schools in China, Malaysia, Philippines, Timor-Leste and Vietnam. The VCE is the predominant choice for students wishing to pursue tertiary education. An alternative to VCE is the Victorian Certificate of Applied Learning (VCAL), a vocational based senior secondary school qualification. About 67% of all 19-year-olds in Victoria had completed the VCE in 2020, compared to about 11% of students completing the VCAL. A small number of government secondary schools, and a somewhat larger number of private schools, offer the IB Diploma Programme as an alternative.

Educational assessment or educational evaluation is the systematic process of documenting and using empirical data on the knowledge, skill, attitudes, aptitude and beliefs to refine programs and improve student learning. Assessment data can be obtained from directly examining student work to assess the achievement of learning outcomes or can be based on data from which one can make inferences about learning. Assessment is often used interchangeably with test, but not limited to tests. Assessment can focus on the individual learner, the learning community, a course, an academic program, the institution, or the educational system as a whole. The word 'assessment' came into use in an educational context after the Second World War.

<span class="mw-page-title-main">College Board</span> US educational nonprofit testing organization

The College Board is an American not-for-profit organization that was formed in December 1899 as the College Entrance Examination Board (CEEB) to expand access to higher education. While the College Board is not an association of colleges, it runs a membership association of institutions, including over 6,000 schools, colleges, universities, and other educational organizations.

Electronic assessment, also known as digital assessment, e-assessment, online assessment or computer-based assessment, is the use of information technology in assessment such as educational assessment, health assessment, psychiatric assessment, and psychological assessment. This covers a wide range of activities ranging from the use of a word processor for assignments to on-screen testing. Specific types of e-assessment include multiple choice, online/electronic submission, computerized adaptive testing such as the Frankfurt Adaptive Concentration Test, and computerized classification testing.

<span class="mw-page-title-main">Florida Comprehensive Assessment Test</span> Standardized test formerly used in public schools in Florida

The Florida Comprehensive Assessment Test, or the FCAT/FCAT 2.0, was the standardized test used in the primary and secondary public schools of Florida. First administered statewide in 1998, it replaced the State Student Assessment Test (SSAT) and the High School Competency Test (HSCT). As of the 2014-2015 school year FCAT was replaced in the state of Florida. The Florida Department of Education later implemented the Florida Standards Assessments (FSA) for English Language Arts, Reading, Mathematics and a Writing or typing test. A Comprehensive science test is still used for grades 5 and 8.

<span class="mw-page-title-main">Hong Kong Certificate of Education Examination</span>

The Hong Kong Certificate of Education Examination was a standardised examination between 1974 and 2011 after most local students' five-year secondary education, conducted by the Hong Kong Examinations and Assessment Authority (HKEAA), awarding the Hong Kong Certificate of Education secondary school leaving qualification. The examination has been discontinued in 2012 and its roles are now replaced by the Hong Kong Diploma of Secondary Education as part of educational reforms in Hong Kong. It was considered as the equivalent of the GCSE in the United Kingdom.

University admission or college admission is the process through which students enter tertiary education at universities and colleges. Systems vary widely from country to country, and sometimes from institution to institution.

In New York State, Regents Examinations are statewide standardized examinations in core high school subjects. Students are required to pass these exams to earn a Regents Diploma. To graduate, students are required to have earned appropriate credits in a number of specific subjects by passing year-long or half-year courses, after which they must pass at least five Regents examinations in some of the subject areas. For higher-achieving students, a Regents with Advanced designation and an Honors designation are also offered. Students with disabilities or enrolled in an English as a Second Language program are able to earn a local diploma.

<span class="mw-page-title-main">Gaokao</span> Chinese examinations for entrance to higher education

The National College Entrance Examination (NCEE), commonly known as the gaokao, is a standardized college entrance exam held annually in Mainland China. It is required for entrance into all higher education institutions at the undergraduate level in the country. It is usually taken by students in their third and last year of senior high school.

The California High School Exit Examination (CAHSEE) was an examination created by the California Department of Education, that was previously mandated to administer in high schools statewide in order to graduate. The examination was suspended in 2015, when Governor Jerry Brown signed a bill undoing the decade old requirement. It was originally created by the California Department of Education to improve the academic performance of California high school students, and especially of high school graduates, in the areas of reading, writing, and mathematics. In addition to other graduation requirements, public school students needed to pass the exam before they could receive a high school diploma.

An exit examination is a test that students must pass to receive a diploma and graduate from school. Such examinations have been used in a variety of countries; this article focuses on their use within the United States. These are usually criterion-referenced tests which were implemented as part of a comprehensive standards-based education reform program which sets into place new standards intended to increase the learning of all students.

The General Certificate of Education (GCE) Advanced Level, or A Level, is a main school leaving qualification in England, Wales, Northern Ireland, the Channel Islands and the Isle of Man. It is available as an alternative qualification in other countries.

<span class="mw-page-title-main">Test preparation</span> Preparation for standardized tests

Test preparation or exam preparation is an educational course, tutoring service, educational material, or a learning tool designed to increase students' performance on standardized tests. Examples of these tests include entrance examinations used for admissions to institutions of higher education, such as college, business school, law school, medical school, BMAT, UKCAT and GAMSAT and graduate school and qualifying examinations for admission to gifted education programs.

<span class="mw-page-title-main">Exam</span> Educational assessment

An examination or test is an educational assessment intended to measure a test-taker's knowledge, skill, aptitude, physical fitness, or classification in many other topics. A test may be administered verbally, on paper, on a computer, or in a predetermined area that requires a test taker to demonstrate or perform a set of skills.

Placement testing is a practice that many colleges and universities use to assess college readiness and determine which classes a student should initially take. Since most two-year colleges have open, non-competitive admissions policies, many students are admitted without college-level academic qualifications. Placement tests assess abilities in English, mathematics and reading; they may also be used in other disciplines such as foreign languages, computer and internet technologies, health and natural sciences. The goal is to offer low-scoring students remedial coursework to prepare them for regular coursework. Less-prepared students are placed into various remedial situations, from adult basic education through various levels of developmental college courses.

In Nigeria, the academic grading system scales from A to F (fail). Below is the grading system of Nigerian schools.

References

  1. 1 2 "Lexicon of Learning". Association for Supervision and Curriculum Development.
  2. Rosemary Sutton; Kelvin Seifert (2009). "Chapter 1: The Changing Teaching Profession and You". Educational Psychology (PDF) (2nd ed.). p. 14.
  3. 1 2 3 Togut, Torin D. "High-Stakes Testing: Educational Barometer for Success, or False Prognosticator for Failure". The Beacon. No. Fall 2004. Harbor House Law Press.
  4. 1 2 Torin D. Togut. "EDEX 790 Glossary of Education Terms". Archived from the original on January 11, 2009. Retrieved July 23, 2009.
  5. 1 2 3 4 5 "The nature of assessment: A guide to standardized testing — Center for Public Education". Archived from the original on July 25, 2011. Retrieved July 23, 2009.
  6. Pfeiffer, Steven I (Winter 2009). "The Debate about Using the SAT in College Admissions". Duke University Talent Identification Program. Archived from the original on 2009-10-14. Gaston Caperton, president of the College Board, which publishes the SAT, counters that the SAT I is "not a high-stakes test" but is a useful admissions tool when considered along with other evidence of a student's potential for college success.
  7. 1 2 Phelps, Richard P. (June 2010). "Source of Lake Wobegon" (PDF). Nonpartisan Education Review. Retrieved 2020-10-18.
  8. Mari Pearlman (April 4, 2001). "High-stakes Testing: Perils & Opportunities". Archived from the original on 2009-09-25. Retrieved July 23, 2009.
  9. Eddy Ramírez (30 April 2008). "Admissions Officials Shrug at SAT Writing Test" . Retrieved 24 July 2009.
  10. Mehrens, W.A. (1995). Legal and Professional Bases for Licensure Testing.' In Impara, J.C. (Ed.) Licensure testing: Purposes, procedures, and practices, pp. 33-58. Lincoln, NE: Buros Institute.
  11. "NCLB has nothing to do with the high-stakes nature of the test for students". Archived from the original on 2012-12-13.
  12. Greene, Jay P.; Marcus A. Winters; Greg Forster (February 2003). "Testing High Stakes Tests: Can We Believe the Results of Accountability Tests?". Civic Report. Manhattan Institute for Policy Research.
  13. File, Thom; Ryan, Camille (November 2014). "Computer and Internet Use in the United States: 2013" (PDF). census.gov.
  14. Zuriff GE (1997). "Accommodations for test anxiety under ADA?". J. Am. Acad. Psychiatry Law. 25 (2): 197–206. PMID   9213292.
  15. "Appropriate Use of High-Stakes Testing in Our Nation's Schools". American Psychological Association. Retrieved 2008-01-09.
  16. 1 2 Jacob, Brian A. and Steven D. Levitt (Winter 2004). "To Catch a Cheat" (PDF). Education Next.
  17. "Figure 1-10: Employee/faculty support for high stakes testing: 2000". Archived from the original on 2008-02-07. Retrieved 2008-02-06.
  18. Lewis, Anne (April 2000). High-stakes testing: Trends and issues (PDF) (Report). Mid-Continent Research for Education and Learning. Archived from the original (PDF) on 2011-07-27.
  19. Myers, David (2001). Psychology . New York: Worth Publishers. p.  464. ISBN   1-57259-791-7. Why blame the tests for exposing unequal experiences and opportunities?
  20. Dang, Nick (18 March 2003). "Reform education, not exit exams". Daily Bruin. One common complaint from failed test-takers is that they weren't taught the tested material in school. Here, inadequate schooling, not the test, is at fault. Blaming the test for one's failure is like blaming the service station for a failed smog check; it ignores the underlying problems within the 'schooling vehicle.'
  21. "Tackling the SAT? Test-prep help abounds". Christian Science Monitor . Vol. 90, no. 175. Associated Press. August 4, 1998. pp. B3. ISSN   0882-7729 . Retrieved 2007-07-09. Some parents spend thousands of dollars for private sessions...
  22. Johnson, Dale, Bonnie Johnson, Stephen J. Farenga, & Daniel Ness. (2008). Stop High-Stakes Testing: An Appeal to America's Conscience. Lanham, MD: Rowman & Littlefield.
  23. Weinkopf, Chris (2002). "Blame the test: LAUSD denies responsibility for low scores". Daily News. The blame belongs to 'high-stakes tests' like the Stanford 9 and California's High School Exit Exam. Reliance on such tests, the board grumbles, 'unfairly penalizes students that have not been provided with the academic tools to perform to their highest potential on these tests'.
  24. "Blaming The Test". Investor's Business Daily . 11 May 2006. A judge in California is set to strike down that state's high school exit exam. Why? Because it's working. It's telling students they need to learn more. We call that useful information. To the plaintiffs who are suing to stop the use of the test as a graduation requirement, it's something else: Evidence of unequal treatment ... the exit exam was deemed unfair because too many students who failed the test had too few credentialed teachers. Well, maybe they did, but granting them a diploma when they lack the required knowledge only compounds the injustice by leaving them with a worthless piece of paper.
  25. Kozol, Jonathan (2005). The Shame of the Nation. New York: Crown Publishers. p.  53. ISBN   978-1-4000-5245-5.
  26. Kohn, A. (1999) Confusing Harder with Better.  Retrieved on 1/26/21 from https://www.alfiekohn.org/article/confusing-harder-better/
  27. Eklöf, Hanna (2007). "Test-taking motivation and mathematics performance in TIMSS". International Journal of Testing. 7 (3): 311–326. doi:10.1080/15305050701438074. S2CID   144686714.
  28. Finn B. (2015). Measuring motivation in low-stakes assessments (Research Report RR-15-19). Educational Testing Service.
  29. Hawthorne, K.A.; Bol, L.; Pribesh, S.; Suh, Y. (2015). "Test-taking motivation and mathematics performance in TIMSS". Research and Practice in Assessment. 10: 30–38.
  30. Wise, SL; DeMars, CE (2010). "Examinee noneffort and the validity of program assessment results". Educational Assessment. 15: 27–41. doi:10.1080/10627191003673216. S2CID   143794026.
  31. "The Lake Wobegon Effect: Twenty Years Later". Nonpartisan Education Review.
  32. Cizek, G.J. (1999). Cheating on Tests: How To Do It, Detect It, and Prevent It. Routledge. doi:10.4324/9781410601520. ISBN   9781410601520.
  33. Steger, D.; Schroeders, U.; Gnambs, T. (2018). "A Meta-Analysis of Test Scores in Proctored and Unproctored Ability Assessments". European Journal of Psychological Assessment. 36: 1–11. doi:10.1027/1015-5759/a000494. S2CID   149485786.
  34. U.S. Government Accountability Office (2013). K-12 Education: States' Test Security Policies and Procedures Varied (Report).
  35. Phelps, R. P. (2019). "Test Frequency, Stakes, and Feedback in Student Achievement: A Meta-Analysis". Evaluation Review. 43 (3–4): 111–151. doi:10.1177/0193841X19865628. PMID   31382776. S2CID   199449477.

Further reading