National Assessment of Educational Progress


The National Assessment of Educational Progress (NAEP) is the largest continuing and nationally representative assessment of what U.S. students know and can do in various subjects. NAEP is a congressionally mandated project administered by the National Center for Education Statistics (NCES), within the Institute of Education Sciences (IES) of the U.S. Department of Education. The first national administration of NAEP occurred in 1969. The National Assessment Governing Board (NAGB) is an independent, bipartisan board that sets policy for NAEP and is responsible for developing the framework and test specifications. Its members, who are appointed by the U.S. Secretary of Education, include governors, state legislators, local and state school officials, educators, business representatives, and members of the general public. Congress created the 26-member Governing Board in 1988.


NAEP results are designed to provide group-level data on student achievement in various subjects, and are released as The Nation's Report Card. There are no results for individual students, classrooms, or schools. NAEP reports results for different demographic groups, including gender, socioeconomic status, and race/ethnicity. Assessments are given most frequently in mathematics, reading, science and writing. Other subjects such as the arts, civics, economics, geography, technology and engineering literacy (TEL) and U.S. history are assessed periodically.

In addition to assessing student achievement in various subjects, NAEP also surveys students, teachers, and school administrators to help provide contextual information. Questions asking about participants' race or ethnicity, school attendance, and academic expectations help policy makers, researchers, and the general public better understand the assessment results.

Teachers, principals, parents, policymakers, and researchers all use NAEP results to assess student progress across the country and develop ways to improve education in the United States. NAEP has been providing valid and reliable data on student performance since 1969.

NAEP uses a carefully designed sampling procedure that allows the assessment to be representative of the geographical, racial, ethnic, and socioeconomic diversity of the schools and students in the United States. Data is also provided on students with disabilities and English language learners. Since NAEP assessments are administered uniformly to all participating students using the same test booklets and identical procedures across the nation, NAEP results serve as a common metric for states and select urban districts that participate in the assessment.
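The idea of a representative sample can be illustrated with a minimal sketch of a weighted estimate from a stratified sample. The strata, population counts, and scores below are made-up figures, and the weighting rule (each sampled student stands in for population / sample size students) is a textbook simplification assumed for illustration, not NAEP's actual sampling or weighting procedure.

```python
from dataclasses import dataclass


@dataclass
class Stratum:
    """One hypothetical stratum, for example a group of similar schools."""
    name: str
    population: int       # students in the stratum (made-up figure)
    sample_scores: list   # scores of the sampled students (made-up figures)


def weighted_mean(strata):
    """Estimate the population mean score from a stratified sample.

    Each sampled student represents population / sample_size students,
    so lightly sampled strata are weighted up accordingly.
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for s in strata:
        weight = s.population / len(s.sample_scores)
        weighted_sum += weight * sum(s.sample_scores)
        total_weight += weight * len(s.sample_scores)
    return weighted_sum / total_weight


strata = [
    Stratum("urban", population=6000, sample_scores=[240, 250, 260]),
    Stratum("rural", population=2000, sample_scores=[230, 250]),
]
estimate = weighted_mean(strata)
unweighted = sum(sum(s.sample_scores) for s in strata) / 5
```

In this toy case the weighted estimate differs from the plain average of the five sampled scores, because the two strata were sampled at different rates.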

There are two NAEP websites: the NCES NAEP website and The Nation's Report Card website. The first site details the NAEP program holistically, while the second focuses primarily on the individual releases of data.



NAEP began in 1964, with a grant from the Carnegie Corporation to set up the Exploratory Committee for the Assessment of Progress in Education (ECAPE). The first national assessments were held in 1969. Voluntary assessments for the states began in 1990 on a trial basis and in 1996 were made a permanent feature of NAEP to be administered every two years. In 2002, selected urban districts participated in the state-level assessments on a trial basis and continue as the Trial Urban District Assessment (TUDA).

The development of a successful NAEP program has involved many, including researchers, state education officials, contractors, policymakers, students, and teachers. [1]


There are two types of NAEP assessments, main NAEP and long-term trend NAEP. This separation makes it possible to meet two objectives:

  1. As educational priorities change, develop new assessment instruments that reflect current educational content and assessment methodology.
  2. Measure student progress over time.


Main NAEP assessments are conducted in a range of subjects with fourth-, eighth- and twelfth-graders across the country. Assessments are given most frequently in mathematics, reading, science, and writing. Other subjects such as the arts, civics, economics, geography, technology and engineering literacy (TEL), and U.S. history are assessed periodically.

These assessments follow subject-area frameworks that are developed by the NAGB and use the latest advances in assessment methodology. [2] Under main NAEP, results are reported at the national level, and in some cases, the state and district levels.


National NAEP reports statistical information about student performance and factors related to educational performance for the nation and for specific demographic groups in the population (e.g., race/ethnicity, gender). It includes students from both public and nonpublic (private) schools and, depending on the subject, reports results for grades 4, 8, and 12.


State NAEP results are available in some subjects for grades 4 and 8. This allows participating states to monitor their own progress over time in mathematics, reading, science, and writing. They can then compare the knowledge and skills of their students with students in other states and with the nation.

The assessments given in the states are exactly the same as those given nationally. Traditionally, state NAEP was assessed only at grades 4 and 8. However, a 2009 [3] pilot program allowed 11 states (Arkansas, Connecticut, Florida, Idaho, Illinois, Iowa, Massachusetts, New Hampshire, New Jersey, South Dakota, and West Virginia) to receive scores at the twelfth-grade level.

Through 1988, NAEP reported only on the academic achievement of the nation as a whole and for demographic groups within the population. Congress passed legislation in 1988 authorizing a voluntary Trial State Assessment. Separate representative samples of students were selected from each state or jurisdiction that agreed to participate in state NAEP. Trial state assessments were conducted in 1990, 1992, and 1994. Beginning with the 1996 assessment, the authorizing statute no longer considered the state component a "trial."

A significant change to state NAEP occurred in 2001 with the reauthorization of the Elementary and Secondary Education Act, also referred to as "No Child Left Behind" legislation. This legislation requires that states which receive Title I funding must participate in state NAEP assessments in mathematics and reading at grades 4 and 8 every two years. State participation in other subjects assessed by state NAEP (science and writing) remains voluntary.

Like all NAEP assessments, state NAEP does not provide individual scores for the students or schools assessed.

Trial Urban District Assessment

The Trial Urban District Assessment (TUDA) is a project developed to determine the feasibility of using NAEP to report on the performance of public school students at the district level. As authorized by Congress, NAEP has administered the mathematics, reading, science, and writing assessments to samples of students in selected urban districts.

TUDA began with six urban districts in 2002, and has since expanded to 27 districts for the 2017 assessment cycle.

Participating districts, with the number of TUDA assessment cycles (2002 through 2017) in which each has taken part:

Albuquerque Public Schools: 4
Atlanta Public Schools: 9
Austin Independent School District: 7
Baltimore City Public Schools: 5
Boston Public Schools: 8
Charlotte-Mecklenburg Schools: 8
Chicago Public Schools: 9
Clark County (NV) School District: 1
Cleveland Metropolitan School District: 8
Dallas Independent School District: 4
Denver Public Schools: 1
Detroit Public Schools: 5
District of Columbia Public Schools: 9
Duval County (FL) Public Schools: 2
Fort Worth (TX) Independent School District: 1
Fresno Unified School District: 5
Guilford County (NC) Schools: 1
Hillsborough County (FL) Public Schools: 4
Houston Independent School District: 9
Jefferson County (KY) Public Schools: 5
Los Angeles Unified School District: 9
Miami-Dade County Public Schools: 5
Milwaukee Public Schools: 4
New York City Department of Education: 9
School District of Philadelphia: 5
San Diego Unified School District: 8
Shelby County (TN) Schools: 1

Long-term trend

Long-term trend NAEP is administered to 9-, 13-, and 17-year-olds periodically at the national level. Long-term trend assessments measure student performance in mathematics and reading and allow the performance of today's students to be compared with students since the early 1970s.

Although long-term trend and main NAEP both assess mathematics and reading, there are several differences between them. In particular, the assessments differ in the content assessed, how often the assessment is administered, and how the results are reported. These and other differences mean that results from long-term trend and main NAEP cannot be compared directly. [4]

Assessment schedule

NAGB sets the calendar for NAEP assessments. The full assessment schedule lists all NAEP assessments since 1968 and those planned through 2017.

Main NAEP assessments are typically administered over approximately six weeks between the end of January and the beginning of March of every year. Long-term trend assessments are typically administered every four years by age group between October and May. All of the assessments are administered by NAEP-contracted field staff across the country.

NAEP State Coordinators (NSC)

NAEP is conducted in partnership with states. The NAEP program provides funding for a full-time NSC in each state. He or she serves as the liaison between NAEP, the state's education agency, and the schools selected to participate.

NSCs provide many important services for the NAEP program and are responsible for:

New digitally-based assessments (DBA)

While most NAEP assessments are administered in a paper-and-pencil format, NAEP is evolving to address the changing educational landscape through its transition to digitally-based assessments. NAEP is using the latest technology available to deliver assessments to students, and as technology evolves, so will the way the DBAs are delivered. The goal is for all NAEP assessments to be paperless by the end of the decade. The 2011 writing assessment was the first to be fully computer-based.

Interactive Computer Tasks (ICTs)

In 2009, ICTs were administered as part of the paper-and-pencil science assessment. Computer delivery allows measurement of science knowledge, processes, and skills that cannot be assessed in other modes. Tasks included carrying out investigations that involve observing phenomena that would otherwise take a long time, modeling phenomena that occur on a very large scale or are invisible to the naked eye, and researching extensive resource documents.

Mathematics Computer-Based Study

This special study in multi-stage testing, implemented in 2011, investigated the use of adaptive testing principles in the NAEP context. A sample of students was given an online mathematics assessment that adapted to their ability level. All of the items in the study were existing NAEP items.
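As a rough illustration of the multi-stage idea, the sketch below routes a student to an easier or harder second-stage block based on performance on a first-stage routing block. The function name, the three block labels, and the cut points (0.4 and 0.7) are hypothetical choices made for this example; the source does not describe the routing rules used in the NAEP study.

```python
def route_second_stage(routing_correct, routing_total, low_cut=0.4, high_cut=0.7):
    """Choose a second-stage item block from first-stage performance.

    routing_correct / routing_total is the share of routing items answered
    correctly. The cut points are illustrative assumptions only.
    """
    if routing_total <= 0:
        raise ValueError("routing_total must be positive")
    if not 0 <= routing_correct <= routing_total:
        raise ValueError("routing_correct must be between 0 and routing_total")

    share = routing_correct / routing_total
    if share < low_cut:
        return "easy"
    if share < high_cut:
        return "medium"
    return "hard"


# Three hypothetical students who answered 2, 5, and 9 of 10 routing items.
blocks = [route_second_stage(n, 10) for n in (2, 5, 9)]
```

Unlike item-by-item adaptive testing, this kind of design adapts only between blocks, which keeps each block a fixed, reviewable set of items.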

Technology and Engineering Literacy (TEL) Assessment

The TEL assessment framework describes technology and engineering literacy as the capacity to use, understand, and evaluate technology as well as to understand technological principles and strategies needed to develop solutions and achieve goals. The three areas of the assessment are:

Eighth-grade students throughout the nation took the assessment in winter of 2014. Results from this assessment were released in May 2016.

Writing Computer-Based Assessment

In 2011, NAEP transitioned its writing assessment (at grades 8 and 12) from paper and pencil to a computer-based administration in order to measure students' ability to write using a computer. The assessment takes advantage of many features of current digital technology and the tasks are delivered in multimedia formats, such as short videos and audio. Additionally, in an effort to include as many students as possible, the writing computer-based assessment system has embedded within it several universal design features such as text-to-speech, adjustable font size, and electronic spell check. In 2012, NAEP piloted the computer-based assessment for students at grade 4.

Studies using NAEP data

In addition to the assessments, NAEP coordinates a number of related special studies that often involve special data collection processes, secondary analyses of NAEP results, and evaluations of technical procedures.

Achievement gaps

Achievement gaps occur when one group of students outperforms another group and the difference in average scores for the two groups is statistically significant (that is, larger than the margin of error). In initial report releases, NAEP highlights achievement gaps across student groups. NAEP has also released a number of reports and data summaries that focus on achievement gaps. Examples include the School Composition and the Black-White Achievement Gap and the Hispanic-White and the Black-White Achievement Gap Performance. [5] These publications use NAEP scores in mathematics and/or reading for these groups to either provide data summaries or illuminate patterns and changes in these gaps over time. Research reports, like the School Composition and Black-White Achievement Gap, also include caveats and cautions about interpreting the data.
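The definition above can be sketched as a simple two-group comparison. The example assumes the two group means come from independent samples with known standard errors and uses a plain z-test at the conventional 1.96 threshold; the function name and all numbers are hypothetical, and NAEP's own significance procedures may differ.

```python
import math


def gap_is_significant(mean_a, se_a, mean_b, se_b, critical_z=1.96):
    """Return (gap, standard error of the gap, significant?).

    Assumes independent groups, so the variances of the two means add.
    """
    gap = mean_a - mean_b
    se_gap = math.sqrt(se_a ** 2 + se_b ** 2)
    z = gap / se_gap
    return gap, se_gap, abs(z) > critical_z


# Same 5-point difference, measured precisely and imprecisely (made-up values).
precise = gap_is_significant(250.0, 1.0, 245.0, 1.0)
noisy = gap_is_significant(250.0, 3.0, 245.0, 3.0)
```

With small standard errors the 5-point difference counts as a gap; with large ones the same difference falls within the margin of error and would not be reported as one.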

High School Transcript Study (HSTS)

The HSTS explores the relationship between grade 12 NAEP achievement and high school academic careers by surveying the curricula being followed in the nation's high schools and the course-taking patterns of high school students through a collection of transcripts. Recent studies have placed an emphasis on STEM education and how it correlates with student achievement on the NAEP mathematics and science assessments.

NAEP-TIMSS Linking Study

The Trends in International Mathematics and Science Study (TIMSS) is an international assessment by the International Association for the Evaluation of Educational Achievement (IEA) that measures student learning in mathematics and science. NCES initiated the NAEP-TIMSS linking study so that states and selected districts can compare their own students' performance against international benchmarks. The linking study was conducted in 2011 at grade 8 in mathematics and science. NCES will "project" state- and district-level scores on TIMSS in both subjects using data from NAEP.
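One simple way to express a score from one scale on another is linear linking, which matches the mean and standard deviation of the two score distributions. The sketch below shows that general technique only; it is not claimed to be the procedure NCES used, and the function name and all scale parameters are hypothetical.

```python
def linear_link(score, from_mean, from_sd, to_mean, to_sd):
    """Map a score between scales by matching means and standard deviations.

    The score's distance from the source mean, in source standard deviations,
    is preserved on the target scale.
    """
    if from_sd <= 0 or to_sd <= 0:
        raise ValueError("standard deviations must be positive")
    z = (score - from_mean) / from_sd
    return to_mean + z * to_sd


# Made-up scale parameters: a score half a standard deviation above the
# source mean lands half a standard deviation above the target mean.
projected = linear_link(300.0, from_mean=280.0, from_sd=40.0,
                        to_mean=500.0, to_sd=80.0)
```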

National Indian Education Study (NIES)

The NIES is a two-part study designed to describe the condition of education for American Indian/Alaska Native students in the United States. The first part of the study consists of assessment results in mathematics and reading at grades 4 and 8. The second part presents the results of a survey given to American Indian/Alaska Native students, their teachers and their school administrators. The surveys focus on the students' cultural experiences in and out of school.

Mapping State Proficiency Standards

Under the 2001 reauthorization of the Elementary and Secondary Education Act (ESEA) of 1965, states develop their own assessments and set their own proficiency standards to measure student achievement. Each state controls its own assessment programs, including developing its own standards, resulting in great variation among the states in statewide student assessment practices. This variation creates a challenge in understanding the achievement levels of students across the United States. Since 2003, NCES has supported research that compares the proficiency standards of NAEP with those of individual states. State assessments are placed onto a common scale defined by NAEP scores, which allows states' proficiency standards to be compared not only to NAEP, but also to each other. NCES has released the Mapping State Proficiency Standards report using state data for mathematics and reading in 2003, 2005, 2007, 2009, and most recently 2013. [6]
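Placing a state standard on the NAEP scale can be sketched with an equipercentile-style rule: find the NAEP score that the same share of the state's NAEP sample reaches as the share of students meeting the state's own standard. This is an assumed simplification for illustration; the function name and the toy scores are hypothetical, and the published reports describe the actual methodology.

```python
def naep_equivalent(naep_scores, share_meeting_state_standard):
    """Return the NAEP score reached by the given share of the sample.

    naep_scores: NAEP scores for a state's sampled students (toy data here).
    share_meeting_state_standard: fraction (0 to 1] of students the state's
    own test classifies as proficient.
    """
    if not naep_scores:
        raise ValueError("naep_scores must not be empty")
    if not 0 < share_meeting_state_standard <= 1:
        raise ValueError("share must be in (0, 1]")

    ordered = sorted(naep_scores, reverse=True)
    # Number of students who should sit at or above the mapped score.
    k = max(1, round(share_meeting_state_standard * len(ordered)))
    return ordered[k - 1]


scores = list(range(200, 300, 10))           # 200, 210, ..., 290 (toy data)
strict_state = naep_equivalent(scores, 0.3)  # 30% meet the state standard
lenient_state = naep_equivalent(scores, 0.6) # 60% meet the state standard
```

A state in which a larger share of students meets the standard maps to a lower NAEP equivalent score, which is what makes standards comparable across states on the common scale.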

Past studies

Over the years, NCES has conducted a number of other studies related to different aspects of the NAEP program. A few studies from the recent past are listed below:


NAEP's heavy use of statistical hypothesis testing has drawn some criticism related to interpretation of results. For example, the Nation's Report Card reported "Males Outperform Females at all Three Grades in 2005" as a result of science test scores of 100,000 students in each grade. [7] Hyde and Linn criticized this claim, because the mean difference was only 4 out of 300 points, implying a small effect size and heavily overlapped distributions. They argue that "small differences in performance in the NAEP and other studies receive extensive publicity, reinforcing subtle, persistent, biases." [8]
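The point about effect size can be made concrete with a short calculation. The 4-point difference and the group of 100,000 students per grade come from the text above; the standard deviation of 35 points and the even split into two groups of 50,000 are assumptions made purely for illustration, as are the function names.

```python
import math


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def cohens_d(mean_diff, pooled_sd):
    """Standardized mean difference."""
    return mean_diff / pooled_sd


def overlap_coefficient(d):
    """Shared area of two equal-variance normal curves whose means differ by d SDs."""
    return 2.0 * normal_cdf(-abs(d) / 2.0)


def z_statistic(mean_diff, pooled_sd, n_per_group):
    """z statistic for a difference between two equally sized independent groups."""
    standard_error = pooled_sd * math.sqrt(2.0 / n_per_group)
    return mean_diff / standard_error


ASSUMED_SD = 35.0          # hypothetical; not given in the source
ASSUMED_GROUP_SIZE = 50_000  # hypothetical even split of 100,000 students

d = cohens_d(4.0, ASSUMED_SD)
overlap = overlap_coefficient(d)
z = z_statistic(4.0, ASSUMED_SD, ASSUMED_GROUP_SIZE)
```

Under these assumptions the difference is far beyond the usual significance threshold because the sample is so large, while the standardized effect stays small and the two score distributions overlap almost entirely, which is the distinction Hyde and Linn draw.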



References

  1. "Measuring Student Progress Since 1964". National Center for Education Statistics. Retrieved 2011-09-29.
  2. "Frameworks and Specifications". National Center for Education Statistics. Retrieved 2011-09-29.
  3. "Results for public school students in 11 states available for the first time". National Center for Education Statistics. Retrieved 2011-09-29.
  4. "What are the main differences between Long-Term Trend NAEP and Main NAEP?". National Center for Education Statistics. Retrieved 2011-09-29.
  5. "Achievement Gaps". NAEP (homesite). Retrieved 13 April 2013.
  6. "Mapping State Proficiency Standards". National Center for Education Statistics. Retrieved 13 April 2013.
  7. "Male and Female Students Make Gains Since 2000 at Grade 4; Males Outperform Females at all Three Grades in 2005". The Nation's Report Card. U.S. Department of Education. Retrieved 16 September 2012.
  8. Hyde, Janet Shibley; Marcia C. Linn (27 October 2006). "Gender similarities in mathematics and science". Science. 314 (5799): 599–600. doi:10.1126/science.1132154. PMID 17068246. Retrieved 16 September 2012.

Further reading