What is a NAEP Framework?
It is the blueprint for an assessment. In general, NAEP frameworks have been documents drawn up by teams of teachers, school officials, and members of the general public to guide item development and scoring of items in each subject.
NAEP frameworks generally have a ten-year lifespan.
The 1992
The NAEP Mathematics framework, which has been revised several times since 1990 to align with National Council of Teachers of Mathematics (NCTM) standards, is due to be replaced in 2005.
The 2007 NAEP Reading framework is currently being
developed by a national committee of educators supervised by American Institutes
for Research (AIR).
How Do NAEP Frameworks Generate Assessment Results?
NAEP reports results for states and the nation as scaled scores, percentiles, and in relation to Achievement Levels. NAEP also reports subscale scores corresponding to Reading contexts and Mathematics strands described in their respective frameworks:
Reading Contexts
| Reading for Literary Experience |
| Reading to Gain Information |
| Reading to Perform a Task |
Mathematics Strands
| Number Sense, Properties, and Operations |
| Measurement |
| Geometry and Spatial Sense |
| Data Analysis, Statistics, and Probability |
| Algebra and Functions |
How
Do NAEP Frameworks Compare with
NAEP Reading subscales are based upon items sorted by passage type using the contexts specified by the NAEP Reading framework; MEA categories of items are structured differently because they are based upon Learning Results standards
In addition, NAEP Reading and Mathematics surveys use sets of sub-categories that are not reflected in the scaled scores. The NAEP Reading framework also specifies four stances; NAEP Mathematics, two domains:
Reading Stances
| Initial Understanding |
| Developing Interpretation |
| Personal Response |
| Critical Sense |
Mathematics Domains
| Mathematical Abilities |
| Mathematical Power |
These sub-categories are used to classify and distribute the items in an assessment for a given year. There appears to be a correspondence between NAEP Mathematics strands and the proposed MEA Mathematics clusters; the distribution of categories of items in both assessments is roughly similar if counted at the strand/cluster level.
The distributions of multiple choice and constructed response items in NAEP and MEA assessments differ in that NAEP uses more constructed response and MEA uses more multiple-choice items overall, but this effect is offset by assigning different values to the different item types in MEA scoring.
Since the NAEP and MEA Reading blueprints differ so greatly, direct comparison of results cannot really be made except in respect to NAEP Achievement Levels, which represent the application of standards to the NAEP scaled scores.
NAEP
Achievement Levels
In order to interpret the scaled scores in terms of student
proficiency in each subject, the National Assessment Governing Board (NAGB) has
developed Achievement Levels of Basic, Proficient,
and Advanced in mathematics, reading, history, geography, science,
writing, and civics since 1990. These
are more easily compared with performance levels in
While MEA scores are directly connected to standards set by
The percentage of
Using
NAEP Item Maps
The NAEP Achievement Levels have another implication for state assessments as a result of NAEP’s development of item maps. The Item Response Theory (IRT) statistical model that NAEP uses to develop scaled scores can be used to predict student performance on specific items. Each year NAEP releases 25% of the items used in that year’s assessment. IRT modeling can place these items on the achievement scale to show how the ability to answer a specific type of question correctly would relate to the overall achievement of students participating in the assessment.
For example, a 4th grade student performing at the Basic level overall on the Mathematics assessment would probably be able to answer the following question correctly:
“Determine how much change a person will get back from a purchase.”
But a student also answering the following question correctly has a high probability of achieving the Proficient level in 4th grade Mathematics:
“Apply the concept of symmetry to visualize the result of folding a marked strip of paper.”
This suggests that specific MEA and NAEP items might be
compared in content to make connections between the two assessments.
Released NAEP items can be found online.
Log on to the
Comparing
Scores
NAEP and MEA scoring scales are not as similar as they may
seem. MEA scores are statistically
computed from individual students’ raw scores, while NAEP scores are derived
from abstract statistical values called thetas.
Both assessments use IRT statistics, but each uses a different method and uses its results for different purposes. MEA uses IRT for equating, which is a way to ensure that students taking different tests in different years in the same subject at the same level receive comparable scores. MEA does not use IRT to create the scores themselves; NAEP does—because the Nation’s Report Card is building a projection of student performance at the state and national level.
MEA results have implications for instructional practices at the school level, while NAEP results do not. NAEP does not report scores at the school level because of the sampling methods it uses. Each student participating in a NAEP assessment sees only a small portion of the items for that subject and level.
Schools and students are selected by NCES across the state
to provide complete coverage of the assessment items for a subject and level by
a student population representative of the entire state. The same kind of
sampling of students and items is done to generate a prediction of student
performance for the nation as a whole. It
is unlikely that the group of students selected for a single school in
Both are valuable information resources for
NAEP State Coordinator for
All current NAEP
frameworks may be viewed on- line at the NCES website.