ACHIEVEMENT TESTS
- Defining Achievement Tests and Their Purpose
- Categorization of Achievement Instruments
- Major General Achievement Batteries: The Metropolitan Achievement Tests
- Major General Achievement Batteries: The Sequential Tests of Educational Progress (STEP)
- Overview of Other Standardized General Batteries
- Achievement Testing at Secondary and Collegiate Levels
- Specialized and Content-Area Achievement Tests
- Specific Achievement Testing in STEM and Humanities Subjects
Defining Achievement Tests and Their Purpose
Achievement tests are standardized psychological and educational instruments specifically designed to quantify the knowledge, skills, or proficiency attained by an individual following formal instruction or training in a particular subject area. Unlike aptitude tests, which aim to predict future potential or innate ability, achievement tests are fundamentally retrospective; they measure what has already been learned or mastered within a structured educational context. These measures are critical tools within educational systems globally, providing objective data used for a multitude of purposes, including student placement, evaluation of instructional efficacy, curriculum planning, and the diagnosis of specific learning difficulties. The scope of these tests ranges dramatically, covering foundational literacy and numeracy skills in early education through to highly specialized professional competencies required in advanced fields.
The core objective underlying the deployment of achievement tests is to provide objective, measurable evidence of educational attainment. This objectivity is achieved primarily through rigorous standardization, wherein the test administration, scoring, and interpretation procedures are maintained uniformly for all examinees. Historically, the development of these standardized instruments marked a significant advance over subjective teacher evaluations, allowing educators and policymakers to assess educational outcomes on a much broader, comparable scale. The reliability and validity of these instruments are paramount, necessitating meticulous development processes involving large, representative norming samples to ensure that scores accurately reflect the intended construct and allow for meaningful comparisons across diverse student populations.
Categorization of Achievement Instruments
Achievement tests are broadly classified into two major categories based on their scope and application in educational assessment. The first category comprises general batteries, which are comprehensive, multi-subject instruments designed to cover major academic areas expected to be taught across several grade levels. These batteries provide a holistic profile of a student’s academic standing across core subjects like reading, mathematics, and language arts, making them suitable for large-scale assessment programs or longitudinal tracking of student progress. The second category encompasses special instruments, which are tailored to assess proficiency in narrow content domains or serve highly specific diagnostic or readiness functions.
The specialized instruments serve distinct functions and include several critical subtypes. Readiness tests assess the prerequisite skills necessary for success in a specific upcoming course or grade level, commonly used before formal schooling begins. Diagnostic tests delve deeply into specific skills to pinpoint the exact nature of a student’s difficulties, such as identifying specific deficits in algebraic thinking or phonological awareness. Furthermore, this category includes highly focused content-area tests (e.g., tests in chemistry or European history), vocational achievement tests used in technical training settings, and specialized assessments for the professions, ensuring candidates meet competency standards prior to licensure. It is important to note the distinction: industrial achievement tests, often referred to as trade tests, are generally discussed separately under the domain of personnel tests due to their specific application in workforce selection and evaluation.
Major General Achievement Batteries: The Metropolitan Achievement Tests
The Metropolitan Achievement Tests (MAT) constitute one of the most widely recognized and extensively administered series of general achievement batteries in the United States. This series is systematically structured into five distinct batteries, each typically available in three or four equivalent forms to facilitate secure administration and retesting. The MAT batteries are designed to cover a wide spectrum of educational attainment, spanning from the foundational elementary grades (Grade 1) through to the junior high level (Grades 7-9). The comprehensive nature of the MAT ensures that key academic areas are thoroughly evaluated, providing a detailed and normed snapshot of student achievement across various educational domains considered essential for academic success.
The specific content areas rigorously assessed by the MAT are multifaceted. They invariably include measures of word knowledge and word discrimination, comprehensive reading assessment, and a detailed evaluation of arithmetic skills, which is often segmented into concepts and skills, rigorous problem solving, and computational abilities. Additionally, the MAT evaluates proficiency in spelling and various complex aspects of language usage, including grammar, pronunciation, capitalization, identification of parts of speech, and understanding of different kinds of sentences. Beyond these linguistic and numerical subjects, the tests often incorporate sections dedicated to language-studies skills, such as the effective use of dictionaries and other reference materials, and social-studies skills, which typically involve interpreting complex data presented in maps, tables, and graphs. Science knowledge is also assessed within the battery. The results are meticulously processed, resulting in individual score profiles that are frequently converted into grade equivalents based on extensive norms derived from a representative sample of the public school population, allowing educators to benchmark performance against national standards.
Major General Achievement Batteries: The Sequential Tests of Educational Progress (STEP)
The Sequential Tests of Educational Progress (STEP) represent another pivotal instrument in the landscape of general achievement testing. Developed through collaboration between leading educators and specialists from the Educational Testing Service, STEP places significant emphasis on the application of learned knowledge to solve new, realistic problems, rather than merely assessing the recall of specific factual information. This focus on applied learning distinguishes STEP and makes it highly relevant for assessing higher-order cognitive skills. The STEP tests are systematically structured across four distinct levels, accommodating students from grades 4 through 6, 7 through 9, 10 through 12, and extending into the collegiate freshman and sophomore years (grades 13-14).
Seven core tests are made available at each of these four levels, ensuring continuity and direct comparability of scores across developmental stages. These core areas include Reading, Writing, Mathematics, Science, Social Studies, Language Comprehension, and Essay Writing. All tests, with the exception of the Essay Writing component, utilize a multiple-choice format. A defining characteristic of the STEP series is its special commitment to assessing communication skills. This is evident not only in the traditional reading and writing sections but also through dedicated assessments like the listening and comprehension test, as well as the writing tests in which subjects must indicate how they would try to improve actual writing specimens, thereby demonstrating practical editing and revision skills. Even though these tests are timed, similar to the Metropolitan Tests, the STEP battery is fundamentally categorized as a power test, prioritizing the depth, complexity, and thoughtful application of knowledge over mere speed of response.
The scoring methodology employed by STEP further distinguishes it from other general batteries. Performance on any individual STEP test is articulated using a single, unified scale that is applicable across all grades covered by the series. This sophisticated scaling facilitates robust longitudinal tracking of student growth across their academic careers. All scores can be reliably transformed into percentiles, and a detailed profile is constructed for each student. This profile vividly illustrates the student’s relative standing on the different tests, offering a comprehensive view of their relative strengths and weaknesses in applying educational content.
Overview of Other Standardized General Batteries
The field of general achievement testing boasts several other widely used batteries, each tailored to specific grade ranges and academic emphasis, complementing the MAT and STEP instruments. The Stanford Achievement Test, a battery with a long history of use, covers grades 1 through 9, providing reliable assessments in core subjects including arithmetic, reading, science, study skills, and social studies. Similarly, the Iowa Test of Basic Skills (ITBS), which targets grades 3 through 9, focuses heavily on essential foundational academic competencies. ITBS offers specific tests of vocabulary, comprehensive reading skills, language mechanics, critical arithmetical skills, and word-study skills, providing educators with precise data on fundamental learning proficiencies critical for future academic success.
Further contributing to the array of general measures is the California Achievement Test (CAT), which covers an exceptionally broad range from early elementary (Grade 1) up to the early college level (Grade 14). The CAT rigorously assesses crucial components such as reading vocabulary and comprehension, arithmetic fundamentals and reasoning, and the mechanics of English coupled with spelling proficiency. The SRA Achievement Series also offers comprehensive forms for grades 2 through 9, specializing in reading, language perception, language arts, arithmetic, and work-study skills. These batteries collectively ensure that educators have access to versatile and robust instruments for evaluating generalized academic performance across all major disciplines. Other noteworthy general batteries include the Tests of Academic Promise and the recently developed Fundamental Achievement Series.
Achievement Testing at Secondary and Collegiate Levels
As students progress, achievement assessment adapts to the greater subject depth and critical thinking required at the secondary and tertiary levels. Batteries designed for the high school level, such as the Essential High School Content Batteries and the Evaluation and Adjustment Series, shift their focus toward advanced content mastery. These tests emphasize specific sciences, rigorous English language and literature analysis, and complex social studies concepts. Crucially, they also measure the student’s ability to apply advanced thought processes and utilize intricate data when dealing with problematic academic situations, demanding skills beyond the basic recall expected at lower grade levels.
The Iowa Tests of Educational Development (ITED), specifically designed for high school cohorts, is another key instrument that assesses a student’s ability to interpret and apply knowledge across various content domains relevant to success in higher education and adult life. For students transitioning from secondary to tertiary education, the Cooperative General Achievement Tests are frequently employed for high school seniors and college freshmen to gauge readiness across core liberal arts subjects. Moving further into the college environment, the Cooperative General Culture Test has been meticulously constructed for college use, particularly targeting the sophomore level, to assess broad knowledge in areas like history, social studies, literature, and fine arts.
A particularly specialized instrument is the Adult Basic Learning Examination (ABLE). This test is unique in that it is specifically designed for undereducated adults and focuses on practical, functional problems in vocabulary, reading, spelling, and arithmetic. The primary purpose of ABLE is to accurately assess an adult’s foundational ability to successfully participate in adult-education classes, vocational training programs, or specialized programs conducted by penal institutions and special agencies such as the Job Corps, thereby facilitating appropriate placement and intervention strategies for lifelong learning.
Specialized and Content-Area Achievement Tests
In addition to general batteries, a vast number of highly specialized achievement tests have been devised to measure proficiency in single subjects or discrete skills, invaluable for targeted remediation and precise curriculum evaluation. One such instrument is the Wide Range Achievement Test (WRAT), which was revised in 1965. The WRAT is an individually administered test used primarily for diagnostic, remedial, and vocational assessment purposes. It provides a quick and reliable assessment of skill level in three fundamental areas: oral reading, spelling, and computation. A key strength of the WRAT is its versatility, allowing the examiner to adjust the testing range dynamically to suit various educational levels, spanning from kindergarten readiness up to college proficiency.
In the essential area of literacy, specialized reading tests provide detailed metrics on comprehension and speed. The Davis Reading Test stands out as a significant measure of reading ability, providing a continuous assessment of both the level and the speed of comprehension for students ranging from grades eight to eleven and grades eleven to thirteen. More recently, the widely respected Gates Reading Tests have been replaced by the newly revised Gates-MacGinitie Reading Tests. This updated series is grounded in rigorous standardization, utilizing a large, representative sample of approximately 40,000 pupils across 38 communities, ensuring its norms are highly accurate and reflective of contemporary student populations. These specialized instruments allow educators to precisely identify reading difficulties and monitor progress following remedial instruction.
Specific Achievement Testing in STEM and Humanities Subjects
The development of content-specific achievement tests is particularly crucial in subjects requiring deep, specialized knowledge, such as Science, Mathematics, and History. In the sciences, several instruments are available to assess high school and collegiate mastery. For example, the Biology: BSCS Final Examination is specifically tailored for use at the conclusion of a high school course utilizing the Biological Sciences Curriculum Study (BSCS) framework. Furthermore, the Processes of Science Test is designed not just to test factual recall but to measure the student’s deeper understanding of core scientific principles, methods of inquiry, and logical reasoning inherent in the scientific method. Other general science assessments include the comprehensive Cooperative Science Tests and the Test of Scientific Knowledge, which measures a student’s foundational background in general science through questions covering factual information and underlying scientific principles.
A comprehensive list of other widely utilized achievement tests in specialized subjects demonstrates the depth of measurement available to educators:
-
Mathematics Assessment Instruments: These include the Blyth Second-Year Algebra Test, the Lankton First-Year Algebra Test, the Contemporary Mathematics Test, the Cooperative Mathematics Tests, the Modern Math Understanding Test, and the Stanford Modern Mathematics Concepts Test, alongside the Wisconsin Contemporary Test of Elementary Mathematics. These tests cover both traditional and modern mathematical curricula.
-
Social Studies and History Tests: Proficiency in historical knowledge and social studies concepts can be reliably evaluated using instruments such as the Crary American History Test and the Cumming’s World History Test.
-
Physics and Chemistry Tests: Specialized science knowledge is rigorously tested by instruments like the Anderson-Fiske Chemistry Test and the Dunning-Abeles Physics Test. The Nelson Biology Test is also available for life science assessment.
-
Foreign Language Proficiency: Assessing communicative competence and grammatical mastery in languages is accomplished through tests such as the MLA Cooperative Foreign Language Tests and the Pimsleur Modern Foreign Language Proficiency Tests (available for French, German, and Spanish).