Beware of the Standards, Not Just the Tests (Opinion)

Save to favorites
Print

Copy URL

Alfie Kohn

Alfie Kohn鈥檚 most recent books are The 69传媒 Our Children Deserve: Moving Beyond Traditional Classrooms and 鈥淭ougher Standards鈥� (Houghton Mifflin, 1999) and The Case Against Standardized Testing (Heinemann, 2000). Requests for permission to photocopy or reprint this essay can be made by contacting the author directly. He lives in Belmont, Mass., and can be reached at

A number of prominent educators are finally raising their voices against standardized testing鈥攑articularly multiple-choice, norm-referenced tests; particularly tests with 鈥渉igh stakes鈥� (read: bribes and threats) attached; and particularly in the context of a federal mandate to force every state to test every student in grades 3-8 every year. Yet even as more opinion leaders come to understand the damage attributable to testing mania, it is still rare to hear objections to the standards movement as a whole.

Learning is threatened by specific, measurable uniform mandates.

The Learning First Alliance, a coalition of leading education groups, cautiously raised concerns about the tests not long ago, but mostly out of fear that the burgeoning grassroots opposition might bring down the state standards, too. Education Week鈥榮 2001 edition of Quality Counts likewise worried that tests 鈥渁re overshadowing鈥� and 鈥渄o not adequately reflect鈥� the standards. Major conferences carry titles such as 鈥淪tandards: From Theory to Practice鈥� and 鈥淲ill Standards Survive the Classroom?鈥� (You will look in vain for conferences called 鈥淲ill Classrooms Survive the Standards?鈥� or 鈥淪tandards: From Capitulation to Resistance.鈥�)

A list of boat-rocking books on the subject begins and pretty much ends with Susan Ohanian鈥檚 One Size Fits Few and Deborah Meier鈥檚 Will Standards Save Public Education? Alarms have been quietly raised by Nel Noddings, Elliot Eisner, James Beane, and a few other eminent educators in the pages of Phi Delta Kappan. Otherwise, the field seems to have closed ranks around the idea that it is permissible to criticize the tests, but not the standards. Indeed, test opponents are sternly reminded to avoid confusing the two, as though they were in fact unrelated. I want to argue not only that they are inextricably connected鈥攖he tests serving, at least in theory, as the enforcement mechanism of the standards鈥攂ut also that the latter may be every bit as problematic as the former.

Of course, it鈥檚 reasonable to ask just what kind of standards are at issue here. The most relevant and widely accepted distinction is between outcome and content. Outcome standards specify how well students must do. At the highest level of generality (鈥淲e support high standards鈥�), the notion is unobjectionable but not terribly useful. When translated into specifics, it comes to mean cut scores on standardized tests and becomes downright dangerous. Outcome standards to a remarkable extent are based on confusing harder with better, an error I have already discussed in these pages and need not belabor. (鈥淐onfusing Harder With Better,鈥� Commentary, Sept. 15, 1999.)

Content standards, by contrast, specify what students will be taught. Rather than declaring that all such standards are bad鈥攐r, as is far more common, accepting all such standards uncritically鈥擨 propose that we judge a given set of standards or frameworks according to four criteria:

How specific? There are many reasons policymakers seek to impose detailed curriculum mandates. They may fundamentally distrust educators: Much of the current standards movement is just the latest episode in a long, sorry history of trying to create a teacher-proof curriculum. Alternatively, they may simply assume that more specificity is always preferable. In reality, just because it makes sense to explain to a waiter exactly how I鈥檇 like my burger cooked doesn鈥檛 mean it鈥檚 better to declare that students will study the perimeter of polygons (along with scores of other particular topics) than it is to offer broad guidelines for helping students learn to think like mathematicians.

The latter sort of standards, supported by practical guidance, can help students reason carefully, communicate clearly, and get a kick out of doing so. But long lists of facts and skills that teachers must cover may have the opposite effect. Thus, when Harold Howe II, the U.S. commissioner of education under President Johnson, was asked what a set of national standards should be like (if we had to adopt them), he summarized a lifetime of wisdom in four words: they should be 鈥渁s vague as possible.鈥�

His caution applies to state standards as well. On the one hand, thinking is messy, and deep thinking is very messy. On the other hand, standards documents are nothing if not orderly. Keep that contrast in mind and you will not be surprised to see how much damage those documents can do in real classrooms.

When the U.S. commissioner of education under President Johnson was asked what a set of national standards should be like, he summarized a lifetime of wisdom in four words: they should be "as vague as possible."

Considerable research has demonstrated the importance of making sure students are actively involved in designing their own learning, invited to play a role in formulating questions, creating projects, and so on. But the more comprehensive and detailed a list of standards, the more students (and even teachers) are excluded from this process, the more alienated they tend to become, and the more teaching becomes a race to cover a huge amount of material. Thus, meeting these kinds of standards may actually have the effect of dumbing down classrooms. As Howard Gardner and his colleagues wisely observed, 鈥淭he greatest enemy of understanding is 鈥榗overage.鈥欌€�

Some insist that these lists of facts and skills don鈥檛 prescribe how students will be taught; the standards are said to be neutral with respect to pedagogy. But this is nonsense. If the goal is to cover material (rather than, say, to discover ideas), that unavoidably informs the methods that will be used. Techniques such as repetitive drill-and-practice are privileged by curriculum frameworks based on a 鈥渂unch o鈥� facts鈥� approach to education. Of course, that kind of teaching is also driven by an imperative to prepare students for tests, but no less by an imperative to conform to specific standards.

Some people sincerely believe that to teach well is to work one鈥檚 way through a list of what someone decided every nth grader ought to know. But the question is not just what you or I think about that model; the question is whether that model should enjoy virtual monopoly status in American public schools. In effect, one particular, very debatable philosophy of education has been enshrined in state standards documents and has become the law of the land.

How quantifiable? The current accountability fad insists on mandates that are not only overly detailed but chosen according to whether they lend themselves to easy measurement. It鈥檚 not just that the tests are supposed to be tied to the standards; it鈥檚 that the standards have been selected on the basis of their testability. The phrase 鈥渟pecific, measurable standards鈥� suggests a commitment not to excellence but to behaviorism. It is telling that this phrase is heard most often from corporate officials and politicians, not from leading educational theorists or cognitive scientists.

We are talking about a worldview in which any aspect of learning, or life, that resists being reduced to numbers is regarded as vaguely suspicious. By contrast, anything that appears in numerical form seems reassuringly scientific; if the numbers are getting larger over time, we must be making progress. Concepts like intrinsic motivation and intellectual exploration are difficult for some minds to grasp, whereas test scores, like sales figures or votes, can be calculated and charted and used to define success and failure.

Unfortunately, meaningful learning does not always proceed along a single dimension, such that we can nail down the extent of improvement. As Linda McNeil of Rice University has observed, 鈥淢easurable outcomes may be the least significant results of learning.鈥� (That sentence ought to be printed out in 36-point Helvetica, framed, and tacked to the wall of every school administrator鈥檚 office in the country.) To talk about what happens in schools as moving forward or backward in specifiable degrees is not only simplistic, in the sense that it fails to capture what is actually going on; it is destructive, because it can change what is going on for the worse.

鈥楳easurable outcomes may be the least significant results of learning.鈥�

Consider a comment made by Sandra Stotsky, the deputy commissioner of education in Massachusetts: 鈥�Explore isn鈥檛 a word that can be put into a standard because it can鈥檛 be assessed.鈥� This assertion is obviously false because there are plenty of ways to assess the quality of students鈥� exploration鈥攗nless, of course, 鈥渁ssessment鈥� is equated with standardized testing. But suppose for the moment that Ms. Stotsky was correct. What if we really were faced with a trade-off between an emphasis on exploration in the classroom and the demands of measurement? Most thoughtful educators would unhesitatingly choose the former, whereas those who write and enforce state standards often opt for the latter. Clearly, it is much easier to quantify the number of times a semicolon has been used correctly in an essay than it is to quantify how well the student has explored ideas in that essay. Thus, the more emphasis that is placed on picking standards that are measurable, the less ambitious the teaching will become.

How uniform? We have heard the phrase 鈥渟tandardized testing鈥� so often that we may have become inured to the significance of that first word. To what extent do we really want our students to receive a standardized education? At a national conference last fall, a consultant announced with apparent satisfaction that now, thanks to standards-based reform, 鈥渇or the first time in my experience, people on a grade level, in a subject area, or teaching a course at a high school are [feeling] a responsibility to all have the same destination.鈥� That she did not even feel it necessary to defend this goal says something about the current acceptance of a one-size-fits-all model of education.

Once again, the problem is not just with the construction of the tests, but with the uniformity of the standards. Wanting to make sure that students in low-income communities don鈥檛 receive a second-rate education is a laudable objective. Wanting to make sure that all students in your state receive the same education, such that they are treated as interchangeable recipients of knowledge, is a very different matter. Even more troubling are grade-by-grade standards. Here, the prescribers are not just saying, 鈥淲e expect students to know the following stuff by the time they鈥檙e in 8th grade,鈥� but 鈥淲e expect them to learn all the items on this list in 5th grade, all the items on that list in 6th grade,鈥� and so on. Apart from the negative effects on learning, this rigidity about both the timing of the instruction and its content creates failures unnecessarily by trying to force all children to learn at the same pace.

Guidelines or mandates? There are standards offered as guidelines (鈥淪ee if this way of thinking about teaching can help you improve your craft鈥�), and then there are standards presented as mandates (鈥淭each this or else鈥�). Virtually all the states have chosen the latter course. The effect has been not only to control teachers, but to usurp the long-established power of local school districts to chart their own course. If there has ever been a more profoundly undemocratic school reform movement in U.S. educational history than what is currently taking place in the name of standards, I haven鈥檛 heard of it.

The effect of mandated standards has been not only to control teachers, but to usurp the power of local school districts to chart their own course.

Bullying reaches its apotheosis with high-stakes testing, the use of crude rewards and punishments to make people ratchet up the scores. The underlying logic is captured by an ironic sign spotted on a classroom wall: 鈥淭he beatings will continue until morale improves.鈥� But the standards themselves, if handed down as requirements, embody that same determination on the part of policymakers to do things to educators and students rather than to work with them. My nominee for the most chillingly Orwellian word now in widespread use is 鈥渁lignment"鈥攁s in, How can we make teachers 鈥渁lign鈥� their teaching to the state standards? A remarkable number of people, including some critics of high-stakes testing, have casually accepted this sort of talk despite the fact that it is an appeal to naked power. 鈥淎lignment鈥� isn鈥檛 about improvement; it鈥檚 about conformity.

Standards-as-mandates also imply a rather insulting view of educators鈥攏amely, that they need to be told what (and, by extension, how) to teach by someone in authority because otherwise they wouldn鈥檛 know. While plenty of teachers need help, virtually everyone is likely to resist having the state try to micromanage his or her classroom. Some will do their best to ignore the standards, while others will comply resentfully. Either way, the use of control leads to poor implementation of the standards (which, come to think of it, may not be such a bad thing). Others, including some of our best educators, will throw up their hands in disgust and find another career.

Based on these four criteria, the standards promulgated by disciplinary groups (the councils of teachers of mathematics and English, for example) come out considerably better than the standards issued by states. This doesn鈥檛 preclude our objecting to certain aspects of the former documents, of course, nor does it mean that all state standards are equally bad, as a side-by-side comparison of, say, Minnesota鈥檚 Profile of Learning and Virginia鈥檚 Standards of Learning will confirm.

Currently, however, there is considerable pressure to implement the kind of standards that I am suggesting are the worst. Chester E. Finn Jr. and his colleagues want states to spell out 鈥渨hich books children should read in English class, which individuals and events to study in history, and so on"; any other standards are simply 鈥渇luff.鈥� Pro-standards groups such as Achieve Inc. (a group of corporate officials and politicians) tend to give poor ratings to states whose standards aren鈥檛 sufficiently specific, measurable, uniform, or compulsory.

When the standards and tests fit together perfectly to create an airtight system of top-down, uniform, "bunch o' facts" schooling鈥攚ell, that's when we're really in trouble.

The difference between these evaluations and a report by, say, the National Rifle Association that assigns low grades to legislators who are not sufficiently pro-gun is that in the latter case everyone realizes the ratings reflect a specific and very debatable point of view. By contrast, those who mark down a state for granting too much autonomy to local school boards, or for having standards that wouldn鈥檛 satisfy behaviorists, would like us to accept this as an objective evaluation. (One could make a case that states given an A in annual evaluations really deserve an F, and vice versa鈥攐r that a state should commission a review of its standards and testing policy by one of these groups and then do precisely the opposite of what is recommended.)

An important side note here: There has been some grumbling lately about the use of off-the-shelf tests that are unrelated to the state鈥檚 standards documents鈥攆or example, in California. From a psychometric perspective, this practice doesn鈥檛 make much sense. From a pedagogical perspective, though, the only thing worse than tests that aren鈥檛 aligned to the standards are tests that are aligned to the standards. The former is silly because it is inefficient, while the latter is dangerous precisely because it is efficient ... at accomplishing a dubious goal. Not only politicians, but also some assessment experts sometimes forget that doing something well is not the same as doing something that鈥檚 worthwhile. When the standards and tests fit together perfectly to create an airtight system of top-down, uniform, 鈥渂unch o鈥� facts鈥� schooling鈥攚ell, that鈥檚 when we鈥檙e really in trouble.

The tests arguably constitute the most serious and immediate threat to good teaching, such that freeing educators and students from their yoke should be our top priority. But we should not limit our critique to the testing, which is, after all, one manifestation of a larger, and seriously wrong-headed, approach to pedagogy and school reform.

I am not troubled by those who disagree with my criteria or who like a given set of standards more than I do. In fact, I welcome such challenges. What troubles me is the rarity of such discussion, the absence of questioning, the tendency to offer instruction about how to teach to the standards before we have even asked whether doing so is a sound idea.

A version of this article appeared in the September 26, 2001 edition of Education Week as Beware of the Standards, Not Just the Tests