iPhone app iPad app Android phone app Android tablet app More

Featuring fresh takes and real-time analysis from HuffPost's signature lineup of contributors
Todd Farley

GET UPDATES FROM Todd Farley
 

Standardized Testing: A Decade in Review

Posted: 04/08/11 06:37 PM ET

There are those who think American public education is in shambles and needs to be completely remade, arguing that the standardized tests mandated by No Child Left Behind will be a vital tool in turning around U.S. schools: They say student promotion and graduation, teacher pay and employment, and school funding should all be tied to NCLB tests. As someone who spent many years working in the assessment industry -- not to mention someone who has been reading the newspaper for the last ten years -- I can't say I understand that idea. What exactly has occurred in standardized testing over the last decade that justifies such belief in large-scale assessments, or such blind faith in the completely unregulated, massively profitable industry that writes and scores NCLB tests?

We know that testing data can be manipulated to tell any story. We know that a school administration -- by making test questions easier or lowering cut scores -- can portray improvement in its classrooms even when such improvement doesn't really exist, as happened most recently in 2009 in the New York City schools. We know that "rogue" teachers or administrators -- by erasing incorrect student answers and changing them to correct ones -- can show student achievement even if there is no such achievement, as scandals in Atlanta and Detroit during 2010 both revealed and the current erasure debacle in Washington, D.C. also seems to show. And we know, from my book, that the testing companies fudge numbers all the time, whether reliability numbers (to show the industry is doing a more "standardized" job than it really is); validity numbers (to show the industry is doing a more accurate job than it really is); or score distribution numbers (when test scoring companies work to ensure student results match the predictions of their own psychometricians).

Psychometricians, of course, are the rock stars of the testing world, omniscient statisticians doing a job virtually no one comprehends. While I don't claim to understand their mysterious math, I do find it odd that during my long career writing and scoring tests I only once laid eyes on a psychometrician, and that was during a pick-up soccer game on the grassy grounds of ETS. Never when I wrote tests, or scored tests, or met with teachers to discuss those tests, did I see a psychometrician, meaning the most important people in the testing industry are people who don't often know what the tests look like and don't usually see the students' answers to them.

On top of what seems a general dubiousness about the numbers produced by standardized tests, we also know the testing industry regularly screws up. In the last decade or so, scoring errors have occurred on tests returned to students in Arizona (1999-2000), Washington (2000), Virginia (2005), Florida (2006), South Carolina (2008), and Minnesota (2010), not to mention Indiana, Illinois, Connecticut, etc. In 2000, a scoring error by NCS-Pearson (now Pearson Educational Measurement) led to 8,000 Minnesota students being told they failed a state math test when they did not, in fact, fail it (some of those students weren't able to graduate from high school on time because of it). In 2004, ETS erroneously informed over 4,000 teachers they had failed a PRAXIS exam that they had actually passed, leading to lost jobs and lawsuits aplenty. In 2006, Pearson again erred, giving lower scores than were deserved to more than 4,000 students taking the SAT, with the company making the excuse (apparently with a straight face) that their blunder resulted due to "abnormally high moisture content" in that year's score sheets. Keep in mind, also, that most of those errors were discovered only after a test-taker complained about a score, not when any company voluntarily disclosed the problem, raising questions about the legitimacy of every other test administered over the last 10 years.

Even without errors as obvious as the ones above, in my career, there seemed to be a major disconnect between the profit motive of the testing industry's major players (Pearson Education, McGraw-Hill, Riverside Publishing, ETS K-12, DRC) and any altruistic goals for American education. For the many years I scored students tests, I saw an industry primarily focused on meeting deadlines and completing contracts, with the importance of the correct scores being put on tests seeming to come in second to the rush to get any score put on them. My work in test development was no different, with the companies who employed me apparently willing to take huge shortcuts in developing tests because meeting a contract's deadline was clearly more important than the quality of any assessment.

Last year, I was amazed to see the management of a publishing company giving its test developers only four weeks to produce K-12 assessments for the Detroit Public Schools (a school system now bankrupt, but then, willing to pay millions to a testing company); later, however, that short time-frame looked like a leisurely vacation compared to breakneck pace the company next worked its employees at, when the test development staff was required to pound out more than 200 Common Core Standard tests over the next two months. Two hundred tests is probably more than a not-for-profit like ETS has developed in its entire history, but in a rush to address the new CCS market and get their hands on "Race to the Top" money that company worked its employees nearly to exhaustion and seemed willing to go to any length to write those tests: They recycled items used many times on previous tests, re-aligned items to link them to academic standards they were only sort of linked to, hired people with neither teaching nor testing experience to work as full-time test developers, employed any consultant off the streets willing to work, and re-hired testing vendors previously fired for the poor quality of their work (one of those vendors celebrated its renewed contract by immediately advertising on Craigslist, hoping to find anyone at all willing to write test questions for $8 each).

It's not like questions about the efficacy of the testing industry haven't long been raised, since even before the dramatic increase in testing that has recently resulted from NCLB. In 2001, a New York Times story about testing errors quoted various employees at a test-scoring factory in Iowa City who doubted the quality of the work being done: "There was a lack of personnel, a lack of time, too many projects, too few people," one said, while another noted the surfeit of work she faced meant she was concerned about "[her] ability, and the ability of the scorers, to continue making sound decisions and keeping the best interest of the students in mind."

In 2002, Amy Weivoda raised similar concerns at Salon.com, noting her experience scoring tests "led [her] to believe with absolute certainty that standardized tests are an utter waste of money and valuable teaching time, and that they measure nothing more than a state government's willingness to waste money." She pointed out that some of her test-scoring colleagues "were so spectacularly sociopathic they could find no other work. Some of the scorers had earned their degrees in prison." In 2009, a test scorer in Jacksonville, Florida wrote a two-act play about his career, a drama he said highlights the "silliness" of testing.

In 2010, Dan DiMaggio cited many of the same issues in The Monthly Review, writing of test scoring being standardized only in its "mystifying training process, supervisors who are often more confused than the scorers themselves, and a pervasive inability of these tests to foster creativity and competent writing." That dismay with the test scoring process was found again in a 2011 article in the Minneapolis City Pages, a story that concludes with one of the scorers commenting that the limitations of standardized testing were obvious to all: "Nobody is saying, 'I'm doing good work, I'm helping society,'" she says. "Everyone is saying, 'This isn't right.'"

The City Pages story ends with a quote from a Pearson Education spokesman, in which the company man notes the complaining scorers were "people who have a very limited exposure and narrow point of view on what is truly a science." Lest anyone buy too heavily into the "science" of standardized testing, it's telling to note that 2009 audits performed by the United States Department of Education of tests in Tennessee and Florida found identical problems to those the scorers detailed (not to mention other problems as well). Even if one concedes that all the complaints above are about test-scoring and not test development, it's important to remember that the open-ended questions on tests that are scored by humans seem to be the sort of "next generation" assessments the Obama administration is moving towards.

With even the President recently deriding the emphasis on "filling in bubbles" that results from multiple-choice tests, the current education reform agenda instead seems to be aimed at tests that address critical thinking skills, including "students' ability to read complex text, complete research projects, excel at classroom speaking and listening assignments, and work with digital media." Impressive jargon indeed, but every one of those tasks needs to be assessed by a living, breathing human being, and there's more than enough evidence that the living, breathing human beings currently doing that job either don't do it very well or don't think it can be done very well.

If we recognize from the evidence above that the testing industry seems ill-prepared to score all the student responses filled with incredibly complex thinking that the "next generation" of tests will surely generate, I can imagine only two other ways those students answers will be assessed. Either classroom teachers will be hired to score the answers to those tests, or the student responses will be "read" and scored by the new automated scoring technologies powered by artificial intelligence. The problem with teachers scoring the tests, of course, is that the education reformers believe this country's teachers can't be trusted to make decisions about kids, so that option seems unlikely. More likely is that those new tests will be scored by those vaunted automated scoring technologies, machines that can assess student answers to open-ended questions without being able to actually read them. Of note, no one is claiming those computers can read, only that it has been proven statistically that those automated systems can score student tests as accurately as do the temporary employees currently doing the job. That argument, of course, is presented as a defense of the new technologies, although I'm not sure how much solace we should find in knowing that a computer that can't read a student's answer understands it just as well as does some bored slacker being paid slave wages to give only fleeting glances to the work.

For the last 10 years in this country, we've regularly seen standardized tests results that can't be believed and standardized testing companies that can't be trusted. Still, the United States seems to be heading towards taking the decisions about American education out of the hands of American educators and instead placing that sacred trust in the welcoming arms of an industry run entirely without oversight and populated completely with for-profit companies chasing billions of dollars in business. In fact, even though the testing industry has proven consistently incompetent over the last ten years, we seem to be moving towards expanding our emphasis on it. To me that comes across as naïve, although a less forgiving person might think that entrusting public education to a bunch of bumbling, for-profit companies falls more towards the unethical/immoral/mercenary end of the spectrum.

In any case, when next some standardized test scores are found to be incorrect or fraudulent (because they will), or some standardized testing company commits or tries to cover up another egregious error (because they will), perhaps then we can admit large-scale assessment isn't the panacea it's often been touted to be. Perhaps then we can concede that an educational philosophy based on a system of national standardized tests isn't any Brave New World of American education; it's just a bad idea that even the Chinese are already turning away from as being too inefficient and antiquated.

Caveat emptor, America. Buyer beware.

 
There are those who think American public education is in shambles and needs to be completely remade, arguing that the standardized tests mandated by No Child Left Behind will be a vital tool in turni...
There are those who think American public education is in shambles and needs to be completely remade, arguing that the standardized tests mandated by No Child Left Behind will be a vital tool in turni...
 
 
  • Comments
  • 114
  • Pending Comments
  • 0
  • View FAQ
Comments are closed for this entry
View All
Favorites
Recency  | 
Popularity
Page: 1 2 3  Next ›  Last »  (3 total)
HUFFPOST SUPER USER
cdecisneros
my micro bio is empty because I went to the micro
05:18 PM on 04/13/2011
The company that sold the tests to Florida said the people with BA would be reading and grading the tests. Lye. Anybody making barely over minimum wage were reading and scoring the tests. One year they bought pencils that were so cheap the scantron reader could not read the marks.
HUFFPOST SUPER USER
Michael Morrison
Proud Dad, Engineer, Aspring Geophysicist
04:16 PM on 04/13/2011
If a student can't "fill-in-the-bubbles," she sure as heck isn't capable of any higher thinking skills.

As a country, we're always looking for short cuts...We now give kids calculators in 3rd grade, and wonder why, when they grow to be adults, they can't figure change or estimate a tip in their heads.

Standardized tests can't test everything we want our kids to learn, but the fact that so many kids are failing standardized tests tells us that either our educational system is doing an awful job, or we have unrealistic expectations of our educational system.
04:12 PM on 04/11/2011
I would like to add some reason into what is going on. As a current teacher and grad school student, this is the topic for many discussions everywhere I turn. I do agree that this testing is going against every moral fiber I have in myself. It is heart wrenching to use the standardized testing method as the only evaluation of my students, and myself. I feel I am cheating my students, cheating them of creativity and the CONFIDENCE to problem-solve without someone beating down their answers since it doesn't fit the "box." I hate that many comments I read from many non-educators are how awful we are due to test scores and now I am a scapegoat. However, I do see the point in all of this. We are trying to make teachers and students accountable. We DO NOT want to go back in time where lessons were haphazard and everyone was given total freedom. There does need some consistency. Unfortunately, I have to scratch my head like everyone else and wonder, "Well, where does this put us?" I don't know. All I do know is I try to squeeze in some sort of creativity and breath into these curriculums as possible. THAT I can control. My advice to the frustrated educators out there is: work with the system and do what you can to inspire and create. As hard as it is, this is what we are really here for.
06:29 PM on 04/10/2011
The first step needed in making the tests more useful is not giving them to the teachers. If the teachers don't know what is asked in the test they can't teach for the test. The result will be a more rounded education for the students and a better working atmosphere for the teachers.
photo
HUFFPOST SUPER USER
Welshish
The sadder but wiser girl for me.
09:19 PM on 04/12/2011
The tests are NOT given to the teachers in advance! But the results of previous years' tests and the questions on previous years' tests are available as a public document. I'm sure you can go online and see the Qs and As from your state, no problem.
ONE rationale for teachers resigning to 'teaching to the tests' is the idea that, if the state bd of ed deems what curriculum is most important, then who are teachers to buck that system?
HUFFPOST SUPER USER
dloitz
04:45 PM on 04/10/2011
While this article shows that testing is marred in Errors and lack of meaningful thought. It is more about the idea that somehow test like this are good for education.

We are discussing this in more depth over at the Cooperative Catalyst on Joe Bower's post the Illusion of Standardization.

http://coopcatalyst.wordpress.com/2011/03/13/the-illusion-of-standardization/

Please Join us and add your comment.

David Loitz
Cooperative Catalyst Editor
photo
poeticjustice4all
Past = Prologue
02:10 PM on 04/10/2011
Only about 1/3 of America's high school students are able to read this basic HuffPo article.

That's a reality check. I welcome that sort of assessment.Through testing and evaluation, we have discovered that far more children are being left behind than the schools wanted to admit -- most of those are poor children and students of color.

Yes, the testing companies are flawed. Our entire school system is deeply flawed. There is plenty to complain about. However, complaints are not solutions.

The testing foul up in Minnesota was distressing, but the incident doesn't even begin to compare to the horror of what the corrected scores showed: instead of shrinking, the achievement gap in Minnesota has grown in the last three years. Currently Minnesota has one of the largest achievement gaps -- or more precisely, education debts -- in the nation. We can easily fix a testing error. Fixing our failed schools is not so simple.

Go right ahead and change federal law. Stop the testing and evaluations altogether. Eliminating the reality check won't change the reality that a majority of our students cannot even be part of this conversation because they cannot read and respond in writing.
07:41 PM on 04/10/2011
Those are all very valid points, but did you stop to think that the overemphasis on the testing is what is MAKING the achievement gap grow? There is a huge dichotomy between the type of education suburban and urban kids receive. I know because I have taught in both settings and I have children of my own attending school in a suburban setting in the same grade level I teach in an urban setting. The juxtaposition is so painfully obvious everyday.

In the urban setting, districts and principals are so desperate to get SCORES up, that the quality of education has actually decreased. In my own school, we lose more and more time for social studies and science, and everything fun about school has been eliminated in favor of more "seat time" and "active engagement" and "Time on task". Basically anything that makes school even remotely interesting has been eliminated, and now we have even MORE students unmotivated to put forth any effort. This is not the case in suburban schools, where they are NOT taught with "cookie cutter curricula" and they still manage to find time to do the fun and interesting things, where they don't have "test score failure" hanging over their heads. In reality it's the over-emphasis on testing that is destroying our inner cities schools and the achievement of the kids.
01:01 PM on 04/13/2011
Poeticjustice4all...you are completely missing the point. Our current system is not justice for poor children and students of color. By and large, because of accountability pressures, schools that service these populations are forced into a rote, coverage model that is geared towards answering multiple choice tests. This is not justice. In fact, this is the antithesis of justice. The affluent get the best, most innovative curriculum. The marginalized get test prep. Not sure how you can support that system.
This comment has been removed due to violations of our [Guidelines]
photo
HUFFPOST SUPER USER
bessielil
trying to organize hummingbirds
09:56 AM on 04/10/2011
With all that we know about learning styles and other useful talents that kids have, testing one way in very few subjects does NOT reveal the quality of education going on. It reveals the percentage of students who do well on standardized tests. Any tests not being done diagnostically, to help students and inform teachers, are questionable.

Oh, and the industry is a scam itself. Yet another example of public funds drained away into the private sector.
07:41 PM on 04/09/2011
The testing industry isn't perfect. I really wonder how many different tests we need for 4th grade proficiency, and why each state needs its own -- seems crazy to me, a scam. Psychometrics is, what, 50 or 60 years old? We've known how to determine the difficulty, validity, and reliability of test questions for an awfully long time. It's no mystery, or shouldn't be. Whatever. Where there is money to be made doing it, there are people willing to sell crap. That does not mean that standardized tests have no value; just that the people purchasing them are ... hmmm ... under-qualified.

As for carester, I absolutely agree. There is nothing "tricky" about the questions she lists. Unless you define "tricky" as "requiring thought."
HUFFPOST SUPER USER
hairydodger
07:14 PM on 04/09/2011
Hot button topic for me. Texas sets the standard for American History textbooks for the nation. The conservative Texas school board cherry picks American history so it says what they want. They make our history heroic and leave out the flaws and the truth. They don't tell it like it was. I dislike them so very much. School teachers have their jobs directly attached to the standardized tests and how much improvement their students show. This is going to take the USA deeper into becoming a third world country. Now the teachers have to teach to the test instead of teaching to the students. (Don't forget who wrote the books and the accompanying tests.) In the next twenty years almost everyone will have been taught conservative Texas version of US history. They won't teach that Lewis & Clark took twelve ounces of Opium on their expedition. They won't teach what Lewis & Clark wrote about withdrawal. They won't teach that the very first law governing the growing of hemp were passed in the 1600's or that the nation's largest distiller in the late 1700's was George Washington or that Thomas Jefferson grew white poppies for medicinal purposes. We really have to change our educational system so it fits the new digital natives and take the propagandizing of our youth away from conservative texans.
07:42 PM on 04/09/2011
And this is why we need to do away with textbooks altogether. Who looks at them anyway? And talk about obsolete!
07:38 AM on 04/10/2011
Who looks at textbooks? Probably your classmates who did better than you at school did. That may be why they'd be less likely to say things like, "And this is why we need to do away with textbooks altogether­. Who looks at them anyway? And talk about obsolete!"
02:40 PM on 04/16/2011
Remember: the winner gets to write history. And that has been going on for milleniums!
photo
HUFFPOST SUPER USER
tultican
Thomas Ultican, MEd. BS Mecahnical Engineering
03:09 PM on 04/09/2011
Dewey explicitly said that we need to understand that we are not physically or intellectually equal. Testing and sporting competitions clearly demonstrates that. However, we do have a human right to have our unique potential educated so that we can make our unique contribution to society. These top down schemes for managing American education from capital cities cannot achieve the demands of that social and moral obligation. In the 1990’s, Chicago was a city who’s educational establishment was producing excellent understanding of what good pedagogy was. Today, after 12 years of “reform”, that city’s schools have been so harmed that its children are alienated and violence has replaced intellectual growth. This path of education reform is dangerous. Testing hell must be stopped and educators across the country must be allowed to do their job. Let teachers evaluate the students and breathe creativity back into these school systems that have been so damaged by amateurs, privatizers and the “Billionaire Boys Club.”
02:55 PM on 04/09/2011
So you're blaming the schools and teachers?: "We know that a school administration -- by making test questions easier or lowering cut scores -- can portray improvement in its classrooms ..." This is impossible in my state, Oregon, and I'm guessing in almost every state. These are state tests. School administrators have no control over test questions or cut scores.. "We know that "rogue" teachers or administrators -- by erasing incorrect student answers and changing them to correct ones...." Thanks for continuing to attack me and other teachers. Again, that cannot happen in my state and many others because the tests are all done online. There's nothing to erase.

That being said, I can easily believe in gross incompetence of the testing companies whose only goal is raking in the money. Your solution, these are crappy tests (agreed, they are pathetic) so let's scrap everything and not test the kids at all, is a cop out. It's also naive to imagine that standardized tests are going away. Let's get real.

STEP 1: Let teachers and universities make and grade the tests. That puts the experts in charge and removes the profit motive.

STEP 2: Make the schools' motive for achievement a carrot and not a stick: Schools with low scores should be given additional resources (money) to improve their facilities, hire more teachers and address the needs of these children.

There you go. Problem solved.
photo
HUFFPOST BLOGGER
Todd Farley
04:29 PM on 04/09/2011
I didn't blame teachers. I said testing data can't be believed, for many reasons including the fact there have been proven instances of adults changing students scores.

I also never said "let's scrap everything." I said we should be wary of believing in large-scale assessments.
02:28 PM on 04/09/2011
It would make sense for each state or district to create a standard curriculum in the core subjects and allow teachers to collaborate and design assessments to measure if in fact students learned core skills for that subject. I stress skills and concepts over facts and minute details. I wish parents would protest standardized testing and government to revamp NCLB and this testing madness. Life is not based on choice A B C or D.
photo
HUFFPOST SUPER USER
calamityjohn
01:41 PM on 04/09/2011
the over reliance on standardized testing comes from the same inertia that created mandatory sentencing .. politicians needing to show constituents they are tough on something .. but not having the desire to address the complex and difficult issues of poverty that are the root of the systemic problems in both crime and education ..
10:59 AM on 04/09/2011
This article fits well with information in "The Myths of Standardized Testing." Check my summary of this book at http://bit.ly/ifREGi.