More

Featuring fresh takes and real-time analysis from HuffPost's signature lineup of contributors
Tom Vander Ark

GET UPDATES FROM Tom Vander Ark

The Cost-Comparability Conundrum

Posted: 07/20/11 04:08 PM ET

We're stuck and $365 million may not help. The United States places an unusual degree of importance on the reliability of year-end tests. These tests have been around for 15 years and, because we have so little performance data, we try to use them for a variety of purposes. For many reasons, the tests haven't improved much. The new barrier is the dual fixation on cost and comparability.

Innovation occurs when markets are efficient -- where supply meets demand, where consumers quickly (and often ruthlessly) express preferences, where risk is rewarded with return. Blockages can occur either on the buy or the sell side, but they often slump into complacency together.

In the case of educational testing, we have a set of complicated political problems resulting in weak demand for assessment innovation. The next generation of artificial intelligence will help make better test items faster and cheaper to score. But this is more a political problem than a technical problem.

First and foremost, states are broke and can't afford to spend more money on testing. What they spend on testing appears to vary from $11.50 for cheap online multiple choice test to more than $50 per student for hand score tests with lots of constructed response.

Most states have made a commitment to work cooperatively on the development of new assessments. All but six states have joined one of two Race to the Top funded consortia. But if that means buying a common test, the cost of these new tests will be bounded by the low end of the range because members of the steering committee that represent states with cheap tests can't go back to their legislatures with a test that costs three times as much. As a result, intense cost pressure will limit the number and type of constructed response items.

The signatories to an Innosight letter think there's a better approach. Rather than everyone buying the same cheap test, they advocate for state testing systems that customize assessment strategies by state, encourage innovation, utilize sampling and comparison techniques, but retain sufficient comparability.

We're still new to using data in education. Hungry for actionable data, we quickly started trying to use state tests for everything: instructional improvement, school accountability, and teacher evaluations. We've been relying on a few dozen data points about each student for a lot of important decisions.

Testing politics have not yet been recalibrated for information abundance. With the shift to personal digital learning we'll soon have thousands of daily data points about every student. With a huge trail of evidence it won't be necessary to place so much weight on year-end (summative) assessments.

The era of big data will allow us to construct temporary agreements about how to interpret, use and compare assessment results. (They will have to be temporary, because the data will improve every year or two.) States that want rich performance-based feedback will get it. States that are only concerned about cost will get more for less.

I've interviewed many of the steering committee members for both consortia in the last three years. In every conversation, I feel the weight of the cost-comparability conundrum. If we are not thoughtful, decisions in the next 12 months will compound our anachronistic fixation on reliability with the new drive for comparability. We'll end up with 20 states using one test, 25 states using another test, and neither being as good as they could be. We'll be trapped for at least five years with tests that reinforce the worst practices of the old system.

Following are a set of incomplete and partially informed recommendation but directionally correct solutions to the cost-comparability conundrum.

  1. Think assessment frameworks not tests
  2. Host a capabilities demonstration in 60 days to showcase state of the art assessment and online scoring capabilities (inside and outside K-12)
  3. Build comparability models using NAEP like sampling and correlation strategies
  4. Correlate games, adaptive assessments, and performance tasks to a Common Core lexile scale and encourage districts/networks to submit standards-based grade-books as evidence of progress (i.e., a correlation service could verify, with a confidence interval, the correlation of any set of assessments)
  5. Encourage multiple collaborations: during the transition to personal digital learning, we could use 20 different testing systems not just two (ie., PARCC deployment could be groups of states working together in 8 state assessment collaboratives rather than one test)
  6. States should build in a strong waiver process for districts/networks that can show much stronger evidence models
  7. States should use a component strategies and plan on updating state testing systems at least every other year

Beyond these suggestions, it's quite possible that several market shaping strategies could be used to mobilize resources and accelerate innovation. Together, these strategies would give us that data our kids and teachers deserve.

 

Follow Tom Vander Ark on Twitter: www.twitter.com/tvanderark

 
 
  • Comments
  • 12
  • Pending Comments
  • 0
  • View FAQ
Comments are closed for this entry
View All
Favorites
Recency  | 
Popularity
03:20 PM on 07/26/2011
I have been teaching for 12 years and it is a known fact that students always score higher on paper and pencil tests over computerized tests. In our district two years ago when we were having trouble with an online service, students took the states reading test with pencil and paper and scores throughout our district went up 20%. predictably the next year when computerized tests returned and even though students have 3 opportunities to test, scores went back down 20%. I've experienced the same phenomenon when taking the pencil GRE and computerized GRE test.
These standardized tests are a product that when computerized are cheap, and they end up driving curriculum even though most teachers will tell you they do not measure learning nor are they designed to measure teaching.
11:33 AM on 07/25/2011
The only way you are going to get rich data on student learning is if that learning is on-line so that all responses are recorded and can be analyzed. Otherwise testing is a tax on learning itself.

So implicitly data driven learning will be largely on-line or hybrid learning, not traditional classes.
09:12 PM on 07/24/2011
Enough of your data driven data drivel. Enough data to drown the world and we do not need your marketing your version. Could care less what version it is. Just like all the selling or educational programs, who benefits? Not the students, not the teachers, just folks like you.
06:04 PM on 07/22/2011
To say that we're new to using data in education is preposterous. I was in school for 18 years and taught for nearly ten. I took thousands of tests and have given thousands, all providing very important data that informs education. If by data collection you mean we haven't given over the power to test students and teachers by using one idiotic test to one or three huge multi-million dollar corporations, thank God for that.
photo
HUFFPOST BLOGGER
Tom Vander Ark
www.EdReformer.com
04:36 PM on 07/23/2011
Good teachers have always used assessment data to promote learning. What we didn't have before 1994 at any scale was any comparable system data. As a result, most superintendents had little data to know how their schools were doing.
02:29 PM on 07/22/2011
Why market shaping strategies at all? Education doesn't fit the supply and demand model the way, say consumer goods do (the 'product' takes a couple decades to develop; The market for the product is unknown when its started; one doesn't have the choice whether to participate; the 'customer' is not necessarily who is paying; the 'parts' used to make the product are variable; etc, etc). And as it relates specifically to the testing market, I would argue the 'demand' is completely independent of forces related to whether students are actually learning.
photo
HUFFPOST BLOGGER
Tom Vander Ark
www.EdReformer.com
04:39 PM on 07/23/2011
States buy testing services. There are a few big providers. For a variety of reasons, this 'market' has not progressed rapidly. The question I'm investigating is, can new resources be mobilized to accelerate assessment innovation. Read my latest blog about how Cisco assesses students in Network Academies. It's a more thoughtful approach than we use in K-12.
photo
SF TKF
Cthulhu thinks you'd make a nice sandwich.
01:13 PM on 07/21/2011
Is it just me, or did American schools do a better job of educating kids back with that was the focus, rather than testing to prove they had had done so?
photo
HUFFPOST BLOGGER
Tom Vander Ark
www.EdReformer.com
04:39 PM on 07/23/2011
No they didn't. And to the extent that some did, we'd never know it.
photo
nypoet22
Psychology Ph.D., Civics Teacher, Songwriter
11:00 AM on 07/21/2011
that's an awful lot of words just to explain that extensive testing doesn't work, everybody hates it, and nobody wants to buy more.
photo
HUFFPOST BLOGGER
Tom Vander Ark
www.EdReformer.com
04:41 PM on 07/23/2011
The testing we impose on K-12 today is 1950 psychometrics. With the shift to personal digital learning, students and teachers will benefit from instant feedback, adaptive content, and powerful achievement analytics.
photo
nypoet22
Psychology Ph.D., Civics Teacher, Songwriter
12:58 PM on 07/29/2011
the alleged paradigm shift to programmed feedback hasn't changed much since the 1950's either. B.F. Skinner did all those things and more, without the need for anything high tech or digital. no matter how advanced or primitive the hardware, programmed education is expensive, marginally effective and deeply unpopular with teachers and students. that's a fact no amount of high-tech jargon will change.