iPhone app iPad app Android phone app Android tablet app More

New York City Teacher Ratings: How Its Value-Added Model Compares To Other Districts

New York Teacher Ratings

  Sarah Butrymowicz and Sarah Garland First Posted: 03/ 2/2012 12:48 pm Updated: 03/ 2/2012 12:59 pm

This piece comes to us courtesy of The Hechinger Report.

New York City schools erupted in controversy last week when the school district released its "value-added" teacher scores to the public after a yearlong battle with the local teachers union. The city cautioned that the scores had large margins of error, and many education leaders around the country believe that publishing teachers' names alongside their ratings is a bad idea.

Still, a growing number of states are now using evaluation systems based on students' standardized test-scores in decisions about teacher tenure, dismissal and compensation. So how does the city's formula stack up to methods used elsewhere?

The Hechinger Report has spent the past 14 months reporting on teacher-effectiveness reformsaround the country, and has examined value-added models in several states. New York City's formula, which was designed by researchers at the University of Wisconsin-Madison, has elements that make it more accurate than other models in some respects, but it also has elements that experts say may increase errors—a major concern for teachers whose job security is tied to their value-added ratings.

"There's a lot of debate about what the best model is," said Douglas Harris, an expert on value-added modeling at the University of Wisconsin-Madison who was not involved in the design of New York's statistical formula. The city used the formula from 2007 to 2010 before discontinuing it, in part because New York State announced plans to incorporate a different formula into its teacher-evaluation system.

Value-added models use complex mathematics to predict how well a student can be expected to perform on an end-of-the-year test based on several characteristics, such as the student's attendance and past performance on tests. Teachers with students who take standardized math and English tests (usually fewer than half of the total number of teachers in a district) are held accountable for getting students to reach this mark. If a teacher's students, on average, fall short of their predicted test-scores, the teacher is generally labeled ineffective, whereas if they do as well as or better than anticipated, the teacher is deemed effective or highly effective.

A number of states and districts across the country already tie student performance on standardized tests to teacher evaluations; others have plans to do so. Many education reformers, including those in the Obama administration, commend the practice. States were awarded points in the federal Race to the Top grant competition for creating policies that tie student academic growth to teacher evaluations.

In Florida, by 2014, all districts must use value-added ratings for at least half of a teacher's total evaluation score. Ohio districts will start doing so in 2013. This year in Tennessee, student test-score data will count for 35 percent of each teacher's evaluation. Value-added ratings make up 20 to 25 percent of New York's new teacher evaluation framework. And politicians in Nebraska and Louisiana are pushing for these measures to be included in new teacher-evaluation systems.

The new evaluations, which will generally use test-scores as one of multiple measures, including classroom observations, are increasingly being used in decisions about compensation, retention and tenure.

Advocacy groups like The New Teacher Project, now known as TNTP, and the National Council on Teacher Quality have cheered the inclusion of value-added scores in teacher-evaluation systems. In the past, most teachers were rated based on infrequent, "drive-by" principal observations that resulted in satisfactory ratings for up to 99 percent of teachers. But skeptics, including teachers unions and researchers, say that value-added models have reliability problems.

Depending on which variables are included in a value-added model, the ratings for teachers can vary dramatically, critics say. As an example, researchers at the University of Colorado examined the formula that an economist hired by the Los Angeles Times created to rate teachers there (the economist's work was funded in part by the Hechinger Institute on Education and the Media). The University of Colorado researchers found that more than a third of L.A. Unified teachers would have had different scores if a slightly different formula had been used.

A 2010 study by Mathematica Policy Research found that the error rate for value-added scores based on three years of data was 25 percent. In other words, a three-year model would rate one out of every four teachers incorrectly. The error rate jumped to 35 percent with only one year of data. The report cautioned against using value-added models for personnel decisions, a position that other experts have echoed.

In New York City, some of the teachers whose scores were published last week received ratings based on multiple years of data, according to a 23-page technical report describing the city's statistical formula. But other New York City teachers— a spokesperson for the city education department was unable to say exactly how many—were rated based on only one year of data.

Washington, D.C. also uses just one year of student test-scores in its statistical model. But the system that Bill Sanders, a researcher known as the "grandfather" of value-added measurement, designed for Tennessee uses five years of data in creating a score for each teacher. To ensure that elementary teachers aren't judged based on just one or two years of test-score data, the Tennessee model takes into account a student's performance in later years, Sanders says. For example, third-grade teachers are rated based in part on how their students do in subsequent grades.

"When any one student takes a math test, on any one day, there is a huge uncertainty around that score," Sanders told The Hechinger Report in an interview last year. "It could be the kid got lucky this year, and guessed two or three right questions. Or the kid this morning could not have been feeling well. Consequently that score on any one day is not necessarily a good reflection of a kid's attainment level."

Another question that educators and researchers have debated is whether the statistical models should account for student characteristics that are linked to achievement—for example, poverty, English ability and special-education status. In places like Florida and Washington, D.C., value-added models have accounted for such factors, in part because of the limitations of using fewer years of test-score data.

New York City's model does as well. Variables include race, gender, socioeconomic status, and even whole-class characteristics like the size of the class and how many students are new to the city.

Many researchers argue that adjusting for student demographic characteristics is unnecessary because the growth scores are calculated by comparing students against themselves. Sanders and others say that including student characteristics could bias the scores by making it easier for teachers of disadvantaged students to be rated more highly.

A black student, for example, might be expected to do worse than a white student in such a model, an assumption that Sanders says lowers expectations for the black student, along with the teacher who has that student in class.

In New York, high-rated teachers are evenly spread across both low-performing and high-performing schools, which experts say is partly a result of the formula's adjustments for student demographics. Teachers with demographically similar students—whether they are low-income, minority, or have special needs—are ranked relative to one other, not the entire teaching force.

Other researchers have argued that factors like student poverty should be taken into account, however, because concentrated poverty, for example, is linked to lower student performance, suggesting that a student's peers may affect how that student does in school and on tests. That is, a teacher who has a large number of disadvantaged students in class may have a more difficult job getting a higher rating than teachers with fewer disadvantaged students.

In an attempt to settle the question, Mathematica, the research group, is currently examining the effects of whole-class characteristics on teacher value-added ratings in a study of 30 districts across the country.

Although it gets much less attention, one of the biggest problems with value-added modeling, according to many experts, is that the ratings cover only a fraction of teachers—those whose students take standardized tests in math and English, typically in grades three through eight. As new teacher-evaluation systems go into effect in more districts and states in the next two years, many, including New York City, will be grappling with how to rate everyone else.

Rhode Island is using teacher-created goals on classroom work and tests. Colorado is planning to use off-the-shelf assessments and school-generated methods to gauge how teachers in subjects like physical education and music are performing. In Tennessee, teachers without value-added ratings are graded in part on how the teachers who do receive ratings in their school perform. And Florida is creating more tests, one for every subject and grade level down through kindergarten.

Harris calls Florida "an example of what not to do." Given the problems with value-added modeling, no matter which formula is used, he suggests that the best uses of the ratings might not be to make decisions about hiring, firing and tenure. Instead, they can be used to give low-rated teachers more training or principal observations, rather than pink slips.

This story also appeared on GothamSchools on March 1, 2012.

Also on HuffPost:

FOLLOW HUFFPOST EDUCATION

This piece comes to us courtesy of The Hechinger Report. New York City schools erupted in controversy last week when the school district released its "value-added" teacher scores to the public afte...
This piece comes to us courtesy of The Hechinger Report. New York City schools erupted in controversy last week when the school district released its "value-added" teacher scores to the public afte...
 
 
  • Comments
  • 30
  • Pending Comments
  • 0
  • View FAQ
Post Comment Preview Comment
To reply to a Comment: Click "Reply" at the bottom of the comment; after being approved your comment will appear directly underneath the comment you replied to.
View All
Favorites
Recency  | 
Popularity
Page: 1 2  Next ›  Last »  (2 total)
photo
HUFFPOST SUPER USER
Bill Jones123
11:15 PM on 05/09/2012
I have always wanted to ask a think tanker what exactly they MAKE AND SELL, and what product of value they offer in exchange for their wage.

What is it exactly that all of the edu-reform talking heads provide in terms of value that generates wealth for this country?

Does any other country buy their opinion or their advice?

I surmise that their efforts are entirely funded using non-profit status or the hard earned incomes of hard working folks.

Frankly, the edu-reform industry is robbing taxpayers with loads of big promises and then under performing on all possible metrics.

Let's be thankful for one thing. Those dumb reformers are not right now deciding to declare our nuclear submarine program a disaster in desperate need of their expert advice.
photo
HUFFPOST SUPER USER
Bill Jones123
11:10 PM on 05/09/2012
The bitter irony is this: Those making these VAM measures have tenure, a lifetime guaranteed job, representation by a union or association, and almost no method by which to be evaluated on their job performance.

Our think tanks, elite universities, non-profits, edu-reformers, and higher govt. positions are hidden from the public and any kind of accountability. They can screw up all they like and just say something like, " There are always small problems whenever innovations occurs on a large scale."

Our nation and our children are being ridden into the ground by these thugs.
HUFFPOST SUPER USER
jp90
07:07 PM on 03/05/2012
VAM measurements also mean nothing when students are not held in any way accountable for their results. We are about to begin 3 days of testing in my HS. Juniors will be testing for 3 days, one being the ACT. That one, they might only just take seriously (at least those hoping to attend college). The other 2 days are our Michigan State assessments. They could care less about those, and often finish their "bubbling" in about 15 minutes for a 45 or 60 minute test. The freshmen will take a 1 day "practice" test, and the 10th graders will take a PLANS test (practice ACT). Neither of these tests has any meaning or consequence to the students directly, and yet we are supposed to look at the data when it comes back (maybe by May if we're lucky), and have that "inform" our instructional practices for the next year. We will also be evaluated based on these scores. Again, on a test that students don't care about, and do not try on. How is that useful data??
10:36 AM on 03/05/2013
Right on. Students must have consequences for tests that are used for teacher evaluation. These tests must impact their grades and/or their college acceptance possibilities.
photo
HUFFPOST SUPER USER
calamityjohn
06:00 PM on 03/04/2012
What always fascinates me about the whole reform thing .. I constantly hear how we are behind other nations in international testing as a justification for "reform" .. yet never hear a proposal based on the things those other countries are doing that we are not.
Allthosewhowander
My micro-bio is a microclimate
06:24 PM on 03/04/2012
The proposal from the "reformer$" is and will continue to be; fire teachers, cut wages and benefits, cut positions in schools where people are employed to actually work with children, subsidize testing and curriculum companies, instill the business model to govern education, more testing, label schools failures, based on an absurd, one size fits all, once a year test, so they can become charters, allow businessmen and politicians to mandate legislation that governs schools, regardless of their education background and knowledge, allow hack administrators to run self promoting dictatorships at schools, with little to no knowledge of instruction, but most of all, the biggest part of the "reform" movement that everybody must agree upon...its all the teachers' fault.
11:50 AM on 03/05/2012
...And if you teach in NYC, publish teacher ratings on the internet and humiliate them into quitting and leaving the profession.
photo
HUFFPOST COMMUNITY MODERATOR
blindjester
English and ESL teacher
09:31 AM on 03/04/2012
They're making new tests for value-added scoring in my district. We've already got math and English; now they're making tests for everyone else.

My friend, who teaches photography in the art department will soon have to give the same paper-pencil test each semester that they give in the drawing classes and ceramics classes.

The teachers in weight training will be giving the same tests as the team sports teachers.

Sure, this is going to work.

Let's spend billions nationwide on this, waste enormous amounts of instructional time, and ruin the careers of good teachers to try to get a number to describe every teacher.
HUFFPOST SUPER USER
gremlin1
Compulsive lyer.
09:41 AM on 03/04/2012
Wow, way to take all the joy and creativity out of the arts and Phys. Ed. Our students are doomed in this climate of DE-form.
HUFFPOST SUPER USER
iwinagin
10:25 PM on 03/03/2012
Students know who the bad teachers are. Teachers know who the bad teachers are. Parents know who the bad teachers are. Why is education administration the only group that doesn't know the bad teachers?

Standardized tests are really unnecessary for determining teacher effectiveness. The true test of teacher effectiveness happens every night at my dinner table. I ask, for examplel "What did you learn in Government today?" If the answer comes back as "nothing" on a regular basis there is a strong indication that the current teacher may be ineffective in thier position.

Ineffective teachers need to be given support to help them reach effectiveness. If they cannot become effective they need to be moved to a position where they can be effective. and finally if other actions do not work they need to be fired.
Allthosewhowander
My micro-bio is a microclimate
06:28 PM on 03/04/2012
Education administrators are the dirty little secret in the education system. Many of them failed at one point in their education career, but knew the right people, took the right classes, cite the right research, and now hide from accountability in their district "leadership" positions. They are about maintaining the status quo for the bloated ineffective bureaucracy
06:43 PM on 03/04/2012
Yes, of course the problem is with administration, but once again the blame is being placed on "the teachers" and "the unions."

I taught for 42 years. The good administrators visited my classroom often and knew how my students were doing. Of course, this is what they are supposed to do. The incompetent principals hid in their offices, often doing personal work, such as purchasing insurance or asking questions of car dealers.

The problem of the "bad" teacher is an administrative one. If we treat it as such, we might begin to see some real improvements, instead of the expensive nonsense that is going around right now.
07:22 PM on 03/03/2012
“A 2010 study by Mathematica Policy Research … cautioned against using value-added models for personnel decisions, a position that other experts have echoed.”

In Gilbert Public Schools in Arizona, administrators make it up as they go along. Literally.
It was flabbergasting when the principal held an annual evaluation conference with a teacher … using student assessment data in the teacher’s evaluation report! The principal’s explanation:

"I put a statement about ATI Benchmark data in all the -- everybody's evaluation this year because year after next a component of the evaluation will be based on data and I didn't know what kind, but I thought I'd put something about data in here, so I picked ATI. And so according to ATI Benchmark data, out of 25 students total, 15 students were on course to meet the standards in AIMS Math. Two students are on track to exceed the standards in Math. And so I thought that was a really nice result."

What the principal wrote in the evaluation was different: "According to ATI Benchmark data, out of 25 students total, 15 students are "On Course" to meet the standards in AIMS Reading and Literature. ATI Benchmark data shows that 4 students are "On Course" to meet the standards in AIMS Math. Two students are on track to exceed the standards in Math."

The principal did not show the teacher any reports with this data. The principal just made it up. http://westernconnections.com/evaluations.html
12:26 PM on 03/03/2012
As a teacher from the UK I have to say that it is common practice for a teacher to be assessed by their value added performance. I have recently moved to New York and so know little about the education system here but it seems value added is only taken into account for standardised Maths and English. Do you not have a system that predicts student outcome for all subjects as we do back home? This makes it easier to assess all teachers in all subjects. On the subject of socioeconomic factors, as a Sociology specialist i feel it essential to take these into consideration as research shows that class, race and gender all affect a child's educational attainment!
08:21 AM on 03/04/2012
It's an increasingly common practice in the US, too. And it's ineffective.

I'm less concerned about who else does it and more concerned about the fact that it doesn't work.
07:52 AM on 03/03/2012
I believe there is an error in this article. I am a teacher in Tennessee and my ratings are based on only the time I have worked in this state as a teacher. That's two years of ratings; certainly not five.
This user has chosen to opt out of the Badges program
06:57 AM on 03/03/2012
This is pretty entertaining. All of a sudden, people who have never taught in their lives (including the current Education Secretary, Arne Duncan) are deciding which teachers are worthy of keeping their jobs and which ones are not.

Yesterday I watched "Morning Joe" for a while. They were having yet another one of their shows on education with the hosts and guests making arrogant and ignorant statements about 'what education should be' and 'how to get there'. I laughed, rolled my eyes, and thought "The blind leading the blind--that will get us far", and turned the channel.
08:18 AM on 03/03/2012
I tend to watch Morning Joe, but could not stomach yesterday. I can never understand why there is never a Master Teacher or successful school/district administrator as part of the discussion. There are many out there who succede and are not using these "reform" efforts. They are never in the conversation.
This user has chosen to opt out of the Badges program
10:42 AM on 03/03/2012
Simple. They don't want them on the show.

Anyone who is rational, moderate, fair-minded and calm does not make their ratings go up. They want the most extreme, partisan, ideological, angry people on their panels so they can all commiserate with each other about "failing schools", what terrible people teachers are, and how we're bringing down civilization and the economy all by ourselves. You gotta laugh at that.
This user has chosen to opt out of the Badges program
photo
12:05 AM on 03/03/2012
I remember when they were first proposing this whole concept of evaluating teachers based on students' test scores. Some fair minded educators gave those in power the benefit of the doubt: "No way, they'll never do that! It would't be fair. What about the teacher that has a lot of English language learners or special education kids in their regular ed classes? They wouldn't do that! It wouldn't be fair to compare those classes to the AP classes and pit those classroom teachers against each other. How is that a fair competition? How can they fairly apply merit pay like that? I just can't believe they would do that!" Well believe it. They are doing it. They don't care that it's not fair. Anyone who protests is just a whining, lazy, bad teacher.
12:49 PM on 03/03/2012
"It's not fair" isn't the best argument against it. The fact that it's not good for kids is. Judge teachers on students' scores, which have much more to do with parenting and student ability than they have to do with teaching, and you're more likely to keep bad teachers and get rid of good ones than you'd be if you evaluated them on, you know, teaching.

The teacher-bashing in this country has reached a fever pitch. Nobody cares that the policy is unfair to teachers. But it's also very bad for students.
07:27 PM on 03/03/2012
Apply merit pay fairly? The principals who evaluate the teachers also assign students to a teacher. Favored teachers will get the high-performing students; disfavored teachers will get the disciplinary problems, the kids who need special ed but don't get help, English Language Learners, and the bullies. Then not only will merit pay distributions be gamed, so will the value added scores that get the disfavored teacher fired.
This user has chosen to opt out of the Badges program
photo
10:25 PM on 03/03/2012
Absolutely, and the more outspoken teachers that dare speak out to stand up for themselves and their colleagues make themselves targets of this corrupt and unfair policy. There are actually teachers who prefer to work with students who need the most help. Now they have to fear doing so will be detrimental to their careers. This hurts the kids that should have great teachers who care for them and give them that extra boost that they need.
11:19 PM on 03/02/2012
How do New York's "value-added" scores compare with other districts'?

They don't accurately evaluate teaching. So they're pretty much all alike, where it counts.
10:00 PM on 03/02/2012
Another regressive and unfair education policy promoted by the Obama administration. Teachers' unions should be speaking out against democratic administrations in Washington, California, New York City/State, Philadelphia, etc. forcing these unfair, anti-teacher "reforms" on their school districts.
photo
HUFFPOST SUPER USER
Bill Jones123
11:02 PM on 05/09/2012
Obama is a collectivist of the left.

Romney, the Tea Party, social conservatives, and the GoP are collectivists on the right.

I am certain that you are not well read enough to know where I got this idea.

You kooks never read the books you quote or claim to quote.

Answer: Inside front cover of "The Road to Serfdom" by F. Hayek.

He warned people like me about the powerful collectivist forces of the right and left.

What we have in this country is a war of two typed of socialism.

The right is NOT about freedom. It is all about imposing ITS BRAND OF BIG GOVERNMENT on the rest of us.
10:39 AM on 03/05/2013
You lost your credibility in the last sentence.
08:12 PM on 03/02/2012
I am also worried about the state tests how are they created can they be manipulated like moving the goal post to make more students pass or more students score low. Also 1 test does not really measure a students ability. and how about hearing and vision. How many students need glasses or have a hearing problem. This is a problem in urban schools and parents who can not afford to purchase glasses.
DavidBCohen
http://snipurl.com/24q1orf
04:54 PM on 03/02/2012
I think the data and reporting we've seen in New York City should have advocates of VAM and public reporting all pausing to reconsider. The results look almost random and the public sphere is full of acrimony rather than any productive dialogue about improving education. You might also have mentioned in this article that not only do experts cast serious doubt on the formulas used in VAM and the application of the formulas, but to make matters worse, the leading professional organizations for educational research and measurement do not even support the idea that tests of learning are valid measures of teaching (unless the test has been specifically designed and validated for that purpose). For more, see http://accomplishedcaliforniateachers.wordpress.com/2012/02/28/big-apple-rotten-ratings/