research reports | Jay P. Greene's Blog

Some States are Serious about K-12 Reform, Others Shirley

June 19, 2013

(Guest Post by Matthew Ladner)

John Chubb and Constance Clark have a very interesting new study out from Education Sector called The New State Achievement Gap: How NCLB Waivers Could Make it Worse or Better.

Chubb and Clark examine NAEP data and find that states are diverging into leaders and laggards. In the relative blink of an eye between 2003 and 2011, the gap between the performance of students in the best and worst performing states grew to 60 percent of the size of the White-Black achievement gap on the combined NAEP exams (4th and 8th grade reading and math).

Note that part of what has happened here is that the White-Black gap shrank a bit. Note, however, that it is still sickeningly large. Keep in mind that 10 points roughly equates to a grade level's worth of average progress on NAEP, so 105 points across four tests is quite disgusting. The state achievement gap, meanwhile, grew steadily.
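For readers who want the arithmetic spelled out, here is a rough sketch. It assumes the 105 points is the White-Black gap summed over the four tests and that the "60 percent" figure applies to that same total. Both readings are my interpretation of the post, not numbers taken from the Chubb and Clark tables.

```python
# Figures quoted in the post; how they combine is an assumption of this sketch.
POINTS_PER_GRADE_LEVEL = 10   # "10 points roughly equates to a grade level"
WHITE_BLACK_GAP_TOTAL = 105   # assumed: summed across the four NAEP tests
NUM_TESTS = 4                 # 4th/8th grade reading and math
STATE_GAP_SHARE = 0.60        # state gap as a share of the White-Black gap

# Average White-Black gap on a single test, in NAEP points.
gap_per_test = WHITE_BLACK_GAP_TOTAL / NUM_TESTS

# The same gap expressed in approximate grade levels.
grade_levels_per_test = gap_per_test / POINTS_PER_GRADE_LEVEL

# Implied best-vs-worst state gap, summed across the four tests.
state_gap_total = STATE_GAP_SHARE * WHITE_BLACK_GAP_TOTAL

print(f"White-Black gap per test: {gap_per_test:.2f} points")
print(f"Roughly {grade_levels_per_test:.2f} grade levels per test")
print(f"Implied state gap across four tests: {state_gap_total:.0f} points")
```

Under those assumptions the White-Black gap averages a bit more than 26 points per test, or well over two grade levels, and the state gap comes to about 63 points across the four tests.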

Chubb and Clark’s paper would have benefitted from an examination of the gory details of how some states are playing fast and loose with NAEP inclusion standards for special needs students and English language learners, especially in the case of Maryland and Kentucky. These details do not, however, take away from the broad point: some states are improving and some are getting left behind.

The study gets even more interesting as the authors compare the NCLB waivers, accountability systems and standards choices of states with strong and weak NAEP gain performances. Included among these is a comparison between Florida and South Carolina. The referee needs to step in and wrap up Maryland before he pummels West Virginia to death. “Self-reflection” for teacher evaluation Mountaineers? Surely you can’t be serious…

In a not-quite-elliptical fashion, Chubb and Clark note a clustering of states with a recent history of weak NAEP gains with unconvincing NCLB waiver promises and the Smarter Balanced Assessment Consortium. I’m shocked…

Chubb and Clark have turned in a very interesting piece. Go read it.

NAEP, research reports | Tagged: Constance Clark, Education Sector, john chubb, The New State Achievement Gap: How NCLB Waivers Could Make it Worse or Better
Posted by matthewladner

Momma Ain’t Happy

May 9, 2013

(Guest post by Greg Forster)

My colleagues at the Friedman Foundation have released a big new survey of mothers of school-age kids. And let me tell you, momma ain’t happy:

61% of school moms say education’s on the wrong track; just 32% say it’s on the right track.
Watch out, Common Core test consortia: 79% of school moms rate the federal government’s handling of education as fair or poor; only 17% said good or excellent.
82% of school moms gave an A or B to their local private schools, compared to 43% for public schools. (Momma ain’t unhappy enough!)

The study also surveyed non-moms, so you can compare and contrast. Unsurprisingly, the differences aren’t large – because if momma ain’t happy…

research reports, vouchers | Tagged: Friedman Foundation, Greg Forster
Posted by Greg Forster

We Win Pop Culture! Also, a Podcast on Win-Win

May 2, 2013

(Guest post by Greg Forster)

In a major news development, today the Heartland Institute described JPGB as a “widely read education reform-pop culture blog.” After all these years of struggling for recognition as a major voice in the pop culture world, at long last our toil and struggle have been vindicated.

Oh, and they have this podcast I did on the Win-Win report showing that the research consistently supports school choice. If you’re, you know, into that kind of thing.

In case you forgot what that column of zeros on the right looks like, here it is again.

civic values, competitive effects, methodology, random pop culture, research reports, tax-credit scholarships, vouchers | Tagged: Greg Forster, Heartland Institute, Win-Win
Posted by Greg Forster

Third Edition of “Win-Win” Adds a Third Win

April 17, 2013

Win-Win 3.0 cover

(Guest post by Greg Forster)

This morning, the Friedman Foundation releases the third edition of my biennial report summarizing the empirical research on school choice. As in previous editions, I survey all the available studies on academic effects – both for students who use school choice and for public schools. Hence the title “A Win-Win Solution” – school choice is a win for both those who use it and those who don’t.

New in this edition of the report, I also survey the impact of school choice on the democratic polity in three dimensions: fiscal impact on taxpayers, racial segregation and civic values and practices (such as tolerance for the rights of others). Guess what it shows? School choice is not just win-win, it’s actually win-win-win. It not only benefits choosing families and non-choosing families; it also benefits everyone else through fiscal savings and the strengthening of social and civic bonds.

Here’s the most important part of the report – that unbroken column of zeros on the right remains as impressive as it ever was. Do please read the rest if you’d like to know more!

Win-Win 3.0 chart

civic values, competitive effects, personal tax credits/deductions, research reports, tax-credit scholarships, vouchers | Tagged: Greg Forster, Win-Win
Posted by Greg Forster

Wolf v. Ravitch/Welner on the Effects of School Choice

April 8, 2013

(Guest Post By Jason Bedrick)

Is school choice effective at improving measurable student outcomes?

That question has been at the center of a heated debate between Patrick Wolf of the University of Arkansas and Diane Ravitch, former U.S. Assistant Secretary of Education, along with one of her supporters. The controversy began when Ravitch attempted to critique Wolf’s studies of voucher programs in Milwaukee and Washington, D.C.

After questioning Wolf’s credibility, Ravitch made three main empirical claims, all of which are misleading or outright false:

1) Wolf’s own evaluations have not “shown any test score advantage for students who get vouchers, whether in DC or Milwaukee.” The private schools participating in the voucher program do not outperform public schools on state tests. The only dispute is “whether voucher students are doing the same or worse than their peers in public schools.”

2) The attrition rate in Wolf’s Milwaukee study was 75% so the results only concern the 25% of students who remained in the program.

3) Wolf’s study doesn’t track the students who left the voucher program. (“But what about the 75% who dropped out and/or returned to [the public school system]? No one knows.”)

Wolf then rebutted those claims:

1) Ravitch ignores the finding that vouchers had a strong positive impact on high school graduation rates. Moreover, there was evidence of academic gains among the voucher students:

The executive summary of the final report in our longitudinal achievement study of the Milwaukee voucher program states: “The primary finding that emerges from these analyses is that, for the 2010-11 school year, the students in the [voucher] sample exhibit larger growth from the base year of 2006 in reading achievement than the matched [public school] sample.” Regarding the achievement impacts of the DC program, Ravitch quotes my own words that there was no conclusive evidence that the DC voucher program increased student achievement. That achievement finding was in contrast to attainment, which clearly improved as a result of the program. The uncertainty surrounding the achievement effects of the DC voucher program is because we set the high standard of 95% confidence to judge a voucher benefit as “statistically significant”, and we could only be 94% confident that the final-year reading gains from the DC program were statistically significant.

2) The attrition rate in the Milwaukee study was actually 56%, not 75%. Ravitch was relying on a third party’s critique of the study (to which Ravitch linked) that had the wrong figure, rather than reading the study herself. Moreover, the results regarding the higher attainment of voucher students are drawn from the graduation rate for all students who initially participated in the voucher program in the 9th grade in the fall of 2006, not just those who remained in the program.

3) Wolf’s team used data from the National Clearinghouse of College Enrollment to track these students into college.
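The 94% versus 95% point in Wolf's first rebuttal is easy to lose, so here is a minimal sketch of what a fixed significance threshold does. The function name and the 90% alternative are illustrative assumptions of mine and are not taken from Wolf's reports. Treating "94% confident" as a simple number to compare against a cutoff is a simplification.

```python
def is_significant(confidence_level: float, threshold: float = 0.95) -> bool:
    """Return True when the confidence level meets the chosen threshold."""
    return confidence_level >= threshold


# Confidence level Wolf reports for the final-year DC reading gains.
dc_reading_confidence = 0.94

# Under the 95% standard the evaluation set for itself, the gain misses.
print(is_significant(dc_reading_confidence))

# Under a looser, hypothetical 90% standard, the same result would pass.
print(is_significant(dc_reading_confidence, threshold=0.90))
```

The estimate itself is the same in both calls. Only the bar changes, which is why "no conclusive evidence" of an achievement gain is not the same thing as evidence of no gain.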

Ravitch responded by hyperventilating about Wolf’s supposed “vitriol” (he had the temerity to point out that she’s not a statistician, didn’t understand the methods she was critiquing, and that she was relying on incorrect secondary sources) and posting a response from Kevin Welner of the University of Colorado at Boulder, who heads the National Education Policy Center (NEPC), which released the critique of Wolf’s study upon which Ravitch had relied.

Welner didn’t even attempt to defend Ravitch’s erroneous first and third claims, but took issue with Wolf’s rebuttal of her second claim. Welner defended the integrity of his organization’s critique by pointing out that when they read Wolf’s study, it contained the “75% attrition” figure, and that the number was updated a few weeks later. In his view, they shouldn’t be faulted for not knowing about the update. As Welner wrote, “Nobody had thought to go back and see whether Wolf or his colleagues had changed important numbers in the SCDP report.”

That would be a fair point, except for the fact that they did know about the change. As Wolf pointed out, page four of the NEPC critique contains the following sentence: “Notably, more than half the students (56%) in the MPCP 9th grade sample were not in the MPCP four years later.” In other words, the author of the NEPC critique had seen the corrected report but failed to update parts of his critique. This is certainly not the smoking gun Welner thought it was.

Ravitch replied, again demonstrating her misunderstanding of intention-to-treat (“And, I dunno, but 56% still looks like a huge attrition rate”) and leaving the heavy lifting to Welner. Welner’s main argument is that Wolf should have “been honest with his readers the first time around, instead of implying ignorance or wrongdoing as a cheap way to scores some points against Diane Ravitch and (to a lesser extent) NEPC.” Welner would have had a point if Wolf’s initial response had been to NEPC and not Ravitch, but Wolf’s point was that Ravitch was holding herself out as an expert when she had never read the primary source material that she was criticizing. Instead, she relied on a secondary source that cited two contradictory figures. She either didn’t notice the discrepancy or intentionally chose what she thought was the more damning of the two figures, though, again, the figure doesn’t matter for purposes of an intention-to-treat study.
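To see why the attrition figure doesn't change an intention-to-treat result, consider a minimal sketch with a made-up cohort. Only the 56% attrition rate comes from the debate above. The cohort size, the graduation outcomes, and the `Student` type are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Student:
    stayed_in_program: bool  # still using a voucher four years later
    graduated: bool


def graduation_rate(students: list) -> float:
    """Share of the given students who graduated."""
    return sum(s.graduated for s in students) / len(students)


# Hypothetical cohort of 100 students who started 9th grade with a voucher.
cohort = (
    [Student(True, True)] * 40      # stayed, graduated
    + [Student(True, False)] * 4    # stayed, did not graduate
    + [Student(False, True)] * 40   # left the program, graduated
    + [Student(False, False)] * 16  # left the program, did not graduate
)

stayers = [s for s in cohort if s.stayed_in_program]
attrition = 1 - len(stayers) / len(cohort)

# Intention-to-treat: everyone who started counts, wherever they ended up.
itt_rate = graduation_rate(cohort)

# Stayers-only rate: the number critics assumed was being reported.
stayers_rate = graduation_rate(stayers)

print(f"Attrition: {attrition:.0%}")
print(f"Intention-to-treat graduation rate: {itt_rate:.0%}")
print(f"Stayers-only graduation rate: {stayers_rate:.0%}")
```

The denominator of the intention-to-treat rate is the full starting cohort, so students who leave the program are still counted. Attrition affects how the result should be interpreted, but it does not shrink the sample the result is based on.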

We all make mistakes. Wolf’s team made a mistake in their report and corrected it within a few weeks. Welner has stated that his team will correct the NEPC report now that their error has come to their attention a year later. Ravitch should also correct her erroneous assertions regarding the results and methodology of the studies.

(Edited for typo)

research reports, vouchers
Posted by Jay P. Greene

Sports and Academics: Coleman vs. Coleman

February 5, 2013

Nerdiness vs. Athleticism

The path-breaking sociologist, James Coleman, was not a fan of high school sports. He thought the culture of athletic prowess swamped the culture of academic success. Schools should get rid of sports and channel that competitive spirit into inter-scholastic academic contests, like Quiz Bowl.

But James Coleman also believed that the enhanced social capital produced by church attendance was key to the success of Catholic schools. The adults would get together at church, share information about their kids and school, and thus be better positioned to work together to improve their school academically. The adult culture of academic success could prevail more easily if the adults were better connected with each other by seeing each other on a regular basis at church.

But maybe high school sports are the secular equivalent of church. Perhaps Friday night football is an event, like church, that gathers parents, allows them to share information about their kids and school, and more effectively work together to improve their school.

So which James Coleman is right? Is it the one who fears athletic success subordinating academic success or the one who thinks social capital is the key to school improvement?

Dan Bowen and I decided to examine this issue with an analysis of Ohio high schools. We look at whether high schools that give greater priority to athletic success do so at the expense of academic success. The results of our analysis are in the current issue of the Journal of Research in Education.

We found that high schools that devote more energy to athletic success also tend to produce more academic success. In particular, we looked at whether high schools with a higher winning percentage in sports also had higher test scores as well as higher rates of educational attainment. We also looked at whether high schools that offered more sports and had a larger share of their student body participating in sports also tended to have higher test scores and higher attainment.

Using several different specifications, we find that higher rates of athletic success and participation were associated with schools having higher overall test scores and higher educational attainment, controlling for observed school inputs. For example, we found:

With regard to attainment, a 10 percentage point increase in a school’s overall winning percentage is associated with a 1.3 percentage point improvement in its CPI, which is an estimate of its high school graduation rate.

We also looked at whether schools that offered more opportunities to participate in sports had different rates of attainment:

When we only examine winter sports, an increase of one sport improves CPI by 0.01, which would be a 1 percentage point increase in the high school graduation rate. For the winter, the addition of 10 students directly participating in sports is associated with a 0.015 improvement in CPI, or a 1.5% increase in high school graduation rate.

In addition to attainment, we also looked at achievement on state tests:

We observe similar positive and statistically significant relationships between the success and participation in high school sports and student achievement as measured by the Ohio standardized test results. A 10 percentage point increase in overall winning percentage is associated with a 0.25 percentage point increase in the number of students at or above academic proficiency. (See Table 4) When we examine the effect of winning percentage in each sport separately, once again winning in football has the largest effect. Girls’ basketball also remains positive and statistically significant (at p < 0.10), but boys’ basketball is not statistically distinguishable from a null effect.

Lastly, we looked at the effect of participation rates in Ohio high schools on overall student achievement:

As for participation and achievement, the addition of one sport increases the number of students at or above academic proficiency by 0.2 of a percentage point. The addition of 10 students directly participating in a sports team improves the proportion of students at or above proficient by 0.4 of a percentage point. Both of these results are statistically significant at p < 0.01. (See Table 5) When examining just the winter season, adding one winter sport increases the percentage of students performing proficiently by 0.4 of a percentage point, while an additional 10 students able to directly participate in sports during the winter season relates to a 0.6 percentage point increase in students at or above proficiency (see Table 5).
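To give a feel for the size of the winning-percentage associations quoted above, here is a small sketch that scales them linearly to a comparison between two hypothetical schools. The linear scaling and the 60% versus 40% comparison are assumptions made for illustration. These are descriptive associations from the regressions, not causal effects.

```python
# Associations reported in the excerpts above, in percentage points
# per 10-point increase in a school's overall winning percentage.
CPI_PER_10_WIN_POINTS = 1.3
PROFICIENCY_PER_10_WIN_POINTS = 0.25


def associated_change(win_pct_difference: float, per_10_points: float) -> float:
    """Linearly scale a reported association to a winning-percentage gap."""
    return win_pct_difference / 10 * per_10_points


# Hypothetical comparison: one school wins 60% of its games, another 40%.
difference = 60 - 40

cpi_gap = associated_change(difference, CPI_PER_10_WIN_POINTS)
proficiency_gap = associated_change(difference, PROFICIENCY_PER_10_WIN_POINTS)

print(f"Associated CPI difference: {cpi_gap:.2f} percentage points")
print(f"Associated proficiency difference: {proficiency_gap:.2f} percentage points")
```

On these numbers, a 20-point difference in winning percentage goes with roughly a 2.6 point difference in the graduation-rate estimate and about half a point in proficiency, controlling for observed school inputs.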

It is a common refrain among advocates for education reform that athletics “have assumed an unhealthy priority in our high schools.” But these advocates rarely offer data to support their view. Instead, they rely on stereotypes about dumb jocks, anecdotes, and painful personal memories as their proof.

Our data suggest that this claim that high school athletic success comes at the expense of academic success is mistaken. Of course, we cannot make causal claims based on our analyses about the relationship between sports and achievement. It’s possible that schools that are more effective at winning in sports and expanding participation are also the kinds of schools that can produce academic success. But the evidence we have gathered at least suggests that any trade-offs between sports and achievement would have to be subtle and small, if they exist at all. Descriptively, it is clear that high schools that devote more energy to sports also produce higher test scores and higher graduation rates.

I guess James Coleman was right — er, I mean, the James Coleman who focused on social capital, not the other one who feared the culture of athletic competition.

[Updated for clarity and to correct typos]

research reports
Posted by Jay P. Greene

Head Start Revealed

January 14, 2013

Despite the obvious effort to delay and conceal the disappointing results from the official and high quality evaluation of Head Start, the Wall Street Journal shines the light on the issue in today’s editorial. DC’s manipulating scumbags might want to take note that efforts to hide negative research might just draw more attention. It’s comforting to see that the world may sometimes look more like Dostoevsky’s Crime and Punishment than Woody Allen’s Crimes and Misdemeanors.

The Journal reveals that Head Start supporters have not only ignored the latest study, but they are trying to sneak an extra $100 million for Head Start into the relief package for victims of Hurricane Sandy. They also note that the most recent disappointing Head Start result is just the latest in a string of studies failing to find benefits from the program despite a cumulative expenditure of more than $180 billion.

And then the Journal finishes with this:

The Department of Health and Human Services released the results of the most recent Head Start evaluation on the Friday before Christmas. Once again, the research showed that cognitive gains didn’t last. By third grade, you can’t tell Head Start alumni from their non-Head Start peers.

President Obama has said that education policy should be driven not by ideology but by “what works,” though we have to wonder given his Administration’s history of slow-walking the release of information that doesn’t align with its agenda.

In 2009, the Administration sat on a positive performance review of the Washington, D.C., school voucher program, which it opposes. The Congressionally mandated Head Start evaluation put out last month was more than a year late, is dated October 2012 and was released only after Republican Senator Tom Coburn and Congressman John Kline sent a letter to HHS Secretary Kathleen Sebelius requesting its release along with an explanation for the delay. Now we know what was taking so long.

Like so many programs directed at the poor, Head Start is well-intentioned, and that’s enough for self-congratulatory progressives to keep throwing money at it despite the outcomes. But misleading low-income parents about the efficacy of a program is cruel and wastes taxpayer dollars at a time when the country is running trillion-dollar deficits.

A government that cared about results would change or end Head Start, but instead Congress will use the political cover of disaster relief to throw more good money after proven bad policy.

[UPDATE: And here is a good follow-up op-ed on the study by Lindsey Burke on the Fox News web site.]

politics, research reports | Tagged: Head Start
Posted by Jay P. Greene

What Success Would Have Looked Like

January 10, 2013

Yesterday I described the Gates Foundation’s Measuring Effective Teachers (MET) project as “an expensive flop.” To grasp just what a flop the project was, it’s important to consider what success would have looked like. If the project had produced what Gates was hoping, it would have found that classroom observations were strong, independent predictors of other measures of effective teaching, like student test score gains. Even better, they were hoping that the combination of classroom observations, student surveys, and previous test score gains would be a much better predictor of future test score gains (or of future classroom observations) than any one of those measures alone. Unfortunately, MET failed to find anything like this.

If MET had found classroom observations to be strong predictors of other indicators of effective teaching and if the combination of measures were a significantly better predictor than any one measure alone, then Gates could have offered evidence for the merits of a particular mixing formula or range of mixing formulas for evaluating teachers. That evidence could have been used to good effect to shape teacher evaluation systems in Chicago, LA, and everywhere else.

They also could have genuinely reassured teachers anxious about the use of test score gains in teacher evaluations. MET could have allayed those concerns by telling teachers that test score gains produce information that is generally similar to what is learned from well-conducted classroom observations, so there is no reason to oppose one and support the other. What’s more, significantly improved predictive power from a mixture of classroom observations with test score gains could have made the case for why we need both.

MET was also supposed to have helped us adjudicate among several commonly used rubrics for classroom observations so that we would have solid evidence for preferring one approach over another. Because MET found that classroom observations in general are barely related to other indicators of teacher effectiveness, the study told us almost nothing about the criteria we should use in classroom observations.

In addition, the classroom observation study was supposed to help us identify the essential components of effective teaching. That knowledge could have informed improved teacher training and professional development. But because MET was a flop (because classroom observations barely correlate with other indicators of teacher effectiveness and fail to improve the predictive power of a combined measure), we haven’t learned much of anything about the practices that are associated with effective teaching. If we can’t connect classroom observations with effective teaching in general, we certainly can’t say much about which of the observed aspects of teaching contributed most to effective teaching.

Just so you know that I’m not falsely attributing to MET these goals that failed to be realized, look at this interview from 2011 of Bill Gates by Jason Riley in the Wall Street Journal. You’ll clearly see that Bill Gates was hoping that MET would do what I described above. It failed to do so. Here is what the interview revealed about the goals of MET:

Of late, the foundation has been working on a personnel system that can reliably measure teacher effectiveness. Teachers have long been shown to influence students’ education more than any other school factor, including class size and per-pupil spending. So the objective is to determine scientifically what a good instructor does.

“We all know that there are these exemplars who can take the toughest students, and they’ll teach them two-and-a-half years of math in a single year,” he says. “Well, I’m enough of a scientist to want to say, ‘What is it about a great teacher? Is it their ability to calm down the classroom or to make the subject interesting? Do they give good problems and understand confusion? Are they good with kids who are behind? Are they good with kids who are ahead?’

“I watched the movies. I saw ‘To Sir, With Love,'” he chuckles, recounting the 1967 classic in which Sidney Poitier plays an idealistic teacher who wins over students at a roughhouse London school. “But they didn’t really explain what he was doing right. I can’t create a personnel system where I say, ‘Go watch this movie and be like him.'”

Instead, the Gates Foundation’s five-year, $335-million project examines whether aspects of effective teaching—classroom management, clear objectives, diagnosing and correcting common student errors—can be systematically measured. The effort involves collecting and studying videos of more than 13,000 lessons taught by 3,000 elementary school teachers in seven urban school districts.

“We’re taking these tapes and we’re looking at how quickly a class gets focused on the subject, how engaged the kids are, who’s wiggling their feet, who’s looking away,” says Mr. Gates. The researchers are also asking students what works in the classroom and trying to determine the usefulness of their feedback.

Mr. Gates hopes that the project earns buy-in from teachers, which he describes as key to long-term reform. “Our dream is that in the sample districts, a high percentage of the teachers determine that this made them better at their jobs.” He’s aware, though, that he’ll have a tough sell with teachers unions, which give lip service to more-stringent teacher evaluations but prefer existing pay and promotion schemes based on seniority—even though they often end up matching the least experienced teachers with the most challenging students.

The final MET reports produced virtually nothing that addressed these stated goals. But in Orwellian fashion, the Gates folks have declared the project to be a great success. I never expected MET to work because I suspect that effective teaching is too heterogeneous to be captured well by a single formula. There is no recipe for effective teaching because kids and their needs are too varied, teachers and their abilities are too varied, and the proper matching of student needs and teacher abilities can be accomplished in many different ways. But this is just my suspicion. I can’t blame the Gates Foundation for trying to discover the secret sauce of effective teaching, but I can blame them for refusing to admit that they failed to find it. Even worse, I blame them for distorting, exaggerating, and spinning what they did find.

(edited for typos)

research reports | Tagged: gates foundation, Measuring Effective Teachers
Posted by Jay P. Greene

Understanding the Gates Foundation’s Measuring Effective Teachers Project

January 9, 2013

If I were running a school I’d probably want to evaluate teachers using a mixture of student test score gains, classroom observations, and feedback from parents, students, and other staff. But I recognize that different schools have different missions and styles that can best be assessed using different methods. I wouldn’t want to impose on all schools in a state or the nation a single, mechanistic system for evaluating teachers since that is likely to be a one size fits none solution. There is no single best way to evaluate teachers, just like there is no single best way to educate students.

But the folks at the Gates Foundation, afflicted with PLDD, don’t see things this way. They’ve been working with politicians in Illinois, Los Angeles, and elsewhere to centrally impose teacher evaluation systems, but they’ve encountered stiff resistance. In particular, they’ve noticed that teachers and others have expressed strong reservations about any evaluation system that relies too heavily on student test scores.

So the folks at Gates have been trying to scientifically validate a teacher evaluation system that involves a mix of test score gains, classroom observations, and student surveys so that they can overcome resistance to centrally imposed, mechanistic evaluation systems. If they can reduce reliance on test scores in that system while still carrying the endorsement of “science,” the Gates folk imagine that politicians, educators, and others will all embrace the Gates central planning fantasy.

Let’s leave aside for the moment the political reality, demonstrated recently in Chicago and Los Angeles, that teachers are likely to fiercely resist any centrally imposed, mechanistic evaluation system regardless of the extent to which it relies on test scores. The Gates folks want to put on their lab coats and throw the authority of science behind a particular approach to teacher evaluation. If you oppose it you might as well deny global warming. Science has spoken.

So it is no accident that the release of the third and final round of reports from the Gates Foundation’s Measuring Effective Teachers project was greeted with the following headline in the Washington Post: “Gates Foundation study: We’ve figured out what makes a good teacher,” or this similarly humble claim in the Denver Post: “Denver schools, Gates foundation identify what makes effective teacher.” This is the reaction that the Gates Foundation was going for — we’ve used science to discover the correct formula for evaluating teachers. And by implication, we now know how to train and improve teachers by using the scientifically validated methods of teaching.

The only problem is that things didn’t work out as the Gates folks had planned. Classroom observations make virtually no independent contribution to the predictive power of a teacher evaluation system. You have to dig to find this, but it’s right there in Table 1 on page 10 of one of the technical reports released yesterday. In a regression to predict student test score gains using out-of-sample test score gains for the same teacher, student survey results, and classroom observations, there is virtually no relationship between test score gains and either classroom observations or student survey results. In only 3 of the 8 models presented is there any statistically significant relationship between either classroom observations or student surveys and test score gains (I’m excluding the 2 instances where they report p < .1 as statistically significant). And in all 8 models the point estimates suggest that a standard deviation improvement in classroom observation or student survey results is associated with less than a .1 standard deviation increase in test score gains.

Not surprisingly, a composite teacher evaluation measure that mixes classroom observations and student survey results with test score gains is generally no better and sometimes much worse at predicting out of sample test score gains. The Gates folks trumpet the finding that the combined measures are more “reliable” but that only means that they are less variable, not any more predictive.

But “the best mix” according to the “policy and practitioner brief” is “a composite with weights between 33 percent and 50 percent assigned to state test scores.” How do they know this is the “best mix?” It generally isn’t any better at predicting test score gains. And to collect the classroom observations involves an enormous expense and hassle. To get the measure as “reliable” as they did without sacrificing too much predictive power, the Gates team had to observe each teacher at least four different times by at least two different coders, including one coder outside of the school. To observe 3.2 million public school teachers for four hours by staff compensated at $40 per hour would cost more than $500 million each year. The Gates people also had to train the observers at least 17 hours and even after that had to throw out almost a quarter of those observers as unreliable. To do all of this might cost about $1 billion each year.
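The $500 million figure is simple multiplication, sketched below. It assumes that "four hours" means four one-hour observations per teacher per year and that observer pay is the only cost counted. The roughly $1 billion total, which also covers observer training and discarded observers, is the post's own estimate and is not derived here.

```python
# Inputs taken from the paragraph above.
PUBLIC_SCHOOL_TEACHERS = 3_200_000
OBSERVATION_HOURS_PER_TEACHER = 4   # assumed: four one-hour observations a year
OBSERVER_PAY_PER_HOUR = 40          # dollars

# Observer pay alone, before training time or unreliable observers.
annual_observation_cost = (
    PUBLIC_SCHOOL_TEACHERS
    * OBSERVATION_HOURS_PER_TEACHER
    * OBSERVER_PAY_PER_HOUR
)

print(f"${annual_observation_cost:,} per year")
```

That comes to $512 million a year in observer pay alone, which is where "more than $500 million" comes from.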

And what would we get for this billion? Well, we might get more consistent teacher evaluation scores, but we’d get basically no improvement in the identification of effective teachers. And that’s the “best mix?” Best for what? It’s best for the political packaging of a centrally imposed, mechanistic teacher evaluation system, which is what this is all really about. Vicki Phillips, who heads the Gates education efforts, captured in this comment what I think they are really going for with a composite evaluation score:

Combining all three measures into a properly weighted index, however, produced a result “teachers can trust,” said Vicki Phillips, a director in the education program at the Gates Foundation.

It’ll cost a fortune, it doesn’t improve the identification of effective teachers, but we need to do it to overcome resistance from teachers and others. Not only will this not work, but in spinning the research as they have, the Gates Foundation is clearly distorting the straightforward interpretation of their findings: a mechanistic system of classroom observation provides virtually nothing for its enormous cost and hassle. Oh, and this is the case when no stakes were attached to the classroom observations. Once we attach all of this to pay or continued employment, their classroom observation system will only get worse.

I should add that if classroom observations aren’t useful as predictors, they also can’t be used effectively for diagnostic purposes. An earlier promise of this project was that they would figure out which teacher evaluation rubrics were best and which sub-components of those rubrics were most predictive of effective teaching. But that clearly hasn’t panned out. In the new reports I can’t find anything about the diagnostic potential of classroom observations, which is not surprising since those observations are not predictive.

So, rather than having “figured out what makes a good teacher” the Gates Foundation has learned very little in this project about effective teaching practices. The project was an expensive flop. Let’s not compound the error by adopting this expensive flop as the basis for centrally imposed, mechanistic teacher evaluation systems nationwide.

(Edited for typos and to add links. To see a follow-up post, click here.)

22 Comments | research reports | Tagged: gates foundation, Measuring Effective Teachers | Permalink
Posted by Jay P. Greene

Head Start Manipulating Scumbags

December 20, 2012

I’ve heard that the latest round of results from the federal evaluation of Head Start is due to be released tomorrow afternoon. And my psychic powers tell me that the results will show no lasting benefit from Head Start, just like the two previous rounds of results.

You heard that right — the federal government is releasing results that the administration dislikes on a Friday afternoon just before Christmas. They might as well put the results on display in a locked filing cabinet in a disused lavatory behind the sign that says “beware of the leopard.”

Why is the Department of Health and Human Services burying this study just like they delayed, buried, or distorted the previous ones? Well, because the study is an extremely rigorous and comprehensive evaluation, involving random assignment of a representative sample of all Head Start students nationwide, that I expect will find no enduring benefits from this program that politicians, pundits, and other dimwits constantly want to expand and fund. Anyone who casts doubt on think tank research should cast a critical eye toward gross manipulations and abuse of research that are perpetrated by the federal government.

I should repeat that the researchers have done an excellent job evaluating Head Start in this case. It is the bureaucratic class at the Department of Health and Human Services who have cynically manipulated, delayed, and misreported this research. The pending report is already delayed several years and has been around for a long time. The decision to release it on the Friday afternoon before Christmas is completely calculated.

I don’t know your names, but I’m going to invest a little energy in tracking down who is responsible for this cynical abuse of research. If there were any reporters worth their salt left out there, they would bother to expose you, but I guess that job has now been passed to bloggers and enterprising individuals. When I do find your names I will post them so folks can know who the scumbags are who think they can manipulate the policy community by delaying, burying, or misreporting research. And then when you get hired by that DC think tank, advocacy organization, or other waste of space we’ll be able to remember who you are and assign no credibility to what you have to say. These kinds of dastardly acts by public servants should not be cost-free, and if I have any say in the matter they will not be in this case.

29 Comments | politics, research reports | Tagged: Head Start, Head Start evaluation | Permalink
Posted by Jay P. Greene
