Another School Choice Random Assignment Study

August 23, 2012

A student just brought to my attention another random-assignment experiment on school choice that was recently released.  The results are, again, positive.  The study was conducted by Justine Hastings, Christopher Neilson, and Seth Zimmerman of Brown and Yale Universities and was released by NBER.  I’ll let their abstract explain the findings:

Using data on student outcomes and school choice lotteries from a low-income urban school district,
we examine how school choice can affect student outcomes through increased motivation and personal
effort as well as through improved school and peer inputs. First we use unique daily data on individual-level
student absences and suspensions to show that lottery winners have significantly lower truancies after
they learn about lottery outcomes but before they enroll in their new schools. The effects are largest
for male students entering high school, whose truancy rates decline by 21% in the months after winning
the lottery. We then examine the impact attending a chosen school has on student test score outcomes.
We find substantial test score gains from attending a charter school and some evidence that choosing
and attending a high value-added magnet school improves test scores as well. Our results contribute
to current evidence that school choice programs can effectively raise test scores of participants. Our
findings suggest that this may occur both through an immediate effect on student behavior and through
the benefit of attending a higher-performing school.

School choice has to be the most rigorously studied education policy and has a relatively consistent set of positive findings.


New Study on Vouchers and College Enrollment

August 23, 2012

Matt Chingos and Paul Peterson have a new study based on a random-assignment experiment of the effects of privately funded vouchers on college enrollment.  They followed the students who had received a private school scholarship in New York City.  Earlier research found significant achievement benefits from that program for African American students but there was some controversy over whether those findings were robust to reasonable alternative specifications.

Now the students are old enough for college and Chingos and Peterson went back to see if achievement gains translated into higher college enrollment.  They did.  African American students who received a private school scholarship were 24% more likely to enroll in college.  The improvement was more dramatic in the chances that African American students would attend private 4 year colleges and even selective private colleges.

No significant effects were observed for Latino students nor for the very few white students in the sample.  It appears that choice has the biggest effect on those whose options would have been the worse in the absence of a choice program. It is also worth remembering that the scholarship was good for only half of private school tuition, so it is possible that a more generous program would have a broader effect.

You can read Chingos and Peterson’s summary of their findings in the Wall Street Journal.


Teacher Compensation Debate in Education Next

August 21, 2012

Education Next has an excellent debate about teacher compensation.  On one side Jason Richwine and Andrew Biggs argue that teacher compensation is significantly higher than similarly skilled workers in other occupations.  Lawrence Mishel and Joydeep Roy are on the other side arguing that teacher compensation is lower than for comparable workers.  It’s a great debate, so be sure to check it out.


Reforming Gates

August 14, 2012

In my last post I wrote about the pattern at the Gates Foundation of abusing the idea of “research” and “evidence” to advance its education policy agenda.  Gates has an organizational culture that permits intellectual corruption.  There are good people at Gates doing good work, but there is something rotten about the organization that needs to be changed if they hope to succeed over the long run.

In addition to their abuse of research and evidence, the Gates Foundation suffers from a bloated staff and paralyzing bureaucracy.  As their 990 tax filings show, their assets doubled over the last decade, but their staffing levels increased ten-fold — even more rapidly than the increase in assets as Buffett adds his money to Gates to create a philanthropic Leviathan.  They have so many people that they needed to build the $500 million palace pictured above to hold all of them.

But with huge size, staffing, and wealth comes the huge danger of corruption.  If an organization becomes bloated, inefficient and corrupt in the profit-seeking sector, the possibility of a hostile takeover can help check or eliminate abuses.  But in the non-profit sector there are no corporate raiders.  No outside shareholders can come in to take over the Gates Foundation, sell off its over-priced facilitates, cut staffing, reduce corruption and focus on the core mission.

Instead, non-profits need to check the danger of corruption that comes with wealth and power in the same way that governments do — by creating institutional constraints, dividing power, and pitting ambition against ambition.  In short, non-profits need a Constitution.

Specifically, the Gates Foundation has just become too damn big for its own good.  It’s so big and powerful that just about everyone in the education policy world gets money from them or hopes to.  It’s so big that everyone within the organization is too eager to gain control over it, causing in-fighting and the need for rigid top-down controls.  It’s so big that they can indulge foolish ideas and make irresponsible claims without fear of consequences.

One way Gates could check these problems is to divide its education unit into a Team A and Team B, each of which would operate independently with its own theory of action and reform agenda.  The different Teams within Gates could then compete with each other to develop and pursue the best reform ideas.  They could also help keep each other honest by having an interest to detect, reveal, and stop any intellectual dishonesty in the other.

Many people wrongly believe that organizations function best when they achieve greater scale and are streamlined, but this is incorrect in the peculiar world of government and non-profit organizations.  As Patrick Wolf and James Q. Wilson’s work on bureaucracy shows, redundancy within government can be a desirable arrangement.  Having the FBI, DEA, and ATF all chasing drug dealers is beneficial because they compete with each to do the best job and win a larger share of congressional appropriations.  Redundancy can simulate the choice and competition of the private market.

Similarly the division of power and responsibility between local, state, and federal governments as well as between the different branches of each government helps check abuse and corruption while providing some of the positive effects of choice and competition.  Smart non-profits should likewise develop a policy to split themselves into smaller competing units once they reach a certain size.

Another institutional arrangement that might help right the ship at Gates is to develop an independent internal research department that reports directly to the board and not to the heads of any of its programmatic units.  A research department that does not report to the programmatic unit heads is less likely to feel compelled to verify the wisdom of the paths chosen by the programmatic units.

In addition, non-profits like Gates should develop a policy that prevents them from ever conducting public evaluations of their own projects internally.  The research unit’s responsibilities should be limited to contracting out research to independent third parties and then reporting the results to the board.  One of the real problems with the MET project at Gates is that is was funded and conducted internally by Gates.  That made it very hard for them to report that the project had failed to find what they had hoped.  You shouldn’t be the judge and jury in your own case.

Lastly, changing the organizational culture to one that gives primacy to intellectual integrity requires cleaning house at the leadership level.  Vicki Phillips, the head of the education unit, has to go.  She has repeatedly mis-described the findings of their own research.  I’m not sure whether it is because she does not understand the research or because she doesn’t care about being accurate (and I’m not sure which would be worse), but you can’t effectively lead an organization if you can’t honestly describe your own research.  It might be good for Gates to consider appointing a well-respected scholar to head its education units, just as the Carnegie Foundation did when it selected University of Chicago researcher, Anthony Bryk, as its president.

Tom Kane, who until recently served under Vicki Phillips, brought impressive research credentials to the table but unfortunately has chosen to compromise those credentials.  Kane is an incredibly capable and accomplished researcher, but even the best can be tarnished.  Gates needs to cut all remaining ties with Kane to set the example that accurate and honest reporting of research is of primary value.

Of course, none of this can or will happen unless Bill Gates wants it to.  Perhaps Gates himself is the source of the problem.  If that’s true, then no organizational or staffing change will improve the situation.  But I suspect that Gates does not want to see his wealth squandered.  He doesn’t want to be the next Walter Annenberg, whose $500 million “Challenge” ultimately “had little impact on school improvement and student outcomes…”

The MET project is already almost $400 million that Gates has spent with little to show for it.  I don’t think Gates wants to keep doing that.  And even if his internal research declares success and others are too timid to publicly question that claim, over the long run (especially when the money stops flowing in this direction) people will think of Gates as they think of Annenberg — as someone who failed to use his enormous wealth for positive effect in education.

In the end, this is all up to Bill Gates.  He can choose to make a giant bonfire of his fortune by squandering it on palatial buildings, excessive staffing, and foolish enterprises whose failure is only temporarily disguised by dishonesty.  Or he can choose to make the organizational and staffing changes necessary to get the Gates Foundation back on track.

(edited for some typos)


Confusing Evidence and Politics

August 13, 2012

In education reform debates it is far too common to hear someone say “the evidence shows” something that is just their preferred policy that is not supported by research at all.  People confuse what makes good sense and is good politics with what is actually supported by evidence.  At the Gates Foundation this problem is endemic.  They have repeatedly confused evidence and politics.

I think I can clearly illustrate this confusion of evidence and policy preference at the Gates Foundation in the most recent article by Tom Kane in Education Next summarizing the Measuring Effective Teachers (MET) project results.   MET is an ambitious project to record several thousand classroom lessons, survey students, and administer multiple standardized tests to identify the best way to measure teacher effectiveness and eventually identify teaching practices that are associated with greater learning.  The study costs $45 million on top of the $335 million reported cost of implementing the program in several school districts.

The main claimed finding of MET at this point is that combining classroom observation and student survey scores with student achievement gains is the best way to measure teacher effectiveness.  As Kane writes, “the evidence reveals that…  rather than rely on any single indicator, schools should try to see effective teaching from multiple angles.”  I’m willing to agree with Kane that using multiple measures of teacher effectiveness is supported by political wisdom and sound theory, but the evidence they produced does not demonstrate the merits of multiple measures.

Kanes summarizes “the case for multiple measures” in the second to last section of his article.  He states, “First, combining [multiple measures] generates less volatility from course section to section or year to year, and greater predictive power.” The results don’t exactly support this claim.  As can be seen below in Figure 1 reproduced from his article and in Table 16 on p. 51 of the Measuring Effective Teachers report, an equally weighted combination of student achievement gains with classroom observation scores and student survey results actually lowers predictive power.  You are better at predicting teacher value added in a class just by using the teacher value added measure from another class than by combining that achievement gain measure equally with classroom observation scores and student survey responses, which is the opposite of “combining them generates… greater predictive power.”

The only way using multiple measures could have a roughly equal predictive power to achievement gains alone is if they are combined such that achievement gains constitute 75.8% of the combined measure, with only 4.2% of the combined measure coming from classroom observation scores and 20.0% coming from student surveys.

But there are significant difficulties and costs associated with collecting the classroom observation scores.  Every observer had to receive 17 to 25 hours of training and even after that 23% of the observers had to be excluded for lack of reliability.  And then as Kane acknowledges: “Even with trained raters, we had to score four lessons, each by a different observer, and average those scores to get a reliable measure of a teacher’s practice. Given the high opportunity cost of a principal’s time, or the salaries of professional peer observers, classroom observations are the costliest source of feedback.”  All of this was necessary for a measure that constituted 4.2% of a combined measure with about the same predictive power as forgetting about classroom observations and just using achievement gains.

Someone reviewing this evidence who was not already committed to the policy of using multiple measures would obviously conclude that classroom observations were not worth the significant expense and bother.  The conclusion Kane and Gates offer is not driven by “the evidence” but by their preference for a policy that is based on other political and theoretical reasons.

I should note that the increase in reliability from combining measures of teacher effectiveness provides little consolation.  Kane measures reliability as the correlation of the evaluation score from class to class for the same teacher.  You could improve reliability without improving or even while hurting predictive power simply by adding another  variable to the combined measure.  It would be more consistent (reliable), but it would be more consistently wrong.

Kane then offers another argument for combining measures: “A second reason to combine the measures is to reduce the risk of unintended consequences, to lessen the likelihood of manipulation or ‘gaming.’ Whenever one places all the stakes on any single measure, the risk of distortion and abuse goes up.”  This makes very good sense and is a persuasive argument.  The only problem is that it is not in any way derived from “the evidence” produced by their study.  It’s just a sound theoretical argument.  Kane and Gates shouldn’t say “the evidence” supports multiple measures when they aren’t actually relying on evidence to make their claim.  They didn’t need to spend almost $400 million to implement and study MET to make this point.

And finally, Kane suggests that “[t]here is a third reason to collect multiple measures: conflicting messages from the multiple sources of information send a signal to supervisors that they should take a close look at what’s going on in the classroom.”  Again, this is a theoretical argument rather than from any evidence the study collected.  And unfortunately, the evidence from the study suggests that a combined message will send “conflicting messages” almost all of the time.  The correlation between classroom observation or student survey scores and achievement gains was no higher than .13.  With such a low correlation, administrators will very often see differences between teacher effectiveness as measured by each of the three types of measures.  Kane might as well suggest that supervisors should always take a close look at every teacher.

I’m inclined to agree with Kane and Gates that it is better to use multiple measures when evaluating teacher effectiveness.  I just don’t see how “the evidence” does anything to support this view.  The argument for multiple measures is largely theoretical and political.  Theory suggests that a single measure is more subject to manipulation and unwanted distortion in teacher behavior.  And politics suggests that teachers will be more resistant to any system that is based solely on test scores.  These are all fine reasons for supporting multiple measures, we just shouldn’t debase the currency of research by falsely claiming that they are supported by the evidence when the evidence shows no such thing.  They are just confusing evidence and policy preferences justified by considerations that have nothing to do with research. The truth is that MET was a very expensive effort that failed to produce the evidence they wanted, but in Orwellian fashion they declare victory and still say it supports their pre-determined conclusion.

Unfortunately, there is a pattern at  Gates of this abuse of “evidence” and “research” to support preferred policies.  I’ve written a number of posts in the past detailing this problem at Gates.  In addition to spinning the multiple measures claim, I’ve pointed out that they falsely claimed that their student survey results showed that “[t]eaching to the test makes your students do worse on the tests.”  I noted that Gates was again indifferent to evidence when they abandoned their small schools strategy without waiting for the results of a random-assignment evaluation that ultimately showed that small schools were effective.  And Gates has backed the push for Common Core standards with phony science.

Unlike other critics of the Gates Foundation, I am not motivated by the belief that it is illegitimate for billionaires to use their wealth to try to advance education reform.  On the contrary, I’ve focused on Gates because I believe they are squandering their great potential to have a positive impact.  I’d like them to do better.

But even more importantly, I’ve harped on these abuses of “evidence” and “research” to advance the Gates policy agenda because I fear that Gates is undermining the use of real evidence and research by others to positively influence policy.  I understand that people and organizations can favor policies without having the evidence to prove their merit.  But I cannot understand or accept abusing the idea of evidence and research to advance preferred policies.  Doing so ruins the use of evidence by everyone by feeding the cynical belief that all research is just a way to manipulate others to get what you want.  The more that the general credibility of research and evidence are damaged, the more that policy outcomes will be determined by the brute power of involved interests, which means that the unions are more likely to prevail.  A belief in research and evidence is the only way for weaker interests to triumph, so it is essential that the ed reform movement not debase their own currency.

I still have hope that Gates can right their ship.  It won’t be easy, but they can take important steps to change their organizational culture and structure so that they do not repeatedly abuse claims of research and evidence for their policy preferences.  In the next post, I’ll explain what they should do to reform themselves.

(Edited to correct typo)


The Way of the Future in Ed Reform Advocacy

August 11, 2012

Matt has been a leader in noting how technology will change the way we educate students in the future.  But technology is already fundamentally changing how people advocate for their preferred reforms.  Documentaries and movies are displacing print forms of advocacy at a rapid clip.

We’ve seen documentaries like Waiting for Superman and Race to Nowhere have far greater impact than any blog, article, or book.  And now dramatic films, like Won’t Back Down are making the case for parent trigger laws more powerfully than any print argument.  For better or worse, ed reform is going Hollywood.

In part this shift of ed reform advocacy to film is a manifestation of my earlier argument that the intellectual debate over the broad principles of education reform is over.  A broad consensus among elites has developed that lack of resources is not the central problem with our education system and that simply pouring more money into schools will have little effect.  There is also a broad consensus that parents should have some choice in where their children go to school and that those choices are not only fair to parents and children but also the competition they produce will help improve schools.  These ideas have been found in speeches given by President Obama, in the Democratic Party’s platform, and in liberal establishment newspapers like the Washington Post and not just in the conferences organized by the American Enterprise Institute.  And the collection of athletes and other celebrities joining the ed reform party is rapidly growing.

In addition, the teacher unions are finally being treated as the special interest group they are rather than as credible players in the discussion over the merits of various education policies.  When Campbell Brown takes on the unions the game is over.   

Of course, the unions are still quite powerful and the battles over each policy and the regulations that are appropriate will continue for a long time, but the big intellectual war over ed reform is over.  Similarly, Brown v. Board of Education marked the end of the big intellectual war over racial equality in America, but the battles over the best policies to promote equality have and will continue to rage.

The end of the big intellectual war over education reform has opened the door to Hollywood’s elites to join the fray.  Guess Who’s Coming to Dinner came along after the dust from Brown had settled, not before.  Similarly, the wave of Hollywood films on ed reform is just starting.

And it’s not just Hollywood that’s getting into the ed reform act.  Last night I watched a Bollywood film, Three Idiots, that makes the case for a more student-centered education.  I’m not saying that it is a great film or that it’s argument is well-made.  I’m just saying that technology is being brought to ed reform advocacy and movies are playing an increasingly important role.  And it is worth noting that Three Idiots broke records for the highest grossing Bollywood opening and highest overseas revenues.

You can watch the entire movie for free on YouTube, but here was the most entertaining part.  Don’t worry about the lack of subtitles in the clip since the words don’t really matter.  Once people can see the beautiful colors and fun of ed reform advocacy in a film, why will they ever read a blog post again?


McShane on Louisiana Teacher Union Thuggery

August 1, 2012

One of our super best friends, AEI’s new hotshot, Michael McShane, has a piece in National Review Online on teacher union efforts in Louisiana to intimidate private schools from participating in the state’s voucher program with a letter to each  threatening that those schools will individually be sued if they participate.  This despite the unions losing the first round in courts to stop the program.  If they can’t win in court, the unions hope to at least scare schools away from offering opportunities to kids the threat of legal harassment.

Here’s a taste of McShane’s piece:

While the union’s behavior is disgusting, it certainly isn’t shocking.

Unfortunately, this is just another example of unions choosing to harass educators when they have lost a political or legal battle. It was just about a year ago that the pro-union protesters who were attempting to recall Scott Walker put on a shameful display in Wisconsin. At Messmer Prep, a private non-union school in Milwaukee, protesters super-glued doors shut, creating a fire hazard that endangered children; they berated Brother Bob Smith, the school’s leader and an anchor to Milwaukee’s black community; and they denigrated the school’s teachers who had done nothing more than show up to work that day.

If unions do not like vouchers, there are plenty of outlets for them to voice their distaste. If they wish to change the law, they should lobby their legislators. If they don’t like the governor, they should try and vote him out in the next election. If they don’t like the way the court rules, they should camp outside in protest. Any of these remedies are well within both their rights and the scope of appropriateness and decency.

What they shouldn’t do is badger, demean, or harass people that are working hard every day to educate children. When they do that, they look less like an organization with the best interests of children in mind and more like a power-hungry interest group that will stop at nothing to maintain its hegemony.

I’ll pre-empt Ladner…  BOOOOOOOOOM!!!!!!!!!!!!


The Dark Knight Rises: The Mythology of Our Time

August 1, 2012

I’ve been on a Greek mythology kick this summer.  If the desire to create stories, like those of classical mythology, is universal, what are the myths of today?   I’d argue that superhero stories are the modern equivalent of classical mythology.

They are basic stories and characters that are familiar to almost all of us.  The artist doesn’t invent the characters or their stories, he provides his own twist with his own telling of these familiar stories.  Similarly, Sophocles did not invent the story of Oedipus, Euripedes did not invent the story of Iphigenia, etc… Each play or each telling of an epic poem was like each “re-boot” of the Batman, Spiderman, or Superman sagas — changing the emphasis and minor plot points in order to create a new meaning from a familiar character and story.

Greg has suggested this connection between modern superhero stories and classical mythology by trying to connect current directors and writers with their ancient equivalents.  But I want to take the point even further.  Not only are the modern makers of superhero movies like the playwrights of antiquity, their stories serve the same purpose for us and do so in very similar ways.  Neither the new Batman series nor others attempt to capture realism in their plots.  As real as they make the action and special effects, the plots and characters are obviously unrealistic.  We only accept them because they fit within the genre of a hero story — with their defining flaws, archetype villains, and endurance for suffering and sacrifice.  As ironic and post-modern as we like to think of ourselves, we are as willing to suspend disbelief for hero stories as were the ancients.

In addition, the plots of the Christopher Nolan Batman as well as other superhero sagas are designed to make sense of the world and offer some moral guidance, just as ancient myths did.  (SPOILER ALERT)  The Nolan Batman evokes images of our post 9-11 experience with terror, the need for security, and the price we pay for that need.

In his earlier posts, Greg suggested that the message of Nolan’s Dark Knight is that political and social order may require a lie.  The new movie makes clear that lies have their consequences and are ultimately self-defeating.  And as the earlier Dark Knight films emphasized the need not to be paralyzed by fear, the current movie suggests the opposite danger of being completely without fear.  And in earlier movies we learned that the rich and powerful were fundamentally corrupt, but in the new movie we see that rule by The People is at least as horrifying.  And a final paradox– in previous movies Batman learned that his success requires not trusting others because they are unreliable, but in the current movie we see that success ultimately requires trusting others despite their unreliability.

Perhaps the moral of Nolan’s Batman is also drawn directly from the Greeks.  We know that “Carved into the temple [to Apollo at Delphi] were three phrases: γνῶθι σεαυτόν (gnōthi seautón = “know thyself“) and μηδέν άγαν (mēdén ágan = “nothing in excess”), and Ἑγγύα πάρα δ’ἄτη (eggýa pára d’atē = “make a pledge and mischief is nigh”),[10]   I’m not sure Nolan could have summarized the messages of his Batman trilogy more succinctly.


New Social Promotion Study

July 25, 2012

Marcus Winters and I have a new study on the effects of ending social promotion in Florida that appeared this month in the journal, Education Finance and Policy.  In our earlier published research we observed that retained students made greater academic gains in subsequent grades than did promoted students who were just like them.  But we could only track students for 2 years after the intervention, so we didn’t know if the benefits we observed compounded or faded over time.

In the new study we track students for as many as 5 years after retention.  The benefits of the policy do diminish, but they remain statistically significant and educationally substantial through middle school.  We hope to continue tracking these students through high school, graduation, and even college, but so far it looks like there are enduring benefits to ending social promotion.


President Bush Discusses Global Report Card

July 19, 2012

Last fall Josh McGee and I developed the Global Report Card (GRC) for the George W. Bush Institute. The GRC is a tool that allows people to compare the level of academic achievement in virtually every school district in the United States to the average for their state, the country, and a comparison group of 25 industrialized countries.

Above is a new interview with President Bush in which he discusses the Global Report Card (it’s around minute 25).

The Global Report Card received a fair amount of coverage when it was released, but keep your eyes out for an updated and improved version this upcoming fall.  The results of the GRC are consistent with other international comparisons, including a series of pieces by Eric Hanushek, Paul Peterson, and Ludger Woessmann (the most recent of which can be found here).  But the GRC goes a step further by allowing comparisons to be made at the school district  level.  GRC 2.0 will also have some new features and comparisons that people might find useful.