Florida’s Grammar Controversy

May 24, 2012

(Guest Post by Lindsey Burke)

In the most recent administration of the Florida Comprehensive Assessment Test (FCAT) – the state’s criterion-referenced assessment of student achievement – Florida students were asked to pay a little more attention to punctuation, grammar, and spelling in order to get a passing grade on the writing assessment. FCAT cut scores were to reflect that, with proficiency status awarded to those students who could meet the requirements of the new grammar-sensitive assessment.

This rather trivial change has set off a firestorm in the Sunshine State, which just released this year’s FCAT scores, graded under the more rigorous standards.

In 2011, a whopping 81 percent of Florida’s fourth graders scored a 4 or better on the writing portion of the FCAT. Just 27 percent of the youngsters scored proficient under the more rigorous standards this year. Eighth and tenth graders saw similar declines.

The dramatic drop prodded the state board of education to revise the cut scores downward, temporarily dropping the passing mark from 4 to 3 (out of a possible 6 points).

Over the past decade, Florida has made dramatic gains in academic achievement. Florida skyrocketed from 5th worst in reading performance on the NAEP in 1998 to 8th best by 2007, significantly increased the number of students who take and pass AP exams, and began to narrow the achievement gap between white and minority students (Black and Hispanic students in Florida had twice the reading gains of the national average from 1998 to 2009). But evidence suggests that progress the Sunshine State had begun to taper out, with students plateauing on recent assessments.

Keen to ensure student achievement continued apace, Florida proactively raised the rigor of the FCAT – something they’ve done every other year or so since Gov. Jeb Bush’s A-PLUS plan was implemented. According to Commissioner Gerard Robinson, the board of education “asked scorers to grade essays more strictly, with an eye to punctuation, grammar and the quality of word choice and relevance.”

As Florida reels under the draconian requirements of – gasp! – punctuation awareness in a writing assessment, there’s a lesson to be learned for federal and state policymakers eager to adopt national standards and tests.

The backlash against Florida’s efforts to improve the rigor of the FCAT begs the question: what is the correct level of rigor for the 46 states that have adopted Common Core national standards that will not elicit similar reactions? We have yet to learn where the Common Core central planners will set their cut scores, or how they plan to go about setting passing marks on which both Alabama and Massachusetts will agree.

It is a cautionary tale for national standards proponents. Much of Florida’s success over the past decade can be attributed to the state continuously improving its standards and tests. With rigid national standards in place, that flexibility would be lost. And if mistakes are made in the standards, they’re here to stay.

Florida will likely succeed, as it has over the past few years, at striking the right balance on the FCAT. But being able to define what Florida students should know and be able to do, and crafting standards and tests to reflect that, will be lost if the state goes through with Common Core adoption.

Florida strengthened state tests to make sure kids could spell, apply punctuation, and grasp other grammar concepts. These are nuances the state will no longer be able to enjoy come 2014, when national standards and tests are to be fully implemented. The Sunshine State wants to continue its march to the top of the NAEP, and has been working to strengthen standards to achieve that goal. But that ability will soon be lost, which is the ultimate lesson that should be gleaned from the FCAT controversy.

(edited to fix a typo)


ChoiceMedia.TV Jabs Teacher Unions

May 22, 2012

Another video from ChoiceMedia.TV:


Pass the Popcorn: Truth and Consequences

May 21, 2012

I recently saw the Israeli film, Footnote, which was nominated for best foreign film in last year’s Oscars.  It reminded me of a movie from the 90s that I thoroughly enjoyed, called Big Night.  In both films we encounter a character who is committed to the truth of his craft.

In Footnote we meet the elder Prof. Eliezer Shkolnik, a philoligist whose painstaking, scholarly analysis of words in the Talmud led him to discover that the current version of the Talmud differs from the one that was in common use centuries ago.  Before he can publish his findings, he is scooped by a competing scholar who stumbles upon an ancient copy of the Talmud in an archive, thus proving the same point without the careful scholarship.  Eclipsed by this chance discovery, Prof. Eliezer Shkolnik, toils away in obscurity, bitter that his dedication to his craft remains unrecognized while flashy, lucky, and shallow scholars earn the laurels he believes he should be receiving.

One of those flashy, lucky, shallow scholars is Prof. Eliezer Shkolnik’s son, Prof. Uriel Shkolnik, who writes popular books about the Talmud and is a celebrity on TV and the lecture circuit (only in Israel could a Talmudic scholar be a celebrity, but think of him as an Israeli version of Malcolm Gladwell).  The simmering animosity and resentment between the elder and younger Prof. Shkolnik boils into a crisis when the father accidentally receives a prestigious award that was meant to be given to the son.  The father falsely believes that he has finally been recognized for his commitment to the truth of his craft and the son, who lacks his father’s zealous pursuit of the truth, would rather engineer a falsehood to save his father’s honor than take the prize himself.

Besides the painfully (and hilariously) accurate depiction of the pettiness and self-importance of much of academia, the movie raises difficult questions about how important the pursuit of truth really is.  Is it more important than family, harmony, or love?

The movie Big Night raised very similar questions.  Two brothers, Primo and Secondo, own an Italian restaurant that is completely committed to the truth of their craft.  The only problem is that they have no customers.  People don’t seem to appreciate the truth.  Watch this perfect scene in which a rare customer wants a side of pasta with her risotto to see the tension between giving the customer what she wants and remaining committed to the truth of their craft:

Meanwhile a competing Italian restaurant owner, Pascal, violates every truth of Italian cuisine but his restaurant is packed with customers.  He’s flashy and crude, but the customers seem to love it.  Pascal offers to help Primo and Secondo by bringing a celebrity to their restaurant for a Big Night, which he promises will put their place on the map.  The brothers pour every cent they have and all of their craft into the Big Night, but when the celebrity doesn’t show they are ruined.  Their hopes are raised and they throw the best party with the best Italian food, hoping to remain true to their craftwhile also succeeding, but then they are left with nothing — or nothing except the truth of their craft and the love of each other.  This is the morning after their Big Night.  (It’s long and without dialogue, but watch the whole thing since it’s incredibly powerful and touching, at least it was after seeing the whole movie):

After watching Big Night I was persuaded that remaining true to one’s craft was of primary importance.  Remaining true may cost us dearly, but it is all that we have.  After watching Footnote I’m not so sure about this anymore.  Truth at the expense of all else can be incredibly destructive.  In Big Night they kept both truth and love.  In Footnote truth comes at the expense of love.  Maybe it is love that is of greater importance.  Or maybe the singular pursuit of any virtue is dangerous.  We need truth and love, but neither completely.


Another “Reform School” Clip

May 11, 2012

Enjoy


Why I Favor Decentralized Governance of Education

May 8, 2012

Peter Meyer at the Fordham Institute asked me to contribute a piece to his Board’s Eye View blog to address The BIG Questions on school governance.  Here is what I sent him:

Being against greater national control over education policy is not the same as being for local school districts.  I appreciate Peter Meyer for giving me the opportunity in this space to explain what I am for when it comes to school governance.

Fundamentally, I am for parental control over the education of their children, so I guess that I am for as little governance over education as we can manage.  In my ideal world, which I’ve tried to explain and justify at greater length in this book chapter, parents would be given as much money as is minimally necessary to fulfill their obligation to educate their children and would choose the location, manner, and content of that education.  Since education is just a subset of all of the activities in which parents engage to raise their children to be productive adults, we should defer to parents as much in how they educate their children as how they raise those children more generally.  As long as parents do not neglect or abuse their children, the government should have as little role in education as is possible.

But we don’t live in my ideal world and I have no expectation that we will.  All that I can hope for is that we will inch closer to my ideal rather than further away from it.  With that in mind, I favor governance arrangements that facilitate greater parental choice and control over education over those that would reduce parental choice and control.

So, I have no particular love for local school districts.  They just more closely approximate parental choice and control than does granting more power over education to the state or national governments.  It would be even better in my view to abolish school districts and have every school be like a charter school – a publicly regulated school of choice that would choose its own method and content of education and would have to attract willing families to generate the revenue to pay for it.  But I understand the idea of abolishing school districts and having every school operate as a charter school is only slightly less unrealistic than a virtually unregulated world of parental choice and control.

As unrealistic as making every school a charter school may be, we have been inching in that direction.  A little more than two decades ago we had no charter schools.  Today charter schools constitute nearly 5% of all public schools and educate about 3% of all students.  And the expansion of parental choice and control has been even greater when one considers the fully array of choices that have been introduced over the last two decades, including vouchers, tax credit funded scholarships, virtual schools, inter-district choice, magnet schools, etc…  My ideal world may be an unattainable fantasy, but my vision of gradual progress toward that ideal has been a fairly accurate description of the trends over the last few decades.

But there are some people, primarily edupundits located within the DC beltway, who have very different fantasies about ideal governance arrangements.  Rather than shifting arrangements directly toward greater parental choice and control, they dream about measures granting greater control to state and national authorities.  They rightly point out the defects of local school districts, but they wrongly see the solution in greater centralization of power rather than in the expansion of parental choice and control.

Their justifications for increasing the power of state and national authorities over education are more like empty political slogans than actual intellectual arguments based on principle.  For example, we’ll hear some say that a decentralized system of education cannot meet our needs in the 21st century: “The system of schooling we have today is the legacy of the 19th century — and hopelessly outmoded in the 21st.”  Of course, representative democracy is also a legacy of the 18th and 19th centuries, but that doesn’t mean we need to dispense with it to meet the challenges of our brave new 21st century world.  Saying that the 21st century demands certain skills or governance arrangements is just sloganeering and manipulating people to submit to a proposal, not a real argument.

Some attempt to justify greater centralization in education by saying that our current system is too uncoordinated, contradictory, duplicative, and confusing.  We need the greater coherence, planning, and order that more centralized control can offer.  Do you notice how the central authorities in these proposals are always imagined to be highly competent and benevolent?  They never entertain the very real possibility that the central authority might be coherent, well-planned, and orderly in pursuing something awful.  Those attracted to central planning in education may want to consider how well economic central planning has turned out.

Some attempt to justify granting more power to state and national authorities by looking overseas and claiming that the highest achieving countries have more centralized governance arrangements.  Let’s ignore for a moment that these are not accurate descriptions of how many high-achieving countries have structured their governance – Canada and Australia, for example, are high achieving and have decentralized governance arrangements.  The more fundamental problem is that the “best practices” movement of imitating some of the practices of others who are successful fails to consider what actually caused others to be successful.  Just imitating some of what they do is like the Cargo Cults found in Pacific Islands following WW II, where locals believed that if they built imitations of planes, runways, and control towers, the cargo and plentiful goods that had arrived during the war would return.  They didn’t understand that imitating the trappings of an airport doesn’t cause cargo to arrive any more than imitating the trappings of other countries’ governance arrangements will cause high achievement.

Lastly, some advocates of centralization argue that you actually need to centralize certain things in order to facilitate better decentralized control over other things.  They describe this approach as “tight-loose,” where the central authority assumes greater control over determining and regulating the goals of education and local authorities are then given greater flexibility over the means for meeting those goals.  Of course, ends and means are not so easily separated.  Ends often dictate or at least constrain the selection of means.  In addition, in what fantasy world would the central authority carefully limit its role to setting and regulating ends once it is given authority over an issue?  At least I recognize that my fantasy of parental choice and control is unrealistic.

Dreaming about a world in which parents almost entirely control the education of their children at least provides me with a principle by which I can judge policy proposals.  I favor policies that move us closer to my ideal and oppose those that move us farther away.  But the advocates of greater centralization in education do not appear to be guided by any particular principle, or at least none that they are willing to articulate.  Instead, they seem to mostly spew empty political slogans to manipulate or bully us into ceding more power to central authorities.  I may not love local school districts, but I would prefer them over these central planning fantasies.


Charter Benefits Are Proven by the Best Evidence

May 7, 2012

It’s National Charter Schools Week, so here is the post I’ve written for the George W. Bush Institute Blog on the issue:

According to the Global Report Card, more than a third of the 30 school districts with the highest math achievement in the United States are actually charter schools.  This is particularly impressive considering that charters constitute about 5% of all schools and about 3% of all public school students.  And it is even more amazing considering that some of the highest performing charter schools, like Roxbury Prep in Boston or KIPP Infinity in New York City, serve very disadvantaged students.

As impressive and amazing as these results by charter schools may be, it would be wrong to conclude from this that charter schools improve student achievement.  The only way to know with confidence whether charters cause better outcomes is to look at randomized control trials (RCTs) in which students are assigned by lottery to attending a charter school or a traditional public school.  RCTs are like medical experiments where some subjects by chance get the treatment and others by chance do not.  Since the two groups are on average identical, any difference observed in later outcomes can be attributed to the “treatment,” and not to some pre-existing and uncontrolled difference.  We demand this type of evidence before we approve any drug, but the evidence used to justify how our children are educated is usually nowhere near as rigorous.

Happily, we have four RCTs on the effects of charter schools that allow us to know something about the effects of charter schools with high confidence.  Here is what we know:  students in urban areas do significantly better in school if they attend a charter schools than if they attend a traditional public school.  These academic benefits of urban charter schools are quite large.  In Boston, a team of researchers from MIT, Harvard, Duke, and the University of Michigan, conducted a RCT and found:  “The charter school effects reported here are therefore large enough to reduce the black-white reading gap in middle school by two-thirds.”

A RCT of charter schools in New York City by a Stanford researcher found an even larger effect: “On average, a student who attended a charter school for all of grades kindergarten through eight would close about 86 percent of the ‘Scarsdale-Harlem achievement gap’ in math and 66 percent of the achievement gap in English.”

The same Stanford researcher conducted an RCT of charter schools in Chicago and found:  “students in charter schools outperformed a comparable group of lotteried-out students who remained in regular Chicago public schools by 5 to 6 percentile points in math and about 5 percentile points in reading…. To put the gains in perspective, it may help to know that 5 to 6 percentile points is just under half of the gap between the average disadvantaged, minority student in Chicago public schools and the average middle-income, nonminority student in a suburban district.”

And the last RCT was a national study conducted by researchers at Mathematica for the US Department of Education.  It found significant gains for disadvantaged students in charter schools but the opposite for wealthy suburban students in charter schools.  They could not determine why the benefits of charters were found only in urban, disadvantaged settings, but their findings are consistent with the three other RCTs that found significant achievement gains for charter students in Boston, Chicago, and New York City.

When you have four RCTs – studies meeting the gold standard of research design – and all four of them agree that charters are of enormous benefit to urban students, you would think everyone would agree that charters should be expanded and supported, at least in urban areas.  If we found the equivalent of halving the black-white test score gap from RCTs from a new cancer drug, everyone would be jumping for joy – even if the benefits were found only for certain types of cancer.

Unfortunately, many people’s views on charter schools are heavily influenced by their political and financial interests rather than the most rigorous evidence.  They don’t want to believe the findings of the four RCTs, so they simply ignore them or cite studies with inferior research designs in which we should have much less confidence.

Progress will be made in our application of research to charter school policies by encouraging everyone to focus on the most rigorous studies, of which we have several.  To do that, supporters of charter schools also have to refrain from citing weaker evidence, which only serves to legitimize the use of inferior studies by charter opponents.  As exciting as the outstanding performance of charter schools is in my own Global Report Card research, that evidence shouldn’t be used to endorse charter schools.  Supporters don’t need to rely on the Global Report Card to make the case for charter schools because they have four gold-standard RCTs on their side.  Opponents of charter schools have no equally rigorous evidence on their side.  And that’s the point we should all be making.


Reform School — Coming to a PBS Station Near You

May 1, 2012

The folks at ChoiceMedia.TV have developed a new PBS series focused on education reform issues called “Reform School.”  You can see some clips of the pilot episode with yours truly here:


Public Service Announcement: This Study Stinks

April 23, 2012

(Guest Post by Mike McShane and Gary Ritter)

That’s it, you heard it here first folks, packs of wild dogs have seized control of America’s major cities.

As crazy as that sounds, a study has been circulating the AERA-/Blogo-/twittersphere that’s states that urban Texas school districts have a black male graduation rate of over 80%.  We all know how much some folks here on the Jay P Greene Blog love Texas, but that is just a bit too hard to swallow.

Let’s back up a minute.  Over the past few days, the press (or actually, Diane Ravitch’s twitter page, and then the press, obediently) picked up a story about a “new” study.  OK, it actually isn’t “new” (it first came out in the Berkeley Review of Education in the Fall of 2011) nor is it really a “study”, but more on that later.   The purpose of this study was clear (to attack KIPP) but in the authors’ zeal, they ended up reporting something too good to be true.

Very quickly, using some rough data from schools in Texas, the authors claim to find that charter schools in Texas, and KIPP schools in particular, have higher attrition rates than comparable public schools, even though KIPP schools allegedly spend more money per pupil.

It appears that the authors, in their haste to smear KIPP schools and disprove the strawman idea that choice (as envisioned today) is a “panacea” (using a 20 year old quote) for all of the ills of the American education system, made some pretty shocking errors and omissions that call into question nearly all of their conclusions.

First of all, several of the alleged “findings” were not “found” in this “study”.  Rather, the authors fill their abstract and conclusions with rehashed claims from an earlier, widely discredited study (see this and this and this and this criticism of the flawed Gary Miron paper).

As for the errors in this paper, there are several.  We’ll just highlight a few of the most glaring:

  1.  First, we derive the 80% graduation number from tables 7 and 8 (pg. 169), which report an annual dropout rate from black students of 3% for grades 6-12 in the “comparable urban districts” of Austin, Dallas, and Houston.  Before we dive into the glaring problems of tables 7 and 8, we must first draw attention to the author’s violation of the denominator law.  We don’t know, in the context of this report, what 3% even means.  That is, what is the numerator and what is the denominator that created that rate?  Is that a yearly figure?  Is that a cohort figure?  The authors are absolutely unclear.  Our best guess is that this is a yearly figure, which if compounded, would put the dropout rate for those districts at about 20% for that time period.  As a point of comparison, the dropout rate nationwide for Black males is 53%; if the authors are right, we should all move to the Lone Star State!

 2.  If that is too hard to believe, the tables also report that this 3% figure is lower than the 4% of black dropouts in the rest of the state.  So, if the Texas miracle didn’t do enough to impress you, you can find Texas to be probably the only state where suburban and rural areas have higher dropout rates than cities.

 3.  In addition to farcically large results, tables 7 and 8 (on pg. 169) also appear to have either basic arithmetic mistakes and/or are missing many of their observations when calculating their graduation rates.  The first two columns “Majority black” and “not majority black” should be comprehensive; that is, all of the observations should fall into one of those two categories.  The same is true with the third and fourth column “>100 (Black Students)” and “<100 (Black Students)”. Thus, both of the numbers in the N’s of these columns should sum to the same number.  However, they don’t. In table 7, the first two columns sum to 167 total charter schools, while the second two columns sum to 245 (incidentally the same number as the “All Charters” N).  The same holds true in Table 8, where the first two columns sum to 243, while the second two sum to 373 (again the same number as the “All Charters”).  So where did the other schools go?

Beyond these problems with the author’s primary analyses, this article eschews higher quality studies of the question at hand to focus on clearly flawed research on the topic.  Mathematica already looked into this question in rigorous studies that found positive impacts on achievement, and “did not find levels of attrition among these KIPP middle schools systematically higher (or lower) than those of other “ schools within their districts (they were also clear about the descriptions and sources of the numbers used in the analysis).

In short, any reasonable person who actually read the content of this “new study” would immediately see so many red flags as to take some serious pause before disseminating the findings unqualified to the universe of education news followers.  (We wonder how closely Ms. Ravitch reviewed the study?  She may well have tweeted first and asked questions later!)  Unfortunately, we live in a world populated by many, many, many unreasonable people.

On the bright side, good research continues to show that KIPP schools are effective for underserved students, most serious people disregard “new studies” that are neither “new” nor “studies”, and hard-working KIPP students, teachers, and school leaders keep going about their work each day.

By the way, if you want to see KIPP’s response to this study, it is here.


Tucker Responds

April 18, 2012

I don’t think Marc Tucker liked my review of his book in Education Next.  He responds in this post on the Ed Next blog.  And my reply to his response is also on the Ed Next blog.

Above I’ve posted a video of Tucker explaining his response.


Robot Essay Grading

April 16, 2012

I received this amazing press release from Tom Vander Ark about how computer grading of essays may be as accurate as human grading.  I’m not sure if this means that computer grading has really advanced or if human grading really stinks.  Besides, I don’t even know why the scientists invented the robots.

In any event, here is the release:

A direct comparison between human graders and software designed to score student essays achieved virtually identical levels of accuracy, with the software in some cases proving to be more reliable, a groundbreaking study has found.

The study, which was underwritten by the William and Flora Hewlett Foundation and conducted by experts in educational measurement and assessment, will be released here on Monday, April 16th, at the annual conference of the National Council on Measurement in Education. An advance copy of the study is available today at http://bit.ly/HJWwdP.
“The demonstration showed conclusively that automated essay scoring systems are fast, accurate, and cost effective,” said Tom Vander Ark, CEO of Open Education Solutions, which provides consulting serves related to digital learning, and co-director of the study.
That’s important because writing essays are one important way for students to learn critical reasoning, but teachers don’t assign them often enough because grading them is both expensive and time consuming. Automated scoring of essays holds the promise of lowering the cost and time of having students write so they can do it more often.
Education experts believe that critical reasoning and writing are part of a suite of skills that students need to be competitive in the 21st century. Others are working collaboratively, communicating effectively and learning how to learn, as well as mastering core academic content. The Hewlett Foundation calls this suite of skills Deeper Learning and is making grants to encourage its adoption at schools throughout the country.
“Better tests support better learning,” says Barbara Chow, Education Program Director at the Hewlett Foundation. “This demonstration of rapid and accurate automated essay scoring will encourage states to include more writing in their state assessments. And, the more we can use essays to assess what students have learned, the greater the likelihood they’ll master important academic content, critical thinking, and effective communication.”
For more than 20 years, companies that provide automated essay scoring software have claimed that their systems can perform as effectively, more affordably and faster than other available methods of essay scoring. The study was the first comprehensive multi-vendor trial to test those claims. The study challenged nine companies that constitute more than ninety-seven percent of the current market of commercial providers of automated essay scoring to compare capabilities. More than 16,000 essays were released from six participating state departments of education, with each set of essays varying in length, type, and grading protocols. The essays were already hand scored according to state standards. The challenge was for companies to approximate established scores by using software.
At a time when the U.S. Department of Education is funding states to design and develop new forms of high-stakes testing, the study introduces important data. Many states are limited to multiple-choice formats, because more sophisticated measures of academic performance cost too much to grade and take too long to process. Forty-five states are already actively overhauling testing standards, and many are considering the use of machine scoring systems.
The study grows from a contest call the Automated Student Assessment Prize, or ASAP, which the Hewlett Foundation is sponsoring to evaluate the current state of automated testing and to encourage further developments in the field.
In addition to looking at commercial vendors, the contest is offering $100,000 in cash prizes in a competition open to anyone to develop new automated essay scoring techniques. The open competition is underway now and scheduled to close on April 30th. The pool of $100,000 will be awarded the best performers. Details of the public competition are available atwww.kaggle.com/c/ASAP-AES . The open competition website includes an active leader board to document prize rules, regularly updated results, and discussion threads between competitors.
The goal of ASAP is to offer a series of impartial competitions in which a fair, open and transparent participation process will allow key participants in the world of education and testing to understand the value of automated student assessment technologies.
ASAP is being conducted with the support of the Partnership for Assessment of Readiness for College and Careers and Smarter Balanced Assessment Consortium, two multi-state consortia funded by the U.S. Department of Education to develop next-generation assessments. ASAP is aligned with the aspirations of the Common Core State Standards and seeks to accelerate assessment innovation to help more students graduate from college and to become career ready.
Jaison Morgan, CEO of The Common Pool, a consulting business that specializes in developing effective incentive models for solving problems, and co-director of the study, said the prize and studies will raise broader awareness of the current capabilities of automated scoring of essays.
“By offering a private demonstration of current capabilities, we can reveal to our state partners what is already commercially available,” Morgan said. “But, by complimenting it with a public competition, we will attract new participants to the field and investment from new players. We believe that the public competition will trigger major breakthroughs.”
ASAP is preparing to introduce a second study, in which private providers and public competitors will be challenged to reveal the capabilities of automated scoring systems for grading short-answer questions. The second study will be conducted this summer. There are another three ASAP studies in development.