Study Supports Essay-Grading Technology

By Ian Quillen, April 24, 2012

After a study that suggested automated essay graders are as effective as their human counterparts in judging essay exams, “roboreaders” are receiving a new wave of publicity surrounding their possible inclusion in assessments and classrooms.

But while developers of the technology are happy to have the attention, they insist the high profile has more to do with the timing of policy changes, such as the push to common standards, than with any dramatic evolution in the essay-grading tools themselves.

“What’s changed is the claims people are willing to make about it. … [I]t’s not because the technology has changed,” said Jon Cohen, an executive vice president of the Washington-based American Institutes for Research, one of nine organizations developing software that participated in the study.

“I think, over time, a mixture of technologies will make this really good not only for scoring essays,” but also for other assignments, said Mr. Cohen, the director of AIR’s assessment program. “But we really need to be clear about the limits of the applications we are using today so we can get there.”

Human vs. Machine

The study, underwritten by the Menlo Park, Calif.-based William and Flora Hewlett Foundation, is driven by the push to improve assessments related to the shift to the Common Core State Standards in English/language arts and math, and is based on the examination of essays written specifically for assessments. (The Hewlett Foundation also provides support to Education Week for coverage of “deeper learning.”)

Essay Graders

A recent study examined essay-grading software developed by the following organizations:

American Institutes for Research

Carnegie Mellon University

CTB/McGraw-Hill

Educational Testing Service

Measurement Inc.

MetaMetrics

Pacific Metrics

Pearson Knowledge Technologies

Vantage Learning

SOURCE: “Contrasting State of the Art Automated Scoring of Essays: Analysis”

Each developer’s software graded essays from a sample of 22,000 contributed by six states, using algorithms to measure linguistic and structural characteristics of each essay and to predict, based on essays previously graded by humans, how a human judge would grade a particular submission. All six states are members of one of two state consortia working to develop assessments for the new standards.
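The general approach described above (measure surface characteristics of an essay, then predict a human score from essays already graded by people) can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the three features, the use of a one-variable least-squares fit on word count, and the sample essays and scores are all hypothetical. The study does not describe how any vendor's engine actually works, and real systems are far more elaborate.

```python
import re
from statistics import mean


def features(essay: str) -> dict:
    """Measure a few surface characteristics of an essay (illustrative only)."""
    words = re.findall(r"[A-Za-z']+", essay.lower())
    sentences = [s for s in re.split(r"[.!?]+", essay) if s.strip()]
    return {
        "word_count": len(words),
        "avg_sentence_length": len(words) / max(len(sentences), 1),
        "vocab_diversity": len(set(words)) / max(len(words), 1),
    }


def fit_line(xs, ys):
    """Ordinary least squares with one predictor; returns (slope, intercept)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


# Hypothetical training set: essays paired with made-up human scores.
graded = [
    ("Dogs are good.", 1),
    ("Dogs are good pets. They are loyal and they are friendly.", 2),
    ("Dogs make good pets because they are loyal. They also protect "
     "the home. Many families enjoy them.", 3),
]

# Fit the line on a single feature (word count) against the human scores.
xs = [features(text)["word_count"] for text, _ in graded]
ys = [score for _, score in graded]
slope, intercept = fit_line(xs, ys)


def predict(essay: str) -> float:
    """Estimate the score a human grader might give, from word count alone."""
    return slope * features(essay)["word_count"] + intercept
```

A model this crude rewards length and nothing else, which is one way to see why critics quoted later in the article doubt that surface measures capture what a student is actually saying.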

By and large, the scores generated by the nine automated essay graders matched up with the human grades, and in a press release, study co-director Tom Vander Ark, the chief executive officer of Federal Way, Wash.-based Open Education Solutions, a blended-learning consulting group, said, “The demonstration showed conclusively that automated essay-scoring systems are fast, accurate, and cost-effective.”

Mr. Cohen of AIR cautioned that interpretation could be too broad.

“I think the claims being made about the study wander a bit too far from the shores of our data,” he said.

Mark Shermis, the dean of the college of education at the University of Akron, in Ohio, and a co-author of the study, said the paper doesn’t even touch on the most exciting potential of automated essay graders, which is not their ability to replace test scorers (or possibly teachers) with a cheaper machine, but their ability to expand upon that software to give students feedback and suggestions for revision.

‘Inspiring Composition’

Two vendors in the study (the Princeton, N.J.-based Educational Testing Service and a second vendor with headquarters in Yardley, Pa.) already have offered for most of the past decade software that gives students some basic feedback on the grammar, style, mechanics, organization, and development of ideas in their writing, Mr. Shermis said.

“It’s designed to be a support, so that a teacher can focus him- or herself completely on inspiring composition of writing or creative composition of writing,” he said. “It’s possible that an administrator will say, ‘I’m just going to throw it all to the computer,’ … but that’s not what we would ever recommend.”

Further, one entrant in the study, the LightSIDE software developed by Teledia, a research group at Carnegie Mellon University in Pittsburgh, was created as an extension of research its developers say is only loosely related to automated essay graders.

Their examination of natural language processing, or the science of how computers interact with human language, has focused on the idea that software could help students hold more-productive collaborative discussions about any range of academic subjects, said Ms. Rose, an associate professor of language technology and human-computer interaction.

For example, one project involves using artificial intelligence to drive discussions on an online platform provided by the university to secondary students in the 25,000-student Pittsburgh public schools. A computer-generated persona interacts as one of several participants in an online discussion, asking questions of the students and at times even interjecting humor into a tense situation among students involved in the discussion.

Creating an automated essay grader based on that research came out of a curiosity to see whether the researchers’ methods of evaluating student discussion could transfer to assessment of student composition, said Elijah Mayfield, a doctoral candidate in language and information technology working with Ms. Rose. Commercial vendors involved in the study did not possess a similar background in studying student interaction, perhaps because they couldn’t afford to do so from a business standpoint, he said.

“I think it gets caught up between what machine learning is aiming for and what is commercially feasible,” Mr. Mayfield said.

Smarter Computers

John Fallon, the vice president of marketing with Vantage Learning, said that current policy momentum (including the drive for the creation of new, more writing-intensive assessments) will only help drive improvements in all realms of natural-language-processing study. That includes projects like those at Carnegie Mellon, as well as those at his own company.

“A lot of it comes down to, the more submissions we get, the smarter the [computer] engine gets,” said Mr. Fallon, who asserts that his company’s offerings are able not only to score student writing, but also to give those students feedback for improvement.

“The transition to the common core and what that’s going to require is really bringing a much stronger focus for writing,” he said. “And the challenge has always been how can we get teachers to get students to write more and maintain interaction at the student level.”

But Will Fitzhugh, the publisher of a Sudbury, Mass.-based quarterly scholarly journal that publishes secondary students’ academic writing, said he is skeptical of whether there is any application of automated essay graders that would enhance students’ educational experience.

Contrary to those concerned about how the technology would change the roles of teachers, Mr. Fitzhugh said the greater issue is that such software encourages the assignment of compositions to be written in class and the use of assessments in which learning the content before writing about it is undervalued.

And he disputes the notion that understanding organization, sentence structure, and grammar alone is enough to give students the writing command they’ll need in future careers.

“The idea that the world of business or the world of whatever wants you to write something you know nothing about in 25 minutes is just a mistake,” Mr. Fitzhugh said. “I haven’t looked deeply into what the computer is looking at, but I don’t think they are capable of understanding what the student is actually saying.”

A version of this article appeared in the April 25, 2012 edition of Education Week as Study Supports Essay-Grading Technology
