Welcome to The Valve
Login
Register


Valve Links

The Front Page
Statement of Purpose

John Holbo - Editor
Scott Eric Kaufman - Editor
Aaron Bady
Adam Roberts
Amardeep Singh
Andrew Seal
Bill Benzon
Daniel Green
Jonathan Goodwin
Joseph Kugelmass
Lawrence LaRiviere White
Marc Bousquet
Matt Greenfield
Miriam Burstein
Ray Davis
Rohan Maitzen
Sean McCann
Guest Authors

Laura Carroll
Mark Bauerlein
Miriam Jones

Past Valve Book Events

cover of the book Theory's Empire

Event Archive

cover of the book The Literary Wittgenstein

Event Archive

cover of the book Graphs, Maps, Trees

Event Archive

cover of the book How Novels Think

Event Archive

cover of the book The Trouble With Diversity

Event Archive

cover of the book What's Liberal About the Liberal Arts?

Event Archive

cover of the book The Novel of Purpose

Event Archive

The Valve - Closed For Renovation

Happy Trails to You

What’s an Encyclopedia These Days?

Encyclopedia Britannica to Shut Down Print Operations

Intimate Enemies: What’s Opera, Doc?

Alphonso Lingis talks of various things, cameras and photos among them

Feynmann, John von Neumann, and Mental Models

Support Michael Sporn’s Film about Edgar Allen Poe

Philosophy, Ontics or Toothpaste for the Mind

Nazi Rules for Regulating Funk ‘n Freedom

The Early History of Modern Computing: A Brief Chronology

Computing Encounters Being, an Addendum

On the Origin of Objects (towards a philosophy of computation)

Symposium on Graeber’s Debt

The Nightmare of Digital Film Preservation

Richard Petti on Occupy Wall Street: America HAS a Ruling Class

Bill Benzon on Whatwhatwhatwhatwhatwhatwhat?

Nick J. on The Valve - Closed For Renovation

Bill Benzon on Encyclopedia Britannica to Shut Down Print Operations

Norma on Encyclopedia Britannica to Shut Down Print Operations

Bill Benzon on What’s an Object, Metaphysically Speaking?

john balwit on What’s an Object, Metaphysically Speaking?

William Ray on That Shakespeare Thing

Bill Benzon on That Shakespeare Thing

William Ray on That Shakespeare Thing

JoseAngel on That Shakespeare Thing

Bill Benzon on Objects and Graeber's Debt

Bill Benzon on A Dirty Dozen Sneaking up on the Apocalypse

JoseAngel on A Dirty Dozen Sneaking up on the Apocalypse

JoseAngel on Objects and Graeber's Debt

Advanced Search

Articles
RSS 1.0 | RSS 2.0 | Atom

Comments
RSS 1.0 | RSS 2.0 | Atom

XHTML | CSS

Powered by Expression Engine
Logo by John Holbo

Creative Commons Licence
This work is licensed under a Creative Commons License.

 


Blogroll

2blowhards
About Last Night
Academic Splat
Acephalous
Amardeep Singh
Beatrice
Bemsha Swing
Bitch. Ph.D.
Blogenspiel
Blogging the Renaissance
Bookslut
Booksquare
Butterflies & Wheels
Cahiers de Corey
Category D
Charlotte Street
Cheeky Prof
Chekhov’s Mistress
Chrononautic Log
Cliopatria
Cogito, ergo Zoom
Collected Miscellany
Completely Futile
Confessions of an Idiosyncratic Mind
Conversational Reading
Critical Mass
Crooked Timber
Culture Cat
Culture Industry
CultureSpace
Early Modern Notes
Easily Distracted
fait accompi
Fernham
Ferule & Fescue
Ftrain
GalleyCat
Ghost in the Wire
Giornale Nuovo
God of the Machine
Golden Rule Jones
Grumpy Old Bookman
Ideas of Imperfection
Idiocentrism
Idiotprogrammer
if:book
In Favor of Thinking
In Medias Res
Inside Higher Ed
jane dark’s sugarhigh!
John & Belle Have A Blog
John Crowley
Jonathan Goodwin
Kathryn Cramer
Kitabkhana
Languagehat
Languor Management
Light Reading
Like Anna Karina’s Sweater
Lime Tree
Limited Inc.
Long Pauses
Long Story, Short Pier
Long Sunday
MadInkBeard
Making Light
Maud Newton
Michael Berube
Moo2
MoorishGirl
Motime Like the Present
Narrow Shore
Neil Gaiman
Old Hag
Open University
Pas au-delà
Philobiblion
Planned Obsolescence
Printculture
Pseudopodium
Quick Study
Rake’s Progress
Reader of depressing books
Reading Room
ReadySteadyBlog
Reassigned Time
Reeling and Writhing
Return of the Reluctant
S1ngularity::criticism
Say Something Wonderful
Scribblingwoman
Seventypes
Shaken & Stirred
Silliman’s Blog
Slaves of Academe
Sorrow at Sills Bend
Sounds & Fury
Splinters
Spurious
Stochastic Bookmark
Tenured Radical
the Diaries of Franz Kafka
The Elegant Variation
The Home and the World
The Intersection
The Litblog Co-Op
The Literary Saloon
The Literary Thug
The Little Professor
The Midnight Bell
The Mumpsimus
The Pinocchio Theory
The Reading Experience
The Salt-Box
The Weblog
This Public Address
This Space: The Fire’s Blog
Thoughts, Arguments & Rants
Tingle Alley
Uncomplicatedly
Unfogged
University Diaries
Unqualified Offerings
Waggish
What Now?
William Gibson
Wordherders

Saturday, October 27, 2007

Monkey Grammar

Posted by Bill Benzon on 10/27/07 at 08:28 AM

Over at Language Log Geoffrey Pullum has a rant against computerized grammar-checking.  He quotes on Jason Snell as saying that “an infinite number of monkeys will analyze your writing and present you with useless grammar complaints.” Pullum concludes:

For the most part, accepting the advice of a computer grammar checker on your prose will make it much worse, sometimes hilariously incoherent. If you want an amusing way to while away a rainy afternoon, take a piece of literary prose you consider sublimely masterful and run the Microsoft Word™ grammar checker on it, accepting all the suggested changes.


Comments

Hmmm, like many, I assume, I run Word’s Grammar Check for a couple of reasons: first, it does find typos, misspelled words, and the occasional unintended grammatical “error”; second, I get to feel superior by clicking “Ignore” repeatedly.

I like that grammar checks are imperfect. If or when they approach perfection, I’ll be seriously bummed, just as I was when Deep Blue beat Garry Kasparov.

By on 10/27/07 at 09:59 AM | Permanent link to this comment

Belittling the current state of a technology is always to miss the point. The nature of technology is that version n+1 will probably be a little better than version n.

I’d be interested in a post that argued why computer grammar checking can’t ever be useful. Until then, I’m afraid Geoffrey Pullum’s rant does not rise above technophobia.

(Example: I just corrected a spelling mistake caught by Firefox’s built-in spell checker for text boxes. It’s currently a pretty crappy spell checker, but in a year I bet I’ll be relying on it.)

By tom s. on 10/27/07 at 10:26 AM | Permanent link to this comment

Don’t confusing spelling checkers with grammar checkers. Finding typos is a good thing.

But English grammar is not something that can really be analyzed programmatically. Believe me, as a professional technology writer (which is why I brought up the subject in the first place) I’m about as far away from technophobia as you can imagine. But as a writer and a technology guy, I can tell you that grammar checkers are a failure because the rules of our language are far too complicated for today’s computer programs to grasp. I’ve yet to see any algorithm that could actually understand sentence and paragraph construction.

Instead, what computer grammar checkers do is look for basic (and debatable if not outright wrong!) “errors” such as split infinitives, run-on sentences (which often aren’t), and the like.

Believe me, on the day that someone comes up with a grammar checker that actually works like an editor or schoolmarm looking over your shoulder, I’ll be the first to turn it on. But right now all they can provide is a false sense of security and constant nudges about things that are probably not wrong. Hence my reference to an infinite number of monkeys. They bring a lot of calculation power to bear, but you end up with more crap on the walls than coherence on the page.

By Jason Snell on 10/27/07 at 11:26 AM | Permanent link to this comment

Well you have a point. This seems a matter for a bet that in five years grammar checkers will be useful (or not), but I have no idea what the criteria would be.

By tom s. on 10/28/07 at 09:08 AM | Permanent link to this comment

But English grammar is not something that can really be analyzed programmatically.

... And even if it were, many of the habits considered “grammatical errors” are better described as “stylistic infelicities” or “correct but clumsy writing”.

Leaving aside the whole question of whether there is an objective grammatical standard to adhere to, most of the rules that find their way into prescriptive grammar should be broken on occasion, because the alternative is worse. (Oops, passive voice… “One should break most of the rules on occasion”?)

The problem with these things is that they’re too ambitious. It’s good to be be told about repeated function words. That’s not spell-checking, but it is catching objective grammatical mistakes, and ones that are often made and surprisingly difficult to spot. But once you start making “rules for grammar checking” you’re more or less committed to trying to address the “common grammatical errors” that people think are there, even if they’re “debatable if not outright wrong”.

(Tom: they’d be useful now if you could choose which individual rules to turn off and which to leave on. Only they’d be doing such a limited task it’s arguable whether we should call them “grammar checkers” or not.)

By Tikitu on 10/28/07 at 11:04 AM | Permanent link to this comment

Last I saw Word’s gramerchek systematically advised at least one error. With subjects like “The men of the family....” Word would sometimes (always?) suggest that the verb agree agree with the word “family”: “The men of the family was gathered in front of the house.”

Iglesias nedes a beter spelchek.

By on 10/28/07 at 02:11 PM | Permanent link to this comment

I proofread quite a bit of English prose written by Polish friends. I had a go writing a “depolariser” to add and remove articles as necessary. Unsurprisingly it didn’t work. The results have a weird charm of their own though.

(I was lazy and went for bi- and trigram frequencies; it turns out the dependencies you’re looking for are almost always long-distance so that doesn’t work. But according to Ray Kurzweil you can use them for poetry.)

By Tikitu on 10/28/07 at 04:02 PM | Permanent link to this comment

This study—papyr.com/hypertextbooks/grammar/gramchek.htm— confirms my experience: grammar checkers are sometimes useful.

Obviously, they are tools best used by those who understand grammar.

By on 10/28/07 at 05:09 PM | Permanent link to this comment

The study doesn’t count false positives; typically you test algorithms like this on both recall (how many targets did it find?) and precision (how many results returned were really targets?). (That’s the standard for search algorithms, anyway, and seems highly applicable to this case too.)

You can easily improve recall by returning more results, but precision tends to suffer—and while you can easily test recall on a constructed example like this one (with one chosen example of each “error") you really need to test precision on live text.

By Tikitu on 10/28/07 at 06:11 PM | Permanent link to this comment

I don’t know what we are arguing here. Of course grammar checkers are imperfect. If that means one should never use them, then I disagree. Luckily, we can all make our own choices.

By on 10/28/07 at 07:45 PM | Permanent link to this comment

Tikitu: I had to teach the use of articles ("a", “an”, “the") to Chinese once, who have nothing comparable. I failed. Everyone fails. Screw the articles.

Every native speaker knows how to use (the?) articles correctly, but no one can explain what they do, prescriptive grammars say nothing because it isn’t a native-speaker mistake, descriptive and analytic grammars say little because it isn’t a deep technical question, and ESL texts seem to neglect the topic.

The dog came into the room.
A dog came into the room.
The dog is a loyal creature.
Dogs are loyal creatures.
NOT: The dogs are loyal creatures.
PROBABLY NOT: The dog came into the room. It was a loyal creature. (Sounds parodic, actually).

By John Emerson on 10/28/07 at 09:43 PM | Permanent link to this comment

Gerrold: you’re right, of course there’s no basic disagreement. And in fact I don’t use a grammar checker because I’m not writing in a word-processor and so there isn’t one there to use… There’s something like an objective question of whether they do more harm than good, especially for people already weak on grammar (that’s where the recall/precision question comes up: if you’re just accepting all changes, do you end up worse than you started?) but I wouldn’t care to take a stand there. And, as you say, probably those people shouldn’t be relying on the tool anyway.

By Tikitu on 10/29/07 at 08:22 AM | Permanent link to this comment

John: Polish (and all the Slavic languages?) has same problem with the articles: they aren’t there.

I was curious, so I went back and looked at how my depolariser was working. Typical example is “finite models corresponding to ***A*** given quantifiers”, which is much more clear-cut. (It would presumably have worked if I’d taken 3/4-grams, since plural “quantifiers” would block the addition. But the sparse data problems, the laziness, etc...) For something it got right (also fairly clear-cut I think, although I can’t tell you what the rule might be): “In ***THE*** second part of the paper examples of syntax learning
models are shown”.

(Starred words are the additions from the algorithm; it seems I didn’t get it to mark deletions at all, which is a shame.)

By Tikitu on 10/29/07 at 08:31 AM | Permanent link to this comment

ALSO NOT: The dog, a loyal creature, came into the room.

By John Emerson on 10/29/07 at 11:32 AM | Permanent link to this comment

Add a comment:

Name:
Email:
Location:
URL:

 

Remember my personal information

Notify me of follow-up comments?

Please enter the word you see in the image below: