Big data meets the Bard
John Sunyer:
Here's some advice for bibliophiles with teetering piles of books and not enough hours in the day: don't read them. Instead, feed the books into a computer program and make graphs, maps and charts: it is the best way to get to grips with the vastness of literature. That, at least, is the recommendation of Franco Moretti, a 63-year-old professor of English at Stanford University and unofficial leader of a band of academics bringing a science-fiction thrill to the science of fiction.
For centuries, the basic task of literary scholarship has been close reading of texts. But for digitally savvy academics such as Moretti, literary study doesn't always require scholars actually to read books. This new approach to literature depends on computers to crunch "big data", or massive stores of information, to produce new insights.
Who, for example, would have guessed that, according to a 2011 Harvard study of four per cent (that is, five million) of all the books printed in English, fewer than half of the words used appear in dictionaries, the rest being "lexical dark matter"? Or that, as a recent study by the universities of Bristol, Sheffield and Durham, using the same database, reveals, "American English has become decidedly more 'emotional' than British English in the last half-century"?
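The arithmetic behind a figure like "fewer than half of the words appear in dictionaries" is, at heart, a counting exercise. The sketch below is a minimal illustration of that kind of estimate, not the Harvard team's actual pipeline (which worked over the Google Books n-gram corpus): it assumes a hypothetical folder of plain-text books and a reference word list, counts the distinct word types in the corpus, and reports what share of them show up in the list.

```python
# Rough sketch of a "lexical dark matter" style estimate: count distinct
# word types in a corpus and check how many appear in a reference word list.
# The paths are placeholders and the tokeniser is deliberately crude; real
# studies normalise far more carefully.
import re
from collections import Counter
from pathlib import Path

WORD = re.compile(r"[a-z]+(?:'[a-z]+)?")  # lowercase words, optional apostrophe

def word_types(corpus_dir: str) -> Counter:
    """Count distinct word types across every .txt file in corpus_dir."""
    counts = Counter()
    for path in Path(corpus_dir).glob("*.txt"):
        text = path.read_text(encoding="utf-8", errors="ignore").lower()
        counts.update(WORD.findall(text))
    return counts

def dictionary_coverage(corpus_dir: str, wordlist_path: str) -> float:
    """Fraction of corpus word types found in the reference word list."""
    types = word_types(corpus_dir)
    lexicon = set(Path(wordlist_path).read_text(encoding="utf-8").lower().split())
    if not types:
        return 0.0
    return sum(1 for w in types if w in lexicon) / len(types)

if __name__ == "__main__":
    # Placeholder inputs: a directory of plain-text books and a word list
    # such as /usr/share/dict/words on many Unix systems.
    share = dictionary_coverage("books/", "/usr/share/dict/words")
    print(f"{share:.1%} of word types found in the reference list; "
          f"the remainder is the 'dark matter'.")
```

The same pattern, run over frequencies of hand-labelled "emotion" words per decade, is roughly the shape of the analysis behind the British-versus-American comparison, though the published studies use far larger corpora and more careful normalisation.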
Posted by Jim Zellmer at June 17, 2013 12:31 AM