Subject: Google Image Labeler, the ESP Game, and Human-Computer Symbiosis
From: "O'Reilly Radar"
URL: http://feeds.feedburner.com/~r/oreilly/radar/rss10/~3/18904054/more_on_google_image_labeler.html
By tim
In comments on my previous entry, Stephen Rondeau pointed out that [Google Image Labeler][1] appears to be based on the [ESP Game][2] developed by Professor [Luis von Ahn][3] of CMU. An anonymous commenter pointed to the video of [Luis von Ahn's tech talk at Google][4] on July 26, 2006.
The tech talk was fascinating, both because there was no hint in it that Google was about to announce something based on von Ahn's work -- the talk is all about his previously published games -- and for the actual thought-provoking content, which gives a lot of background on the design of this kind of game, and in general, the idea of harnessing humans to work as program components via games.
Luis started out his talk by looking at Captchas. Most of this was familiar material, although he has a great definition of a Captcha: "A program that can generate and grade tests that most humans can pass, but that current computer programs cannot pass." (This is an interesting variation on the Turing test, in which humans generate and grade tests that most humans can pass, but current computer programs cannot pass. Is there another variation in the future, in which computers generate and grade tests that computers can pass, but humans cannot pass?)
At about 7 minutes into the talk, he makes a staggering assertion about the amount of time spent on casual games: in 2003, 9 billion hours were spent playing solitaire. By comparison, it took only 7 million human hours (6.8 hours of solitaire) to build the Empire State Building, and only 20 million human hours (less than a day of solitaire) to build the Panama Canal. He then states his thesis, "human computation":
> "We're going to consider the human brain as an extremely advanced processor that can solve problems that computers cannot yet solve. Even more, we're going to consider all humanity as an extremely advanced and large scale distributed processing unit that can solve large scale problems that computers cannot yet solve. I claim that the relationship between humans and computers is extremely parasitic.... What I want to advocate for in this talk is a symbiotic relationship, a symbiosis in which humans solve some problems, computers solve others, and together we work to create a better world."
At that point, the audience dissolves in laughter, as the image he's projecting on the screen is [the symbolic waterfall of the Matrix][5].
Getting serious again, he talks about how games can be used to harness some of this unused processing power, and goes specifically into how they can be used for labeling images. First, he describes the ESP Game. Two players are paired up, and given the same image to label. When they agree on an label, they are given a new image. Their goal is to type as many possible labels as quickly as possible till the two players find a match.
Evidence shows that people find this activity to be fun. He says that people have played as long as 15 hours at a stretch, that many people play over 20 hours a week, and that over three years 75,000 players have agreed on over 15 million labels for images. But so far, what he's done is fairly small-scale. (When I logged in to the ESP Game this morning, there were 179 simultaneous players.) With 5000 simultaneous players -- a number found on many popular online gaming sites, he claims it would take about 2 months to label all images on Google images.
He talks also about some of the things he does to make the game fun, a lot of it eerily reminiscent of Kathy Sierra's insights on [Creating Passionate Users][6]. And I have to say, comparing the ESP Game against Google Image Labeler, I wonder if Google hasn't missed some important points in von Ahn's work. While his implementation is MUCH slower, it's more fun, with sound effects, scoring feedback, and other game features making it much more engaging. He even identifies the partner by login name. (He talks about how people love to play because they feel a really good connection to their partner when they agree on a word. He shows some amazing quotes describing how passionate players are about the game.)
He also talks about how to prevent cheating: They give people test images that have already been labeled, and only store their guesses when it's clear that they are playing fair. And once you have enough images labeled, you can get additional labels by prohibiting labels that have already been used. And of course, you can even implement one player versions of the game, and zero-player versions (in which you get additional matches by re-running the sessions from different player combinations that used the same image.)
But the ESP Game is only one instance of a general class of game that Luis calls "games with a purpose", i.e. games that "run a computation in people's brains rather than in silicon processors." Some of the other games he talked about include:
![][7]
[1]: http://images.google.com/imagelabeler/
[2]: http://www.espgame.org
[3]: http://www.cs.cmu.edu/~biglou/research.html
[4]: http://video.google.com/videoplay?docid=-8246463980976635143
[5]: http://www.calwhite.com/pix/matrix%20001.jpg
[6]: http://headrush.typepad.com/
[7]: http://feeds.feedburner.com/~r/oreilly/radar/rss10/~4/18904054
URL: http://feeds.feedburner.com/~r/oreilly/radar/rss10/~3/18904054/more_on_google_image_labeler.html
This archive was generated by hypermail 2b30 : Tue Jun 09 2009 - 05:00:13 EDT