Auto Check

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Thursday, 26 April 2012

Breaking down the language barrier—six years in

Posted on 11:00 by Unknown
The rise of the web has brought the world’s collective knowledge to the fingertips of more than two billion people. With just a short query you can access a webpage on a server thousands of miles away in a different country, or read a note from someone halfway around the world. But what happens if it’s in Hindi or Afrikaans or Icelandic, and you speak only English—or vice versa?

In 2001, Google started providing a service that could translate eight languages to and from English. It used what was then state-of-the-art commercial machine translation (MT), but the translation quality wasn’t very good, and it didn’t improve much in those first few years. In 2003, a few Google engineers decided to ramp up the translation quality and tackle more languages. That's when I got involved. I was working as a researcher on DARPA projects looking at a new approach to machine translation—learning from data—which held the promise of much better translation quality. I got a phone call from those Googlers who convinced me (I was skeptical!) that this data-driven approach might work at Google scale.

I joined Google, and we started to retool our translation system toward competing in the NIST Machine Translation Evaluation, a “bake-off” among research institutions and companies to build better machine translation. Google’s massive computing infrastructure and ability to crunch vast sets of web data gave us strong results. This was a major turning point: it underscored how effective the data-driven approach could be.

But at that time our system was too slow to run as a practical service—it took us 40 hours and 1,000 machines to translate 1,000 sentences. So we focused on speed, and a year later our system could translate a sentence in under a second, and with better quality. In early 2006, we rolled out our first languages: Chinese, then Arabic.

We announced our statistical MT approach on April 28, 2006, and in the six years since then we’ve focused primarily on core translation quality and language coverage. We can now translate among any of 64 different languages, including many with a small web presence, such as Bengali, Basque, Swahili, Yiddish, even Esperanto.

Today we have more than 200 million monthly active users on translate.google.com (and even more in other places where you can use Translate, such as Chrome, mobile apps, YouTube, etc.). People also seem eager to access Google Translate on the go (the language barrier is never more acute than when you’re traveling)—we’ve seen our mobile traffic more than quadruple year over year. And our users are truly global: more than 92 percent of our traffic comes from outside the United States.

In a given day we translate roughly as much text as you’d find in 1 million books. To put it another way: what all the professional human translators in the world produce in a year, our system translates in roughly a single day. By this estimate, most of the translation on the planet is now done by Google Translate. (We can’t speak for the galaxy; Douglas Adams’s “Babel fish” probably has us beat there.) Of course, for nuanced or mission-critical translations, nothing beats a human translator—and we believe that as machine translation encourages people to speak their own languages more and carry on more global conversations, translation experts will be more crucial than ever.

We imagine a future where anyone in the world can consume and share any information, no matter what language it’s in, and no matter where it pops up. We already provide translation for webpages on the fly as you browse in Chrome, text in mobile photos, YouTube video captions, and speech-to-speech “conversation mode” on smartphones. We want to knock down the language barrier wherever it trips people up, and we can’t wait to see what the next six years will bring.

Posted by Franz Och, Distinguished Research Scientist, Google Translate

Email ThisBlogThis!Share to XShare to Facebook
Posted in | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • Hulu Plus now works with Chromecast
    Hulu has added Chromecast support to their Hulu Plus app—just in time for the fall television season. Now you can easily enjoy your favori...
  • Providing a springboard for women entrepreneurs in India
    Meghana Musunuri was a typical female entrepreneur in India. Born and brought up in Medak , she received a good education and spent time ab...
  • A look inside our 2011 diversity report
    We work hard to ensure that our commitment to diversity is built into everything we do—from hiring our employees and building our company cu...
  • Software downloads in Syria
    Free expression is a fundamental human right and a core value of our company—but sometimes there are limits to where we can make our product...
  • Celebrating teachers on National Teacher Day
    One of the best parts of my job working on the Google Education team has been hearing inspiring stories time and again of great teachers who...
  • Shiver me timbers, the 2012 D4G Winner is....
    After 114,000 submissions and millions of your votes, second grader Dylan Hoffman of Caledonia, Wisc. is this year’s U.S. Doodle 4 Google N...
  • Supporting Innovation in African News
    Cross-posted from the European Public Policy Blog We’re eager to see journalism flourish in the digital age, in all forms and on all contine...
  • Google+ Hangouts On Air: broadcast your conversation to the world
    Last year we introduced Hangouts On Air to a limited number of broadcasters, enabling them to go live with friends and fans, for all the wo...
  • New research shows smartphone growth is global
    Last October, we launched Our Mobile Planet , a resource enabling anyone to visualize the ways smartphones are transforming how people conne...
  • Local—now with a dash of Zagat and a sprinkle of Google+
    Finding the best places to go is an essential part of our lives, as are the people and resources that help us make those decisions. In fact,...

Categories

  • accessibility
  • acquisition
  • ads
  • Africa
  • Android
  • apps
  • Asia
  • books + book search
  • chrome
  • chrome + chrome os
  • commerce
  • computing history
  • crisis response
  • Cultural Institute
  • culture
  • developers
  • display advertising
  • diversity
  • doodles
  • education
  • education and research
  • energy
  • enterprise
  • entrepreneurs at Google
  • entrepreneurship
  • Europe
  • events
  • faster web
  • free expression
  • g2g
  • giving
  • Google Apps highlights
  • google ideas
  • google play
  • google.org
  • google+
  • googleplus
  • googlers and culture
  • government transparency
  • green
  • innovation
  • ipv6
  • journalism and news
  • Latin America
  • local
  • maps and earth
  • mobile
  • online safety
  • open source
  • personalization
  • photos
  • policy and issues
  • politics
  • privacy
  • privacy and security
  • publishers
  • scholarships
  • search
  • search stories
  • search trends
  • security
  • security and safety tips
  • small business
  • transparency
  • youtube and video

Blog Archive

  • ►  2013 (190)
    • ►  December (11)
    • ►  November (13)
    • ►  October (15)
    • ►  September (12)
    • ►  August (10)
    • ►  July (13)
    • ►  June (28)
    • ►  May (16)
    • ►  April (21)
    • ►  March (18)
    • ►  February (19)
    • ►  January (14)
  • ▼  2012 (269)
    • ►  December (25)
    • ►  November (20)
    • ►  October (18)
    • ►  September (16)
    • ►  August (19)
    • ►  July (20)
    • ►  June (28)
    • ►  May (30)
    • ▼  April (19)
      • Supporting data innovation in journalism throughou...
      • In Nashville, the sweet sound of entrepreneurship
      • Breaking down the language barrier—six years in
      • From countering radicalization to disrupting illic...
      • The Google Photography Prize 2012 winner
      • Introducing Google Drive... yes, really
      • YouTube Marketing Ambassadors play big at Google
      • Planting some green this Earth Day
      • Exploring Jerusalem’s Old City streets with Street...
      • Inside view on ads review
      • Spring-cleaning … in spring!
      • Making the web work for major brands
      • Celebrating the Google Photography Prize Finalists
      • Technologists and muckrakers pursuing a more perfe...
      • Toward a simpler, more beautiful Google
      • One desk chair—hold the formaldehyde
      • Google+ Hangout with the UN Secretary-General
      • Celebrating six students receiving the AP-Google J...
      • Going global in search of great art
    • ►  March (27)
    • ►  February (23)
    • ►  January (24)
  • ►  2011 (41)
    • ►  December (33)
    • ►  November (8)
Powered by Blogger.

About Me

Unknown
View my complete profile