Google’s Love to Machine Translation Is beyond my Understanding

Svetlana Gladkova,


Google Reader logoHonestly, I did not want to write about Google Reader team proudly reporting on making the world wide web truly worldwide by offering the functionality to translate any feed you read into your native language (on condition that your native language is among those supported by Google’s machine translation system). Really, I thought I have said enough about poor quality of machine translation overall and Google translation in particular as well as about the dubious practice of using volunteers for their own translation projects instead of paying to experienced translators to localize their multiple products into multiple languages to ensure acceptable quality of translation instead of what we now have with Google’s products often used in English by many people knowing Google’s native language well enough not to have to deal with the poorly translated versions in their own native languages.

But still it looks like Google is very enthusiastic about automated translation and can’t resist trying to help you translate everything you want into any language you want for free using yet another Google’s tool. Unfortunately for Google as a person with 7 years of translation experience myself I can tell that you will hardly ever find a translator who will agree that machine translation can be useful for anything, even to understand the general meaning of the text simply because meaning is exactly what a machine-translated text lacks.

Yet we see another experiment from Google promising to help everyone in the world understand everyone else, no matter what combination of languages the two persons speak. Yesterday Google officially added translation functionality to Google Reader. This functionality allows you to read any feed of your choice in your native language (or another language you want it to be displayed in). In the settings of any feed you can now choose to have this feed automatically translated into your language.

First of all, I have my doubts about the necessity of this functionality simply because it is pretty difficult to figure out how we are supposed to discover the feeds in other languages that you might be interested in reading. Are we supposed to start building some worldwide database of non-English blogs worth subscribing to for everyone to find something interesting in another language?

But my major concern here is quite usual: instead of actually reading anything you will see a text displayed in the language of your choice - but in many cases lacking any meaning or understandability at all. Of course the English-only blogosphere is very excited about the functionality (as just about everything Google comes up with) but I have my doubts about people speaking having languages other than English as their native ones. There are obviously some language pairs that work more or less acceptable while others lack coherence entirely.

As a native Russian speaker I started my research with the Russian language, obviously. So first of all, I tried to translate the feed with Techmeme top stories I am subscribed to into Russian. As you know if you are subscribed to Techmeme top stories yourself, this feed only contains titles of the posts or articles, names of their authors along with the publications they appear in and a short introduction from the article itself. I guess this feed must be quite a valid example since if I did not know English but wanted to be up to date with the most discussed technology events, I’d definitely choose Techmeme for my quick share of news.

I actually hoped the translation would provide a feed in Russian that will be understandable enough to get a general understanding of what is going on in the technology world but to my dismay it just did not happen and if I did not read these very news in English before, I swear I’d never been able to understand what they were about even remotely. I hope it is clear that I turned the translation setting for this feed off immediately after completing the post. And of coursed I switched my language in the account settings back to English as Google Reader itself in Russian is pretty difficult to use.

I know that my own opinion should not be trusted 100% so I went looking for additional evidence in the Russian-language blogosphere. All of the tests carried out by bloggers here in Russia had some hilarious examples of translations Google performed for their feeds - and this should have been expected. After reading a few of the Russian-language bloggers talking about the service, I have a feeling that people are not appalled by the service as it can produce decent results for some general news topics (at least to understand what countries or people the piece of news in question is about). But of course the agreement is that if you know English enough to understand general meaning of a text, you’d better off trying to read it in English than struggling to understand what the Russian translation is about.

The most important thing that I’ve seen on blogs in French and Spanish is that people admit the translation is not perfect but the translation system is getting better and I think this must be the best compliment Google could get for the efforts.

I have also checked the comments to the English-language blog posts covering the announcement from people native in other languages sharing their experience with the new feature. People reported poor translation from English to Chinese and Japanese along with some “Asian” language which I have no idea about. There were some language pairs where people reported decent translation - like English to Italian - and it is understandable as some languages are just easier handled by machine translation systems than others.

But unfortunately there are languages that are simply beyond efforts of any machine translation system I’ve ever heard of - my own native Russian including. And if you choose Google to help you read things written in these languages or to translate feeds into these languages for you, you will be sure to have lots of hilarious examples of machine translation quality and tons of time spent trying to figure out the meaning of what you are supposed to understand from the translation.

At least the team over at Google Reader promises that our reading experience will be getting better as the translation technology develops and gets better itself. But my question is why they did not wait until the automated translation technology is mature enough to provide coherent reading experience instead of just hilarious things in your feeds or at least roll the functionality out only for the languages that are reported by users to produce acceptable quality of translation.

Google Reader translated Techmeme feed into Russian