» tagged pages
» logout

sorted by: recent | see : popular
Content Tagged with translation + Web

Machines Can Ease the Olympics Translation Crunch

The Olympics is a boon to translators. Much of the reporting, interpretation and documentation for the massively international event is handled by humans, but human translators with the right skills can be scarce. “Between some pairs of languages, there are very few people who are experts in both,” said Sanford Cohen, founder of message translation firm SpeakLike. But it’s not just the languages needed, either. New forms of communication like IM, email, voicemail and the web demand different approaches, and computers can help with both challenges.

Machine translation is nothing new: Systran, founded in 1968 to help translate Cold War communications, powered the 1997 launch of the Babelfish service that popularized online translation, and until recently, it was behind Google’s translation systems. Humorous results aside, machine translation works well when software has access to sample text or past translations. “There has been a significant improvement in translation quality because of computing power,” Dimitris Sabatakakis, Systran’s CEO, said.

But relying on previous translations and large samples doesn’t work as well for IM because short messages lack context. And in translation, context is everything. “If you’re trying to translate ‘serveur’ and you know it’s about food, you’ll probably choose ‘waiter,’” Sabatakakis said. “If it’s about computers, you’ll probably choose ‘server.’”

To compensate for context, IM translation company Speaklike uses a combination of software and people and requires that subscribers use its own IM client. Speaklike currently offers four languages — English, Spanish, Portuguese and Chinese — and the system “daisy chains” languages to get from, say, Portuguese to Chinese by going through English. While having two interpreters might be awkward in person, the only impact in an IM context is a slightly increased delay.

Competitor Meglobe, which launched its translation client on Tuesday, is machine-only, meaning it can offer speed, privacy and more languages but lacks the benefit of humans checking translations. Meglobe, which uses Jabber instead of its own client, hopes the wisdom of the crowds can help make its translations better over time by suggesting better interpretations.

VoIP service provider JAJAH is also getting into the interpreter game, starting with English-to-Mandarin Chinese. Their new JAJAH Babel service, developed in conjunction with IBM, lets users dial their local JAJAH number, say an English phrase, and hear it played back in Chinese or vice-versa. The software combines voice recognition and translation, but getting it to understand what you’re saying can be a challenge even on a clear phone line.

SpeakLike’s Cohen sees significant opportunity for alternative ways of communicating. “The Chinese government trained a hundred thousand people to speak English. Typically that’s two or three designated speakers in each company,” he said. “If an engineer in the U.S. is talking to his counterpart in China, everything goes through a point person. This technology breaks down the barriers for direct communications across the organization.”

Technology-News: GigaOm

Google Translation Center: The World’s Largest Translation Memory

Disclosure: I am the founder of Der Mundo, a multilingual blogging service and translation community that combines human and machine translation (provided in part by Google), and I have researched translation technology for more than 10 years via the Worldwide Lexicon project.

Blogoscoped reports that Google is preparing to launch Google Translation Center, a new translation tool for freelance and professional translators. This is an interesting move, and it has broad implications for the translation industry, which up until now has been fragmented and somewhat behind the times, from a technology standpoint

Google has been investing significant resources in a multi-year effort to develop its statistical machine translation technology. Statistical MT works by comparing large numbers of parallel texts that have been translated between languages and from these learns which words and phrases usually map to others — similar to the way humans acquire language. The problem with statistical MT is that it requires a large number of directly translated sentences. These are hard to find, and because of this SMT systems use sources like the proceedings from the European Parliament, United Nations, etc. Which are fine if you’re writing in bureaucrat-speak, but aren’t so great for other texts. Google Translation Center is a straightforward and very clever way to gather a large corpus of parallel texts to train its machine translation systems.

Part machine translator and part translation memory (a sort of search engine for translation that helps translators to recall translations), GTC will help translators by providing a free, global translation memory, and in turn drive costs down by reducing the amount of work needed to complete a text. It will help Google by providing an excellent source of high quality parallel texts that can be fed back into the statistical translation systems.

If Google releases an API for the translation management system, it could establish a de facto standard for integrated machine translation and translation memory, creating a language platform around which projects like Der Mundo can build specialized applications and collect more training data.

On the other hand, GTC could be bad news for translation service bureaus — especially those that use proprietary translation management systems as a way to hold customers and translators hostage. Most translation bureaus aren’t really technology companies and aren’t very competent at building quality software. Google Translation Center fills a void in the translation tools market that was created when the few independent companies, such as Trados, were acquired.

For freelancers, GTC could be very good news; they could work directly with clients and have access to high quality productivity tools. Overall this is a welcome move that will force service providers to focus on quality, while Google, which is competent at software, can focus on building tools. Google has a pretty mixed track record with consumer-facing services outside its core search business. But if it positions itself as a neutral service provider, it could enable projects like Der Mundo and others to create powerful and easy-to-use translation services for a broad range of industries.

Translation management is more complex than it appears, with different practices in different industries. If you’re translating a news story, you want minimal cost and fast turnaround time (publish early, correct often). If you’re translating a product spec sheet, you’re willing to spend more to have it done right before it goes to press. Google would be smart to position GTC as a utility for translators and to encourage service bureaus to standardize around it, much as it did around earlier tools like Trados, and much as it has done with their keyword ad business. That strategy would also eliminate a potential conflict of interest, as translation professionals are understandably wary of contributing to something that could put them out of work, as well as avoid channel conflicts with partners who will be their best advocates in selling to various clients.

While it’s my guess that Google has no intention of directly monetizing the service (charging a commission on transactions it brokers would expose Google to a billing and payment disbursal nightmare), the R&D value of collecting millions of parallel sentences in every language pair imaginable is indisputable, and it will pay off in unforeseen ways. So, my guess is Google will make this a free tool for the translation industry to use, and it will figure the money part out later. It can afford to be patient.

Translation is a very difficult problem. If it weren’t, it would have been solved a long time ago. I remain convinced that a multilingual web will be a reality in a short time, and that a menagerie of tools and services will emerge over the next few years — some geared toward helping translators, some toward building translation communities, and others that make publishing multilingual sites and blogs easy and intuitive.

As these emerge, the web will begin translating itself, and within a short time, we’ll be able to read content from sources worldwide just as we currently explore the web in our own language today.

Technology-News: GigaOm

The Translate Toolkit and Pootle provide a way to make it easier to localise using standards based file formats (PO and XLIFF).

The Translate Toolkit and Pootle provide a way to make it easier to localise using standards based file formats (PO and XLIFF). Our goal is to make the life of a localiser easier, making it easy to localise, creating high quality tools that help you to wo

License:GPL: del.icio.us tag/gpl

HTML 4.01仕様書

そもそもHTML4.01の仕様書ちゃんと読もう。

W3C: Del.icio.us W3C Tags

Internationalization Tag Set (ITS) Version 1.0

This document defines data categories and their implementation as a set of elements and attributes called the Internationalization Tag Set (ITS). ITS is designed to be used with schemas to support the internationalization and localization of schemas and d

W3C: Del.icio.us W3C Tags

Geek to Live: Ten Must-Have Bookmarklets | Lifehacker

Acronym lookup / Google Translate / Urban Dictionary lookup / Wikipedia lookup / Alexa site profile / del.icio.us linkbacks / View cookies / Toss cookies / TinyURL / View passwords

Firefox: del.icio.us/tag/firefox

From Hyperlinks to Hyperwords - YouTube

HyperWords is a free FireFox Addon. Once Hyperwords is activated, whenever You double-click on any word on a web page, a<sep/>

Firefox: del.icio.us/tag/firefox

SourceForge.net: Entrans

"Entrans is an online collaborative translation tool used for editing and translation of PO files."

open-source: del.icio.us tag/open-source

ネットを日本語化する:Japanize

対訳表を元にサイトを無理矢理日本語化してしまう。

Firefox: del.icio.us/tag/firefox

L10n:Localization Process - MozillaWiki

The most recent site devoted to helping you figure out how to localize Firefox (into Latin, in my case, just as soon as I finally get around to it--or someone else beats me to it).

Firefox: del.icio.us/tag/firefox

Page 1 | Next >>