Machine Translation: What is it and How Does it Work?

Machine Translation: What is it and How Does it Work?

Gone are the days when translating one language to another required the use of a bilingual dictionary. Nowadays, when you come across words that are in a foreign language, you log on to an online translation platform and get the results instantly. This is basically what machine translation is; an automated process of rendering one language to another. The use of machine translation has become so common that Google Translate reports that it translates over 100 billion words a day.

Aside from personal use, machine translation (MT) helps brands and businesses expand their reach to global audiences. Now more than ever, website content is being translated into numerous languages to help break the language barrier. By doing so, they are not only able to expand to new international markets but are also helping to demarginalize groups that were initially not privy to information on the internet.

Machine translation has an edge over human translation due to its speed and cost. With computers, the translation is instantaneous, at less than a third of the cost. Despite the improvement of the output from MT, the general perception by professionals and businesses that it cannot be a substitute for human translation is still prevalent. Some companies opt to integrate their translation process; using MT for initial translation then doing some editorial work to further improve the quality and accuracy. Used correctly, machine translation can expedite the translation process without compromising on the quality of the output content.

When Should I Use Machine Translation?

When Should I Use Machine Translation?

How does Machine Translation Works?

Machine translation works by using software to convert one language (source language) to another (target language). Although it sounds straightforward, complex processes go into making even the most basic of translations possible. Different types of MT systems are in use today:

1. Rules-Based Machine Translation

This was the first commercial translation system to be used. RBMT is based on the premise that languages have grammatical, syntactical, and semantic rules that govern them. These rules are predefined by human experts in both the source and target languages and rely heavily on a robust bilingual dictionary. The translation takes place in three phases: analysis, transfer, and generation.

Building RBTM is often time-consuming and expensive but has higher quality outputs compared to others. The vocabulary used can be updated or edited easily to refine the quality of the translated text. These refinements can help the texts read more fluently and remove the machine-like quality that some tend to have.

RBMT works best when translating between languages whose rules are dynamic and abstract.

2. Statistical-Based Machine Translation

SBT relies on the use of statistics to generate translations based on parameters that are derived from the analysis of existing bilingual sets of texts, known as text corpus. Unlike Rules-Based Machine Translation which is word-based, SBT makes uses of phrases that reduce the rigidity imposed on the algorithm by word to word translation.

The statistical models that are derived from the extensive analysis of the bilingual corpus (original and target languages) and the monolingual corpus (target language) work to define which words or phrases are more likely to be used.

The large volume of texts required to run SBT systems has become more available due to the extensive use of the internet and cloud computing. Although the translated output of SBT has higher fluency compared to RBTM, the statistical-based translated text is less consistent.

3. Neural Machine Translation

Neural Machine Translation is the advanced version of SBT. It makes use of a large artificial neural network that predicts the likely sequence of long phrases and sentences. Unlike statistical-based translation, NMT uses less memory since the models are trained jointly to maximize the quality of the translations.

Neural networks (like the ones in our brains) make use of encoder/ decoder technology. During the learning phase, these networks automatically correct the set parameters by comparing the output to the expected translation and then make the necessary adjustments. This means that they require to be trained by humans in order to work. It involves feeding the program with large volumes of data; a process that takes only a few weeks.

NMT is the most advanced method of machine translation and makes use of complex algorithms such as deep learning and AI. This enables it to learn new languages and can be relied upon to produce consistently high-quality output. NMT is currently in use on successful translation platform, Google Translate.

Advantages of Neural Machine Translation

Disadvantages of Neural Machine Translation

Evaluation of Machine Translation

Evaluation of Machine Translation

The quality of the translated content is the most important aspect of translation. This is why linguists and programmers have been trying to create tools that can rate the quality of translations ever since the inception of machine translation in the 50s.

Two approaches can be used while evaluating translations:

The evaluation is based on a predetermined set. The set comprises of the sentences in the source language and their partner texts in the target language. Translated texts are then compared against these sets, and if they are of the same style, a match will be detected.

Types of MT Evaluation

Types of MT Evaluation

1. Manual Evaluation

Humans read through the final text to check its accuracy. The main pointers during the evaluation are fluency and adherence to the meaning of the source text.

When checking for fluency, the source text is unimportant. The evaluator reads through the translation to ensure that it is free of grammatical or syntactical errors.

Then, the text is compared to the original to ensure that it has not veered too far from the message of the source material.

2. Automatic Evaluation

The score obtained is based on the premise that the output should be as close to human translation as possible. Therefore, automatic evaluation relies heavily on pre-existing translations. To improve the accuracy of the score, the evaluation process should be repeated often due to the dynamic nature of languages and MT systems.

Metrics used in the evaluation include METEOR, NIST, and BLEU. They compare the translated text to the reference material and often work without the need for human interference.

Final Thoughts

Machine translations have improved significantly over the years and are helping many firms localize their content to reach a global audience. Properly used, they can produce high-level output with minimal human input. Although they are far from being used independently, MTs are useful for large volumes of content where human translation may be impossible.

Looking for a reliable and secure Machine Translation for your confidential business material? Book a demo now to see Tarjama’s proprietary MT that enables you to deliver multilingual content quickly, securely, and accurately – geared to incorporate your industry’s jargon and writing style into all your translations.


Keep Reading