Interactive machine translation

Last updated

Interactive machine translation (IMT), is a specific sub-field of computer-aided translation. Under this translation paradigm, the computer software that assists the human translator attempts to predict the text the user is going to input by taking into account all the information it has available. Whenever such prediction is wrong and the user provides feedback to the system, a new prediction is performed considering the new information available. Such process is repeated until the translation provided matches the user's expectations.

Contents

Interactive machine translation is specially interesting when translating texts in domains where it is not admissible to output a translation containing errors, hence requiring a human user to amend the translations provided by the system. In such cases, interactive machine translation has been proved to provide benefit to potential users. [1] [2] [3] Nevertheless, there are few commercial software that implements interactive machine translation and work done in the field is mostly restrained to academic research.

History

Historically, interactive machine translation is born as an evolution of the computer-aided translation paradigm, where the human translator and the machine translation system were intended to work as a tandem. [4] This first work was extended within the TransType research project, funded by the Canadian government. In this project, the human interaction was aimed towards producing the target text for the first time by embedding data-driven machine translation techniques within the interactive translation environment with the goal of achieving the best of both actors: the efficiency of the automatic system and the reliability of human translators.

Later, a larger-scale research project, TransType2, [1] [3] funded by the European Commission extended such work by analyzing the incorporation of a complete machine translation system into the process, with the goal of producing a complete translation hypothesis, which the human user is allowed to amend or accept. If the user decides to amend the hypothesis, the system then attempts to make the best use of such feedback in order to produce a new translation hypothesis that takes into account the modifications introduced by the user.

More recently, CASMACAT, [5] also funded by the European Commission, aimed at developing novel types of assistance to human translators and integrated them into a new workbench, consisting of an editor, a server, and analysis and visualisation tools. The workbench was designed in a modular fashion and can be combined with existing computer aided translation tools. Furthermore, the CASMACAT workbench can learn from the interaction with the human translator by updating and adapting its models instantly based on the translation choices of the user. [6] [7]

Recent work on involving an extensive evaluation with human users [8] revealed the fact that interactive machine translation may even be used by users that do not speak the source language in order to achieve near professional translation quality. Moreover, it also elucidated the fact that an interactive scenario is more beneficial than a classic post-edition scenario.

The previously described approaches rely on a tightly coupled underlying corpus-based machine translation system (usually, a Statistical machine translation system) that is used as a glass box, therefore inheriting the shortcomings of the translation systems and limiting the usage of interactive machine translation for some scenarios. For this reason, an approach that uses any kind of bilingual resource (not limited to machine translation) as a black-box to provide interactive machine translation was developed. [9] This approach is not able to extract as much information from the bilingual resources used, due to the black-box nature of the interaction, but can use any resource available to the user. Forecat is a black-box interactive machine translation implementation that is available both as a web application (that includes a webpage and a web services interface) and as a plugin for OmegaT (Forecat-OmegaT).

Process

The interactive machine translation process starts with the system suggesting a translation hypothesis to the user. Then, the user may accept the complete sentence as correct, or may modify it if he considers there is some error. Typically, when modifying a given word, it is assumed that the prefix until that word is correct, leading to a left-to-right interaction scheme. Once the user has changed the word considered incorrect, the system then proposes a new suffix, i.e. the remainder of the sentence. Such process continues until the translation provided satisfies the user.

Although explained at the word level, the previous process may also be implemented at the character level, and hence the system provides a suffix whenever the human translator types in a single character. In addition, there is ongoing effort towards changing the typical left-to-right interaction scheme in order to make human-machine interaction easier. [10] [11]

A similar approach is used in the Caitra translation tool.

Evaluation

Evaluation is a difficult issue in interactive machine translation. Ideally, evaluation should take place in experiments involving human users. However, given the high monetary cost this would imply, this is seldom the case. Moreover, even when considering human translators in order to perform a true evaluation of interactive machine translation techniques, it is not clear what should be measured in such experiments, since there are many different variables that should be taken into account and cannot be controlled, as is for instance the time the user takes in order to get used to the process. In the CASMACAT project, some field trials have been carried out to study some of these variables. [12] [13] [14]

For quick evaluations in laboratory conditions, interactive machine translation is measured by using the key stroke ratio or the word stroke ratio. Such criteria attempt to measure how many key-strokes or words did the user need to introduce before producing the final translated document. [3]

Differences with classical computer-aided translation

Although interactive machine translation is a sub-field of computer-aided translation, the main attractive of the former with respect to the latter is the interactivity. In classical computer-aided translation, the translation system may suggest one translation hypothesis in the best case, and then the user is required to post-edit such hypothesis. In contrast, in interactive machine translation the system produces a new translation hypothesis each time the user interacts with the system, i.e. after each word (or letter) has been introduced.

See also

Related Research Articles

Computational linguistics is an interdisciplinary field concerned with the computational modelling of natural language, as well as the study of appropriate computational approaches to linguistic questions. In general, computational linguistics draws upon linguistics, computer science, artificial intelligence, mathematics, logic, philosophy, cognitive science, cognitive psychology, psycholinguistics, anthropology and neuroscience, among others.

<span class="mw-page-title-main">Machine translation</span> Use of software for language translation

Machine translation is use of computational techniques to translate text or speech from one language to another, including the contextual, idiomatic and pragmatic nuances of both languages.

<span class="mw-page-title-main">User interface</span> Means by which a user interacts with and controls a machine

In the industrial design field of human–computer interaction, a user interface (UI) is the space where interactions between humans and machines occur. The goal of this interaction is to allow effective operation and control of the machine from the human end, while the machine simultaneously feeds back information that aids the operators' decision-making process. Examples of this broad concept of user interfaces include the interactive aspects of computer operating systems, hand tools, heavy machinery operator controls and process controls. The design considerations applicable when creating user interfaces are related to, or involve such disciplines as, ergonomics and psychology.

Word-sense disambiguation (WSD) is the process of identifying which sense of a word is meant in a sentence or other segment of context. In human language processing and cognition, it is usually subconscious/automatic, but can often come to conscious attention when ambiguity impairs clarity of communication, given the pervasive polysemy in natural language. In computational linguistics, it is an open problem that affects other computer-related writing, such as discourse, improving relevance of search engines, anaphora resolution, coherence, and inference.

Question answering (QA) is a computer science discipline within the fields of information retrieval and natural language processing (NLP) that is concerned with building systems that automatically answer questions that are posed by humans in a natural language.

Computer-aided translation (CAT), also referred to as computer-assisted translation or computer-aided human translation (CAHT), is the use of software, also known as a translator, to assist a human translator in the translation process. The translation is created by a human, and certain aspects of the process are facilitated by software; this is in contrast with machine translation (MT), in which the translation is created by a computer, optionally with some human intervention.

Human-centered computing (HCC) studies the design, development, and deployment of mixed-initiative human-computer systems. It is emerged from the convergence of multiple disciplines that are concerned both with understanding human beings and with the design of computational artifacts. Human-centered computing is closely related to human-computer interaction and information science. Human-centered computing is usually concerned with systems and practices of technology use while human-computer interaction is more focused on ergonomics and the usability of computing artifacts and information science is focused on practices surrounding the collection, manipulation, and use of information.

Multimodal interaction provides the user with multiple modes of interacting with a system. A multimodal interface provides several distinct tools for input and output of data.

ALPAC was a committee of seven scientists led by John R. Pierce, established in 1964 by the United States government in order to evaluate the progress in computational linguistics in general and machine translation in particular. Its report, issued in 1966, gained notoriety for being very skeptical of research done in machine translation so far, and emphasizing the need for basic research in computational linguistics; this eventually caused the U.S. government to reduce its funding of the topic dramatically. This marked the beginning of the first AI winter.

Martin Kay was a computer scientist, known especially for his work in computational linguistics.

Computational humor is a branch of computational linguistics and artificial intelligence which uses computers in humor research. It is a relatively new area, with the first dedicated conference organized in 1996.

<span class="mw-page-title-main">Visual analytics</span>

Visual analytics is an outgrowth of the fields of information visualization and scientific visualization that focuses on analytical reasoning facilitated by interactive visual interfaces.

<span class="mw-page-title-main">Human–computer interaction</span> Academic discipline studying the relationship between computer systems and their users

Human–computer interaction (HCI) is research in the design and the use of computer technology, which focuses on the interfaces between people (users) and computers. HCI researchers observe the ways humans interact with computers and design technologies that allow humans to interact with computers in novel ways. A device that allows interaction between human being and a computer is known as a "Human-computer Interface (HCI)".

Caitra is a translation Computer Assisted Tool, or CAT, developed by the University of Edinburgh. Provided from an online platform, Caitra is based on AJAX Web.2 technologies and the Moses decoder. The web page of the tool is implemented with Ruby on Rails, an open source web framework, and C++.

In natural language processing, textual entailment (TE), also known as natural language inference (NLI), is a directional relation between text fragments. The relation holds whenever the truth of one text fragment follows from another text.

The following outline is provided as an overview of and topical guide to natural-language processing:

LEPOR is an automatic language independent machine translation evaluation metric with tunable parameters and reinforced factors.

MateCat is a web-based computer-assisted translation (CAT) tool, released as open-source software under the Lesser General Public License (LGPL).

Neural machine translation (NMT) is an approach to machine translation that uses an artificial neural network to predict the likelihood of a sequence of words, typically modeling entire sentences in a single integrated model.

<span class="mw-page-title-main">Semantic parsing</span>

Semantic parsing is the task of converting a natural language utterance to a logical form: a machine-understandable representation of its meaning. Semantic parsing can thus be understood as extracting the precise meaning of an utterance. Applications of semantic parsing include machine translation, question answering, ontology induction, automated reasoning, and code generation. The phrase was first used in the 1970s by Yorick Wilks as the basis for machine translation programs working with only semantic representations. Semantic parsing is one of the important tasks in computational linguistics and natural language processing.

References

  1. 1 2 Casacuberta, Francisco; Civera, Jorge; Cubel, Elsa; Lagarda, Antonio L.; Lapalme, Guy; Macklovitch, Elliott; Vidal, Enrique (2009). "Human interaction for high quality machine translation" (PDF). Communications of the ACM. 52 (10): 135–138. doi:10.1145/1562764.1562798. Archived from the original (PDF) on 2011-07-06.
  2. Herbig, Nico; Pal, Santanu; van Genabith, Josef; Krüger, Antonio (2019). "Integrating Artificial and Human Intelligence for Efficient Translation". arXiv: 1903.02978 [cs.HC].
  3. 1 2 3 Barrachina, Sergio; Bender, Oliver; Casacuberta, Francisco; Civera, Jorge; Cubel, Elsa; Khadivi, Shahram; Lagarda, Antonio L.; Ney, Hermann; Tomás, Jesús; Vidal, Enrique (2009). "Statistical approaches to computer-assisted translation" (PDF). Computational Linguistics. 25 (1): 3–28. doi: 10.1162/coli.2008.07-055-r2-06-29 .
  4. Foster, George; Isabelle, Pierre; Plamondon, Pierre (1997). "Target-text mediated interactive machine translation". Machine Translation. 12 (1): 175–194. doi:10.1023/a:1007999327580.
  5. Alabau, Vicent; Buck, Christian; Carl, Michael; Casacuberta, Francisco; Garcia-Martinez, Mercedes; Germann, Ulrich; Gonzalez-Rubio, Jesus; Hill, Robin; Koehn, Philipp; Leiva, Luis; Mesa-Lao, Barto; Ortiz, Daniel; Saint-Amand, Herve; Sanchis, German; Tsoukala, Chara (April 2014). "CASMACAT: A Computer-assisted Translation Workbench" (PDF). Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics. Los Angeles, California: Association for Computational Linguistics. pp. 25–28.
  6. Ortiz-Martinez, Daniel; Garcia-Varea, Ismael; Casacuberta, Francisco (June 2010). "Online Learning for Interactive Statistical Machine Translation" (PDF). Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL. Association for Computational Linguistics. pp. 546–554.
  7. Martinez-Gomez, Pascual; Sanchis-Trilles, German; Casacuberta, Francisco (September 2012). "Online adaptation strategies for statistical machine translation in post-editing scenarios". Pattern Recognition. 45 (9). Elsevier: 3193–3203. doi:10.1016/j.patcog.2012.01.011. hdl: 10251/37324 .
  8. Koehn, Philipp (June 2010). "Enabling Monolingual Translators: Post-Editing vs. Options" (PDF). Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL). Los Angeles, California: Association for Computational Linguistics. pp. 537–545.
  9. Juan Antonio, Pérez-Ortiz; Torregrosa, Daniel; Forcada, Mikel (2014). "Black-box integration of heterogeneous bilingual resources into an interactive translation system". Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation. Los Angeles, California: Association for Computational Linguistics. pp. 57–65.
  10. Sanchis-Trilles, Germán; Ortiz-Martínez, Daniel; Civera, Jorge; Casacuberta, Francisco; Vidal, Enrique; Hoang, Hieu (October 2008). "Improving Interactive Machine Translation via Mouse Actions" (PDF). Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP). Honolulu, Hawaii: Association for Computational Linguistics. pp. 485–494.
  11. González-Rubio, Jesús; Ortiz-Martínez, Daniel; Casacuberta, Francisco (July 2010). "Balancing User Effort and Translation Error in Interactive Machine Translation via Confidence Measures" (PDF). Proceedings of the ACL 2010 Conference Short Papers (ACL). Uppsala, Sweden: Association for Computational Linguistics. pp. 173–177.
  12. Underwood, Nancy; Mesa-Lao, Bartolomé; García-Martínez, Mercedes; Carl, Michael; Alabau, Vicent; González-Rubio, Jesús; Leiva, Luis; Sanchis-Trilles, Germán; Ortiz-Martínez, Daniel; Casacuberta, Francisco (May 2014). "Evaluating the Effects of Interactivity in a Post-Editing Workbench" (PDF). Proceedings of the 29th edition of the Language Resources and Evaluation Conference (LREC). Reykjavik, Iceland. pp. 553–559.
  13. Ortiz-Martínez, Daniel; González-Rubio, Jesús; Alabau, Vicent; Sanchis-Trilles, Germán; Casacuberta, Francisco (August 2015). "Integrating Online and Active Learning in a Computer-Assisted Translation Workbench". New Directions in Empirical Translation Process Research: Exploring the CRITT TPR-DB. Springer. pp. 54–73.
  14. Alabau, Vicent; Carl, Michael; Casacuberta, Francisco; García-Martínez, Mercedes; Mesa-Lao, Bartolomé; Ortiz-Martínez, Daniel; González-Rubio, Jesús; Sanchis-Trilles, Germán; Schaeffer, Moritz (August 2015). "Learning Advanced Post-editing". New Directions in Empirical Translation Process Research: Exploring the CRITT TPR-DB. Springer. pp. 95–111.