SYSTRAN

Last updated
SYSTRAN Translate
SYSTRAN logo.svg
Type of site
Neural machine translation
Available in55 languages (140 combinations)
Founder(s) Dr. Peter Toma
URL www.systransoft.com (SYSTRAN)
www.systran.us (SYSTRAN Americas)
www.systran.net/en/translate/ (SYSTRAN Translate)
CommercialYes
RegistrationOptional
Launched1968;56 years ago (1968)
Current statusActive

SYSTRAN, founded by Dr. Peter Toma in 1968, [1] is one of the oldest machine translation companies. SYSTRAN has done extensive work for the United States Department of Defense and the European Commission.

Contents

SYSTRAN provided the technology for Yahoo! Babel Fish until May 30, 2012, among others. It was used by Google's language tools until 2007. [2] SYSTRAN is used by the Dashboard Translation widget in macOS.

Commercial versions of SYSTRAN can run on Microsoft Windows (including Windows Mobile), Linux, and Solaris. Historically, SYSTRAN systems used rule-based machine translation (RbMT) technology. With the release of SYSTRAN Server 7 in 2010, SYSTRAN implemented a hybrid rule-based/statistical machine translation (SMT) technology which was the first of its kind in the marketplace. [3]

As of 2008, the company had 59 employees of whom 26 are computational experts and 15 computational linguists. [4] The number of employees decreased from 70 in 2006 to 59 in 2008. [4]

History

With its origin in the Georgetown machine translation effort, SYSTRAN was one of the few machine translation systems to survive the major decrease of funding after the ALPAC Report of the mid-1960s. The company was established in La Jolla in California to work on translation of Russian to English text for the United States Air Force during the Cold War. [5] Large numbers of Russian scientific and technical documents were translated using SYSTRAN under the auspices of the USAF Foreign Technology Division (later the National Air and Space Intelligence Center) at Wright-Patterson Air Force Base, Ohio. The quality of the translations, although only approximate, was usually adequate for understanding content.

The company headquarters is in Paris, while its U.S. headquarters is in San Diego, CA.

During the dot-com boom, the international language industry started a new era, and SYSTRAN entered into agreements with a number of translation integrators, the most successful of these being WorldLingo. [6] [7]

In 2016, the Harvard NLP group and SYSTRAN founded OpenNMT, an open source ecosystem for neural machine translation and neural sequence learning. This has enabled machine translation software with learning capabilities, dramatically increasing MT translation quality. The project has since been used in several research and industry applications, and its open source ecosystem is currently maintained by SYSTRAN and Ubiqus.

Business situation

Most of SYSTRAN's revenue comes from a few customers. 57.1% comes from the 10 main customers and the three largest customers account for 10.9%, 8.9%, and 8.9% of its revenues, respectively. [4] Revenues had been declining in the early 2000s: 10.2 million euros in 2004, 10.1 million euros in 2005, 9.3 million euros in 2006, 8.8 million euros in 2007, and 7.6 million euros in 2008, [4] before seeing a rebound in 2009 with 8.6 million euros. [8]

Languages

The following is a list of the languages in which SYSTRAN translate from and to English: [9]


See also

Related Research Articles

Computational linguistics is an interdisciplinary field concerned with the computational modelling of natural language, as well as the study of appropriate computational approaches to linguistic questions. In general, computational linguistics draws upon linguistics, computer science, artificial intelligence, mathematics, logic, philosophy, cognitive science, cognitive psychology, psycholinguistics, anthropology and neuroscience, among others.

<span class="mw-page-title-main">Machine translation</span> Use of software for language translation

Machine translation is use of either rule-based or probabilistic machine learning approaches to translation of text or speech from one language to another, including the contextual, idiomatic and pragmatic nuances of both languages.

Natural language processing (NLP) is an interdisciplinary subfield of computer science and information retrieval. It is primarily concerned with giving computers the ability to support and manipulate human language. It involves processing natural language datasets, such as text corpora or speech corpora, using either rule-based or probabilistic machine learning approaches. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them. To this end, natural language processing often borrows ideas from theoretical linguistics. The technology can then accurately extract information and insights contained in the documents as well as categorize and organize the documents themselves.

Eurotra was a machine translation project established and funded by the European Commission from 1978 until 1992.

<span class="mw-page-title-main">Babel Fish (website)</span> Former multilingual translation website

Yahoo! Babel Fish was a free Web-based multilingual machine translation service of Yahoo! site. In May 2012 it was replaced by Bing Translator, to which queries were redirected. Although Yahoo! has transitioned its Babel Fish translation services to Bing Translator, it did not sell its translation application to Microsoft outright. As the oldest free online language translator, the service translated text or Web pages in 36 pairs between 13 languages, including English, Simplified Chinese, Traditional Chinese, Dutch, French, German, Greek, Italian, Japanese, Korean, Portuguese, Russian, and Spanish.

Language Weaver is the machine translation (MT) technology and brand of RWS. The brand name was revived in 2021 following the acquisition of SDL and Iconic Translation Machines Ltd. and the merging of the respective teams and technologies. Language Weaver was formerly a standalone company that was acquired by SDL in 2010.

<span class="mw-page-title-main">Google Translate</span> Multilingual neural machine translation service

Google Translate is a multilingual neural machine translation service developed by Google to translate text, documents and websites from one language into another. It offers a website interface, a mobile app for Android and iOS, as well as an API that helps developers build browser extensions and software applications. As of 2022, Google Translate supports 133 languages at various levels; it claimed over 500 million total users as of April 2016, with more than 100 billion words translated daily, after the company stated in May 2013 that it served over 200 million people daily.

In the history of artificial intelligence, an AI winter is a period of reduced funding and interest in artificial intelligence research. The field has experienced several hype cycles, followed by disappointment and criticism, followed by funding cuts, followed by renewed interest years or even decades later.

Machine translation is a sub-field of computational linguistics that investigates the use of software to translate text or speech from one natural language to another.

<span class="mw-page-title-main">NComputing</span> Virtualization software company

NComputing is a desktop virtualization company that manufactures hardware and software to create virtual desktops which enable multiple users to simultaneously share a single operating system instance.

Mobile translation is any electronic device or software application that provides audio translation. The concept includes any handheld electronic device that is specifically designed for audio translation. It also includes any machine translation service or software application for hand-held devices, including mobile telephones, Pocket PCs, and PDAs. Mobile translation provides hand-held device users with the advantage of instantaneous and non-mediated translation from one human language to another, usually against a service fee that is, nevertheless, significantly smaller than a human translator charges.

Informatics is the study of computational systems. According to the ACM Europe Council and Informatics Europe, informatics is synonymous with computer science and computing as a profession, in which the central notion is transformation of information. In some cases, the term "informatics" may also be used with different meanings, e.g. in the context of social computing, or in context of library science.

Philipp Koehn is a computer scientist and researcher in the field of machine translation. His primary research interest is statistical machine translation and he is one of the inventors of a method called phrase based machine translation. This is a sub-field of statistical translation methods that employs sequences of words as the basis of translation, expanding the previous word based approaches. A 2003 paper which he authored with Franz Josef Och and Daniel Marcu called Statistical phrase-based translation has attracted wide attention in Machine translation community and has been cited over a thousand times. Phrase based methods are widely used in machine translation applications in industry.

ProMT is a lead Russian developer of language translation software for businesses and private users since 1991. The company provides on-premises software based on neural technologies.

LEPOR is an automatic language independent machine translation evaluation metric with tunable parameters and reinforced factors.

<span class="mw-page-title-main">Yandex Translate</span> Translation web service by Yandex

Yandex Translate is a web service provided by Yandex, intended for the translation of web pages into another language.

Neural machine translation (NMT) is an approach to machine translation that uses an artificial neural network to predict the likelihood of a sequence of words, typically modeling entire sentences in a single integrated model.

Google Neural Machine Translation (GNMT) is a neural machine translation (NMT) system developed by Google and introduced in November 2016 that uses an artificial neural network to increase fluency and accuracy in Google Translate. The neural network consists of two main blocks, an encoder and a decoder, both of LSTM architecture with 8 1024-wide layers each and a simple 1-layer 1024-wide feedforward attention mechanism connecting them. The total number of parameters has been variously described as over 160 million, approximately 210 million, 278 million or 380 million.

<span class="mw-page-title-main">Semantic parsing</span>

Semantic parsing is the task of converting a natural language utterance to a logical form: a machine-understandable representation of its meaning. Semantic parsing can thus be understood as extracting the precise meaning of an utterance. Applications of semantic parsing include machine translation, question answering, ontology induction, automated reasoning, and code generation. The phrase was first used in the 1970s by Yorick Wilks as the basis for machine translation programs working with only semantic representations. Semantic parsing is one of the important tasks in computational linguistics and natural language processing.

References

  1. Systran: Past and Present - https://lilab.unibas.ch/staff/tenhacken/Applied-CL/3_Systran/3_Systran.html#history
  2. Google Switches to Its Own Language System, Google Operating System, October 2007
  3. Grunwald, David. "Inside SYSTRAN Training Server 7". GTS. Retrieved 30 May 2011.
  4. 1 2 3 4 "SYSTRAN annual financial report 2008" (PDF). SYSTRAN. 2008. Retrieved 2012-04-04.
  5. Bruno Giussani (March 10, 1998). "Free Translation of Language Proves More Divertimento Than a Keg of Monkeys". The New York Times .
  6. "Machine Translation". translations.com. 2016-12-01. Retrieved 2018-03-02.
  7. "With Online Services, Foreign Texts Can Get Lost in Translation". The Wall Street Journal (WSJ). December 20, 2007.
  8. "SYSTRAN annual financial report 2009" (PDF). SYSTRAN. 2009. p. 63. Retrieved 2012-04-04.
  9. "Language Catalogue," (PDF). SYSTRAN. Retrieved 11 October 2021.