Language Weaver

Type of site: Neural machine translation
Available in: 58 languages and language varieties
Owner: RWS
URL: www.rws.com/language-weaver/
Commercial: Yes
Registration: Yes
Launched: 2002
Current status: Active

Language Weaver is the machine translation (MT) technology and brand of RWS. The brand name was revived in 2021 following RWS's acquisitions of SDL [1] and Iconic Translation Machines Ltd. [2] and the merging of their teams and technologies. Language Weaver was originally a standalone company, which SDL acquired in 2010.

History

Language Weaver was a Los Angeles, California–based company founded in 2002 as a spin-out from the University of Southern California. It was founded to commercialise statistical machine translation (SMT), a statistical approach to automatic language translation and natural language processing. The company's name refers to Warren Weaver, a pioneer of machine translation, who proposed using computers to ‘decode’ or ‘decrypt’ language in a 1947 letter and developed the idea in his 1949 memorandum "Translation".

Language Weaver’s statistical approach to machine translation was cutting-edge at the time and a significant improvement over earlier approaches such as rule-based MT. The company grew steadily over an eight-year period, reaching 96 staff across offices in the US, Europe, and Japan. It did significant business with government organisations, where its name remains well recognised.

In July 2010, Language Weaver was acquired by SDL plc for $42.5 million and the company was renamed SDL Language Weaver. [3]

SDL Language Weaver

SDL Language Weaver was the primary machine translation technology at SDL, where it evolved over time from SMT to syntax-based MT and then to neural machine translation (NMT). The Language Weaver brand was retired in 2015 in favour of SDL BeGlobal for the cloud-based solution and SDL Enterprise Translation Server for the on-premises solution. These products were later rebranded as SDL Machine Translation Cloud and SDL Machine Translation Edge respectively.

2021 Relaunch

The Language Weaver brand was revived in 2021 following the acquisition of SDL by RWS and the merger of the SDL MT and Iconic Translation Machines teams and technologies. The combined technologies of both companies, built on Transformer-based neural machine translation, are sold as "Language Weaver" for cloud-based MT and "Language Weaver Edge" for on-premises MT.

Supported languages

As of September 2021, Language Weaver supports the following languages and language varieties: [4]

  1. Albanian
  2. Arabic
  3. Armenian
  4. Bengali
  5. Bulgarian
  6. Burmese
  7. Catalan
  8. Chinese (Simplified)
  9. Chinese (Traditional)
  10. Croatian
  11. Czech
  12. Danish
  13. Dari
  14. Dutch
  15. English
  16. Estonian
  17. Finnish
  18. French
  19. French (Canada)
  20. Georgian
  21. German
  22. Greek
  23. Hausa
  24. Hebrew
  25. Hindi
  26. Hungarian
  27. Indonesian
  28. Italian
  29. Japanese
  30. Javanese
  31. Khmer
  32. Korean
  33. Kurdish (Kurmanji)
  34. Latvian
  35. Lithuanian
  36. Malay
  37. Maltese
  38. Norwegian
  39. Pashto
  40. Persian
  41. Polish
  42. Portuguese
  43. Portuguese (Brazil)
  44. Romanian
  45. Russian
  46. Serbian
  47. Slovak
  48. Slovenian
  49. Somali
  50. Spanish
  51. Swahili
  52. Swedish
  53. Thai
  54. Turkish
  55. Ukrainian
  56. Urdu
  57. Uzbek
  58. Vietnamese


Notes and references

  1. "Acquisition of SDL by RWS enhances capabilities in Language Services and Technology".
  2. "RWS Acquires MT Provider Iconic".
  3. "SDL Acquires Language Weaver, Adds Statistical Machine Translation to Portfolio".
  4. Ciucean, Andrei. "MT Edge and Language Pair Versions - March Update - Forum - Language Weaver Edge - RWS Community". community.sdl.com. Retrieved 23 September 2021.

