Translate Toolkit

Last updated
Developer(s) Translate.org.za
Initial release2002
Stable release
3.5.3 / 17 January 2022;20 months ago (2022-01-17)
Repository
Written in Python (programming language)
Operating system Cross-platform
Type Computer-assisted translation
License GPLv2+ [1]
Website toolkit.translatehouse.org

The Translate Toolkit is a localization and translation toolkit. It provides a set of tools for working with localization file formats and files that might need localization. The toolkit also provides an API on which to develop other localization tools.

Contents

The toolkit is written in the Python programming language. It is free software originally developed and released by Translate.org.za in 2002 and is now maintained by Translate.org.za and community developers.

Translate Toolkit uses Enchant as spellchecker.

History

The toolkit was originally developed as the mozpotools by David Fraser for Translate.org.za. Translate.org.za had focused on translating KDE which used Gettext PO files for localization. With an internal change to focus on end-user, cross-platform, OSS software, the organisation decided to localize the Mozilla Suite. This required using new tools and new formats that were not as rich as Gettext PO. Thus mozpotools was created to convert the Mozilla DTD and .properties files to Gettext PO.

Various tools were developed as needed, including pocount, a tool to count source text words to allow correct estimations for work, pogrep, to search through translations, and pofilter, to check for various quality issues.

When Translate.org.za began translating OpenOffice.org it was only natural that the Translate Toolkit would be adapted to handle the OpenOffice.org internal file format. Translating OpenOffice.org using PO files is now the default translation method.

As part of the WordForge project the work received a major boost and the toolkit was further extended to manage XLIFF files alongside PO files. Further funded development has added other features including the ability to convert Open Document Format to XLIFF and the management of placeholders (Variables, acronyms, terminology, etc.).

Design goals

The main aim of the toolkit is to increase the quality of localisation and translation. This is achieved by firstly, focusing on good localisation formats thus the toolkit makes use of the PO and XLIFF localisation formats. This has the benefit that it stops the proliferation of localization formats and allows localizers to work with one good localization tool. For the toolkit this means building converters that can transform files to be translated into these two basic formats.

Secondly, to build tools that allow localizers to increase the general quality of their localization. These tools allow for the extraction of terminology and for checking for the consistent use of terminology. The tools allow for checking for various technical errors such as the correct use of variables.

Lastly, the toolkit provides a powerful localisation API [2] that acts as a base on which to build other localization related tools.

Users

Many translators use the toolkit directly, to do quality checks and to transform files for translation. Further there are and have been several indirect users of the Translate Toolkit API:

Supported document formats

OpenDocument Format support

Work was started in June 2008 to incorporate OpenDocument Format support. [4] This work is funded by the NLnet Foundation and is a collaboration between Translate.org.za and Itaapy [5]

See also

Related Research Articles

A translation memory (TM) is a database that stores "segments", which can be sentences, paragraphs or sentence-like units that have previously been translated, in order to aid human translators. The translation memory stores the source text and its corresponding translation in language pairs called “translation units”. Individual words are handled by terminology bases and are not within the domain of TM.

In computing, gettext is an internationalization and localization system commonly used for writing multilingual programs on Unix-like computer operating systems. One of the main benefits of gettext is that it separates programming from translating. The most commonly used implementation of gettext is GNU gettext, released by the GNU Project in 1995. The runtime library is libintl. gettext provides an option to use different strings for any number of plural forms of nouns, but this feature has no support for grammatical gender. The main filename extensions used by this system are .POT, .PO and .MO.

The Darwin Information Typing Architecture (DITA) specification defines a set of document types for authoring and organizing topic-oriented information, as well as a set of mechanisms for combining, extending, and constraining document types. It is an open standard that is defined and maintained by the OASIS DITA Technical Committee.

<span class="mw-page-title-main">Assistive Technology Service Provider Interface</span>

Assistive Technology Service Provider Interface (AT-SPI) is a platform-neutral framework for providing bi-directional communication between assistive technologies (AT) and applications. It is the de facto standard for providing accessibility to free and open desktops, like Linux or OpenBSD, led by the GNOME Project.

This is an overview of software support for the OpenDocument format, an open document file format for saving and exchanging editable office documents.

<span class="mw-page-title-main">OmegaT</span> Computer assisted translation tool written in Java

OmegaT is a computer-assisted translation tool written in the Java programming language. It is free software originally developed by Keith Godfrey in 2000, and is currently developed by a team led by Aaron Madlon-Kay.

.properties is a file extension for files mainly used in Java-related technologies to store the configurable parameters of an application. They can also be used for storing strings for Internationalization and localization; these are known as Property Resource Bundles.

<span class="mw-page-title-main">Okapi Framework</span>

The Okapi Framework is a cross-platform and open-source set of components and applications that offer extensive support for localizing and translating documentation and software.

OAXAL: Open Architecture for XML Authoring and Localization is an Organization for the Advancement of Structured Information Standards (OASIS) standards-based initiative to encourage the development of an open Standards approach to XML Authoring and Localization. OAXAL is an official OASIS Reference Architecture Technical Committee.

<span class="mw-page-title-main">Pootle</span> Free translation software

Pootle is an online translation management tool with a translation interface. It is written in the Python programming language using the Django framework and is free software originally developed and released by Translate.org.za in 2004. It was further developed as part of the WordForge project and the African Network for Localisation and is now maintained by Translate.org.za.

XLIFF is an XML-based bitext format created to standardize the way localizable data are passed between and among tools during a localization process and a common format for CAT tool exchange. The XLIFF Technical Committee (TC) first convened at OASIS in December 2001, but the first fully ratified version of XLIFF appeared as XLIFF Version 1.2 in February 2008. Its current specification is v2.1 released on 2018-02-13, which is backwards compatible with v2.0 released on 2014-08-05.

<span class="mw-page-title-main">Virtaal</span>

Virtaal is a computer-assisted translation tool written in the Python programming language. It is free software developed and maintained by Translate.org.za.

Trados Studio is a computer-assisted translation software tool which offers a complete, centralized translation environment for editing, reviewing and managing translation projects and terminology – It can be used both offline in a desktop tool or online in the cloud. Trados Studio is part of the Trados product portfolio, which is a suite of intelligent translation products owned by RWS that enables freelance translators, language service providers (LSPs) and corporations to streamline processes and improve efficiencies while keeping costs down.

Open Language Tools is a Java project released by Sun Microsystems under the terms of Sun's CDDL.

Transifex is a globalization management system (GMS), a proprietary, web-based translation platform. It targets technical projects with frequently updated content, such as software, documentation, and websites, and encourages the automation of the localization workflow by integrating with common developer tools.

Swordfish Translation Editor is a Computer-assisted translation software.

The Service-Oriented Localisation Architecture Solution, or SOLAS, enables the global conversation in communities and is an open source project of The Rosetta Foundation.

translatewiki.net, formerly named Betawiki, is a web-based translation platform powered by the Translate extension for MediaWiki. It can be used to translate various kinds of texts but is commonly used for creating localisations for software interfaces.

TermBase eXchange (TBX) is an international standard for the representation of structured concept-oriented terminological data, copublished by ISO and the Localization Industry Standards Association (LISA). Originally released in 2002 by LISA's OSCAR special interest group, TBX was adopted by ISO TC 37 in 2008. In 2019 ISO 30042:2008 was withdrawn and revised by ISO 30042:2019. It is currently available as an ISO standard and as an open, industry standard, available at no charge.

References

  1. "Translate/Translate". GitHub . 25 November 2021.
  2. "Translate Toolkit API documentation".
  3. Jacek Śliwerski (2013-12-01). "Closing open-tran.eu". Google Groups. Retrieved December 1, 2017.
  4. "Project page for ODF format support".
  5. "ODF Internationalization (i18n) Test Suite in hforge".