bitext2tmx is a program that segments and aligns corresponding translated sentences contained in two plain text files and generates a translation memory from them, in TMX format, for use in computer-assisted translation applications. In particular, it makes a nice companion to OmegaT and OmegaT+.
bitext2tmx (B2T) is a Java application and works on any operating system that supports Java (e.g. Linux, Mac OS X, Solaris, Windows).
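To give a rough idea of what "generating a translation memory in TMX format" means, here is a minimal sketch in Java. It is not taken from the bitext2tmx sources: the class name TmxSketch, its methods, and the header attribute values are hypothetical, and it assumes the sentences have already been segmented and aligned into source/target pairs. It only shows how such pairs might be written out as TMX 1.4 translation units.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; not part of bitext2tmx. */
final class TmxSketch {

    /** Escapes the characters that are special in XML text content. */
    static String escape(String text) {
        return text.replace("&", "&amp;")
                   .replace("<", "&lt;")
                   .replace(">", "&gt;");
    }

    /**
     * Builds a minimal TMX 1.4 document from already-aligned pairs.
     * Each array is {source segment, target segment}. The language
     * codes are assumed to be plain codes such as "en" or "es".
     */
    static String toTmx(String srcLang, String tgtLang, List<String[]> pairs) {
        StringBuilder sb = new StringBuilder();
        sb.append("<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n");
        sb.append("<tmx version=\"1.4\">\n");
        // Header values below are placeholders chosen for this sketch.
        sb.append("  <header creationtool=\"TmxSketch\" creationtoolversion=\"0.1\"")
          .append(" segtype=\"sentence\" o-tmf=\"plain text\" adminlang=\"en\"")
          .append(" srclang=\"").append(srcLang).append("\"")
          .append(" datatype=\"plaintext\"/>\n");
        sb.append("  <body>\n");
        for (String[] pair : pairs) {
            // One translation unit (tu) per aligned sentence pair.
            sb.append("    <tu>\n");
            sb.append("      <tuv xml:lang=\"").append(srcLang).append("\"><seg>")
              .append(escape(pair[0])).append("</seg></tuv>\n");
            sb.append("      <tuv xml:lang=\"").append(tgtLang).append("\"><seg>")
              .append(escape(pair[1])).append("</seg></tuv>\n");
            sb.append("    </tu>\n");
        }
        sb.append("  </body>\n");
        sb.append("</tmx>\n");
        return sb.toString();
    }

    public static void main(String[] args) {
        List<String[]> pairs = new ArrayList<>();
        pairs.add(new String[] {"Hello.", "Hola."});
        pairs.add(new String[] {"Salt & pepper.", "Sal y pimienta."});
        System.out.print(toTmx("en", "es", pairs));
    }
}
```

The sketch uses generics and the diamond operator, so it needs a newer Java version than the JRE 1.4 minimum that B2T itself requires. The hard part of what B2T does, segmenting the two texts and aligning the sentences, is deliberately left out here.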
Go to the project's screenshots page to see some images of B2T.
A new version of B2T is in development. Version 1.0 M1 (milestone 1) is under heavy development and will hopefully be available within the next few weeks. In the interim, a development version based on version 0.9 has been packaged and released as version 1.0 M0. It has only minimal changes compared to version 0.9; its most significant new feature is the ability to load bitext from a TMX file, the lack of which limited the usefulness of the old version. More significant changes will come in M1 and on the way to the final 1.0 release.
Initial development by Susana Santos, with help from Sergio Ortiz-Rojas and Mikel L. Forcada (members of the Transducens research group at the Departament de Llenguatges i Sistemes Informàtics, Universitat d'Alacant, Spain).
Ongoing development by Raymond Martin (OmegaT+ project).
The program originated within the project "Finite-state translators based on bitexts harvested from the net" (2004–2006), which was funded by the now-defunct Ministry of Science and Technology of Spain through grant number TIC2003-08681-C02.
Contributions to the program are also made by other project members from the free software/open source community.
The latest bitext2tmx package can be downloaded from the bitext2tmx project SourceForge page. A snapshot of current development can be obtained from the project's CVS repository.
A Java Runtime Environment (JRE), version 1.4 or higher, is required on the system where bitext2tmx is to be used. A JRE may be obtained from http://java.com (Linux, Solaris, Windows) and other sources (e.g., http://www-128.ibm.com/developerworks/java/jdk/index.html [Linux, Windows]). Documentation on Java installation and use is also available at the same locations.
Mac OS X users already have a JRE installed by default. Just get bitext2tmx and run it.
To get started working with bitext2tmx, read "Bitext2tmx: QuickStart".
Catalan & Spanish webpage translations: Mikel L. Forcada.
French webpage translation: Valerie Martineau, trad. a.
German webpage translation: Sabine Cretella.
This application is released under the GNU General Public License.
Application documentation is released under the Open Publication License (OPL). In particular, this webpage is:
Copyright © 2006-2008 by Mikel L. Forcada and Raymond Martin. This material may be distributed only subject to the terms and conditions set forth in the Open Publication License, v1.0 or later (the latest version is presently available at http://www.opencontent.org/openpub/).
Last modified February 22, 2008 by Raymond Martin