
bitext2tmx: Bitext Aligner/Converter

bitext2tmx is a program to align and segment corresponding translated sentences, contained in two plain text files, and generate a translation memory (TMX format) from them for use in computer-aided/assisted translation (CAT) applications.
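As a rough illustration of the output side of that process, the sketch below writes a minimal TMX 1.4 document from sentence pairs that are assumed to be aligned already. It is not taken from the bitext2tmx source: the class name TmxSketch, its method names, and the sample sentences are hypothetical, and the alignment step itself is not shown.

```java
import java.util.List;

// Hypothetical illustration only; not part of bitext2tmx.
public class TmxSketch {

    /** Escapes the characters that are special in XML text content. */
    static String escapeXml(String text) {
        return text.replace("&", "&amp;")
                   .replace("<", "&lt;")
                   .replace(">", "&gt;");
    }

    /** Builds one translation unit (tu) holding a source and a target segment. */
    static String translationUnit(String srcLang, String src, String tgtLang, String tgt) {
        return "    <tu>\n"
             + "      <tuv xml:lang=\"" + srcLang + "\"><seg>" + escapeXml(src) + "</seg></tuv>\n"
             + "      <tuv xml:lang=\"" + tgtLang + "\"><seg>" + escapeXml(tgt) + "</seg></tuv>\n"
             + "    </tu>\n";
    }

    /** Builds a TMX 1.4 document from sentence pairs assumed to be aligned already. */
    static String toTmx(String srcLang, String tgtLang, List<String[]> pairs) {
        StringBuilder tmx = new StringBuilder();
        tmx.append("<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n");
        tmx.append("<tmx version=\"1.4\">\n");
        // The header attributes below are the ones TMX 1.4 marks as required.
        tmx.append("  <header creationtool=\"TmxSketch\" creationtoolversion=\"0.1\"")
           .append(" segtype=\"sentence\" o-tmf=\"plaintext\" adminlang=\"en\"")
           .append(" srclang=\"").append(srcLang).append("\" datatype=\"plaintext\"/>\n");
        tmx.append("  <body>\n");
        for (String[] pair : pairs) {
            // pair[0] is the source sentence, pair[1] its translation.
            tmx.append(translationUnit(srcLang, pair[0], tgtLang, pair[1]));
        }
        tmx.append("  </body>\n");
        tmx.append("</tmx>\n");
        return tmx.toString();
    }

    public static void main(String[] args) {
        List<String[]> pairs = List.of(
            new String[] {"Good morning.", "Bon dia."},
            new String[] {"Salt & pepper.", "Sal i pebre."});
        System.out.print(toTmx("en", "ca", pairs));
    }
}
```

Each tu element pairs one source segment with its translation, which is the unit a CAT application looks up when it searches a translation memory.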

bitext2tmx is a Java application, so it requires Java in order to work on supported operating systems (e.g. Linux, Mac OS X, Solaris, Windows).


A new version of bitext2tmx, 1.0 M1 (milestone 1), is under development. A preliminary, unstable build is available in the Git repository; it must be checked out and compiled before it can be used.


The application can be downloaded from the bitext2tmx project page at SourceForge.


bitext2tmx User Guide:

Technical Support

Community support is available through the bitext2tmx users group (see below). Please post bitext2tmx-related questions to the users group, or email members of this project for further information. Do not post questions to groups or projects that are not directly related to this application; several such incidents have occurred and resulted in erroneous and possibly misleading information for users of bitext2tmx.

Users Group

A Google users group is available for questions and answers about bitext2tmx, and everybody is welcome to join. Come to get help and to help other users in the bitext2tmx community.

Go to the bitext2tmx users group to subscribe or browse messages.


bitext2tmx needs volunteers to participate in its evolution. Both language professionals and software developers are invited to get involved. If you are interested, please contact one of the project managers.


Original development of bitext2tmx was done by Susana Santos, with help from Sergio Ortiz-Rojas and Mikel L. Forcada (members of the Transducens research group at the Departament de Llenguatges i Sistemes Informàtics, Universitat d'Alacant, Spain).

The program originated in the project "Finite-state translators based on bitexts harvested from the net" (2004–2006), which was funded by the now defunct Ministry of Science and Technology of Spain through grant number TIC2003-08681-C02.

Ongoing development and contributions to bitext2tmx are done by project members and others from the free software/open source community.


Catalan & Spanish webpage translations: Mikel L. Forcada.

French webpage translation: Valerie Martineau, trad. a.

German webpage translation: Sabine Cretella.

Software Development

Raymond Martin (OmegaT+ project).



bitext2tmx is released under the GNU General Public License.


Application documentation is released under the Open Publication License (OPL). In particular, this webpage is:

Copyright © 2006-2010 by Mikel L. Forcada and Raymond Martin. This material may be distributed only subject to the terms and conditions set forth in the Open Publication License, v1.0 or later (the latest version is presently available at