Travatar
This is the home of Travatar, a tree-to-string statistical machine translation system. In other words, when performing translation, you first parse the input using a syntactic parser, then Travatar will perform translation for you. This is particularly effective for language pairs that require a large amount of reordering, such as English-Japanese.
Download/Install
Preparation
Travatar is mainly developed on linux and Mac OSX, but should also work on Windows (via cygwin). In order to install Travatar, you must have git, Boost and autotools installed. The easiest way to install these tools on (Ubuntu/Debian) Linux is:
sudo apt-get install git g++ libboost-all-dev libz-dev autoconf automake autotools-dev libtool
Download/Compilation
The code of Travatar is distributed according to the LGPL, and can be distributed freely according to the license. You can download the latest version of Travatar by going to the github page, or directly by using the following git command.
git clone https://github.com/neubig/travatar.git
Next, compile travatar with the following commands:
cd travatar autoreconf -i ./configure make src/bin/travatar --help
If the final command prints the Travatar help, everything is working properly.
Program Documentation
Overview
The following paper gives an overview of the motivation behind Travatar, and it would be great if you could cite it if you use Travatar in your work:
Travatar: A Forest-to-String Machine Translation Engine based on Tree Transducers
Graham Neubig. In Proceedings of the ACL Demonstration Track. 2013.
@inproceedings{neubig13travatar, title = {Travatar: A Forest-to-String Machine Translation Engine based on Tree Transducers}, author = {Graham Neubig}, booktitle = {Proceedings of the ACL Demonstration Track}, address = {Sofia, Bulgaria}, month = {August}, year = {2013} }
Training
In order to run Travatar, you will have to train a model. Please take a look at the step-by-step guide to creating a model for Travatar. There are also a number of training options that will affect how training works.
Translating
In order to translate with Travatar, you use the model defined in the previous step. In the most simple of situations, you must simply run the following command, substituting travatar.ini with the appropriate configuration file:
cat input-parsed.txt | travatar -config_file travatar.ini > output-words.txt
Here, input-parsed.txt is a file containing one syntactic parse tree per line.
Development/Support
Contributors
- Graham Neubig (Most Coding)
- Philip Arthur (Hiero, SCFG coding)
- Koichi Akabe (Visualization/Analysis Scripts)
- Akiba Miura (Fixes to Training Scripts)
- Yusuke Oda (Refactoring, etc.)
- Kevin Duh (Compound Splitting)
If you have any questions about Travatar, please send them to the travatar-users mailing list, which you can join by sending mail to travatar-users+subscribe with the domain googlegroups.com. (If you don't get a reply, you can also send questions to neubig at gmail.com.)