Cubit - A Cube Pruning Decoder for Phrase-based Translation

Welcome! This page contains information about Cubit, an efficient Python implementation of phrase-based decoding à la Pharaoh. It uses cube pruning, a technique inspired by k-best parsing, to accelerate language model integration (hence the name "cubit"). Under typical parameter settings, it can achieve a 10- to 30-fold speedup over conventional beam search at the same level of search errors, or an even larger speedup at the same BLEU score. It is described in Section 5.1 of the following paper:

Liang Huang and David Chiang (2007). Forest Rescoring: Faster Decoding with Integrated Language Models. In Proceedings of the ACL, Prague, Czech Rep.

Version 0.8 (for reference use only, as an implementation of cube pruning). Released Feb. 27, 2008.

The manual will contain a more detailed description of the adaptation of cube pruning to phrase-based decoding.
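
As a rough illustration of the general idea behind cube pruning, the sketch below lazily enumerates the best combinations of two sorted lists (for example, partial hypotheses and candidate phrase translations) with a priority queue, instead of scoring every pair. It is not taken from the Cubit source: the function name cube_prune, its arguments, and the combo_cost callback (a stand-in for the language model cost of joining two items) are all assumptions made for this example. Because a real language model cost is not monotonic across the grid, the result is approximate in general; it is exact only when combo_cost is monotonic, such as a constant.

```python
import heapq


def cube_prune(hyps, phrases, combo_cost, k):
    """Return up to k (cost, i, j) items, explored best-first from the grid corner.

    hyps and phrases are lists of costs sorted ascending (lower is better).
    combo_cost(i, j) is a hypothetical stand-in for the language model cost
    of joining hypothesis i with phrase j.
    """
    if not hyps or not phrases or k <= 0:
        return []

    def total(i, j):
        return hyps[i] + phrases[j] + combo_cost(i, j)

    # Start from the top-left corner of the grid, the best pair before
    # the combination cost is taken into account.
    heap = [(total(0, 0), 0, 0)]
    seen = {(0, 0)}
    out = []
    while heap and len(out) < k:
        cost, i, j = heapq.heappop(heap)
        out.append((cost, i, j))
        # Push the two grid neighbours of the item just popped.
        for ni, nj in ((i + 1, j), (i, j + 1)):
            if ni < len(hyps) and nj < len(phrases) and (ni, nj) not in seen:
                seen.add((ni, nj))
                heapq.heappush(heap, (total(ni, nj), ni, nj))
    return out
```

For instance, cube_prune([1, 2, 4], [1, 3, 6], lambda i, j: 0, 3) scores only a handful of the nine grid cells before returning three items. The saving grows with the size of the lists, which is where the speedup over scoring every pair comes from.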

About

Rescued from https://web.archive.org/web/20150918204917/http://www.cis.upenn.edu/~lhuang3/cubit/cubit-v0.8.tgz

Languages

Python 62.6%, C 33.5%, Cython 2.7%, C++ 1.0%, Makefile 0.2%