Kenneth Heafield (kpu)

kpu

Geek Repo

Company:University of Edinburgh

Home Page:https://kheafield.com

Github PK Tool:Github PK Tool


Organizations
bitextor
browsermt
marian-nmt
moses-smt
paracrawl

Kenneth Heafield's repositories

kenlm

KenLM: Faster and Smaller Language Model Queries

Language:C++License:NOASSERTIONStargazers:2461Issues:69Issues:366

preprocess

Corpus preprocessing

Language:C++License:NOASSERTIONStargazers:93Issues:8Issues:21

intgemm

int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991

Language:C++License:NOASSERTIONStargazers:63Issues:14Issues:31

fasterText

Library for fast text representation and classification.

Language:HTMLLicense:MITStargazers:29Issues:1Issues:0

lazy

A lazy decoder for syntax

Language:C++License:NOASSERTIONStargazers:14Issues:3Issues:6

mtplz

Code for the paper Faster Phrase-Based Decoding by Refining Feature State

Language:C++License:NOASSERTIONStargazers:14Issues:10Issues:3

runforest

A python library that manages and runs experimental pipelines.

Language:PythonLicense:NOASSERTIONStargazers:4Issues:3Issues:0

elrc-scrape

Scrape ELRC-SHARE for corpora

Language:PythonStargazers:3Issues:3Issues:0

usage

Print resource usage of processes to stderr with LD_PRELOAD

Language:C++Stargazers:3Issues:3Issues:0

azurehacks

Hacks to control Azure instances

Language:ShellStargazers:2Issues:9Issues:0

azure-batch-cli-extensions

Batch extension cli commands for Azure cli v2

Language:PythonLicense:NOASSERTIONStargazers:0Issues:3Issues:0

bitextor

Bitextor generates translation memories from multilingual websites.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:2Issues:0

Bleualign

Machine-Translation-based sentence alignment tool for parallel text

Language:PythonLicense:GPL-2.0Stargazers:0Issues:3Issues:0

clean

A tool for downloading and cleaning parallel corpora

Language:ShellLicense:NOASSERTIONStargazers:0Issues:1Issues:0

CSrankings

A web app for ranking computer science departments according to their research output in selective venues.

Language:PythonStargazers:0Issues:3Issues:0
Language:MakefileStargazers:0Issues:3Issues:0

firefox-translations

Firefox Translations is a webextension that enables client side translations for web browsers.

Language:JavaScriptLicense:MPL-2.0Stargazers:0Issues:1Issues:0

firefox-translations-training

Training pipelines for Firefox Translations neural machine translation models

Language:PythonLicense:MPL-2.0Stargazers:0Issues:1Issues:0

incubator-joshua

Mirror of Apache Joshua (Incubating)

Language:JavaLicense:Apache-2.0Stargazers:0Issues:3Issues:0

marian-dev

Fast Neural Machine Translation in C++ - development repository

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

masakhane-community

All our community docs! Start here! Lets put Africa on the NLP Map

License:MITStargazers:0Issues:1Issues:0

mtdata

A tool that locates, downloads, and extracts machine translation corpora. `pip install --ignore-installed mtdata`

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

OpenNMT

Open-Source Neural Machine Translation in Torch

Language:LuaLicense:MITStargazers:0Issues:3Issues:0

pcqueue

Simple pcqueue testing

Language:C++Stargazers:0Issues:2Issues:0

pyfasthash

Python Non-cryptographic Hash Library

Language:CLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ssplit-cpp

Approximate reimplementation of the sentence splitter from the Moses toolkit.

Language:Emacs LispLicense:NOASSERTIONStargazers:0Issues:2Issues:0

tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0

translateLocally

Fast and secure translation on your local machine, powered by marian and Bergamot.

Language:C++License:MITStargazers:0Issues:1Issues:0

wmt17-website

Website for WMT17 - Second Conference in Machine Translation

Language:HTMLStargazers:0Issues:3Issues:0