Jonathan Clark's repositories

multeval

Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability" from ACL 2011.

Language:GroffLicense:NOASSERTIONStargazers:203Issues:13Issues:17

ducttape

A workflow management system for researchers who heart Unix.

Language:ScalaLicense:NOASSERTIONStargazers:115Issues:13Issues:168

memusg

A 'time'-like utility for Unix that measures peak memory usage

tercom

Translation Error Rate (TER)

Language:JavaLicense:LGPL-2.1Stargazers:43Issues:5Issues:7

bigfatlm

Hadoop MapReduce training of modified Kneser-Ney smoothed language models

Language:JavaLicense:LGPL-3.0Stargazers:30Issues:6Issues:2

salm

Joy Zhang's Suffix Array Language Modeling (SALM) Tooklit

Language:C++License:GPL-2.0Stargazers:4Issues:2Issues:0

turnin-webapp

Allow students to turn in their code via a web app.

Language:ScalaStargazers:4Issues:3Issues:0

akerblad

A Scala port of the LDC's Champollion sentence aligner for document-aligned parallel corpora.

cdec

Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms

Language:C++License:Apache-2.0Stargazers:2Issues:2Issues:0

tattletale

Python tool for monitoring status of home router, modem, and ISP connectivity. Logs and reports up time for each with email and auto-Twitter shaming built-in.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:2Issues:0

colorgcc

colorgcc is a perl script to colorize gcc output. I'm collecting random patches and changes

globutils

Convert a glob to a regex in Scala

Language:ScalaStargazers:1Issues:2Issues:0

jbi

Just Build It

Language:ScalaStargazers:1Issues:2Issues:0

liberate

Liberate your NLP data from previous Acts of Senseless Markup Language

Language:PerlStargazers:1Issues:2Issues:0

mosesdecoder

Moses, the machine translation system

Language:C++Stargazers:1Issues:2Issues:0

prunejuice

An AdaGrad optimizer with the FastOSCAR regularizer

Language:C++License:NOASSERTIONStargazers:1Issues:2Issues:0

sametime

Parallelize Unix commands: stdin => (parallel copies of Unix command) => stdout in the same order

Language:ScalaStargazers:1Issues:2Issues:0

scadoop

Yet another thin Scala wrapper for Hadoop

License:LGPL-3.0Stargazers:1Issues:2Issues:0

scala-optparse

Command line option parsing for scala

Language:ScalaLicense:Apache-2.0Stargazers:1Issues:2Issues:0

azure-sdk-for-net

Azure Tools for VIsual Studio

Language:C#License:MITStargazers:0Issues:2Issues:0

CNTK

Microsoft Cognitive Toolkit (CNTK)

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

EmacsChocolateyPackage

source crap for emacs chocolatey package

Stargazers:0Issues:2Issues:0

failfinder

Automatically exported from code.google.com/p/failfinder

Language:HTMLStargazers:0Issues:1Issues:0

groupify

A tiny utility for partitioning a group of people into smaller groups (making small meetings/discussion easy).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

psprofile

Jon's Basic Powershell setup (and other basics of how to setup a new Windows box)

Language:PowerShellStargazers:0Issues:2Issues:0

PSReadLine

A bash inspiried readline implementation for PowerShell

Language:C#License:BSD-2-ClauseStargazers:0Issues:2Issues:0

treegraft

Automatically exported from code.google.com/p/treegraft

Language:HTMLStargazers:0Issues:1Issues:2

uglygenerics

Automatically exported from code.google.com/p/uglygenerics

Language:CStargazers:0Issues:1Issues:0

zstuff

zsplit (and eventually other such things)

Language:CStargazers:0Issues:2Issues:0