mbanon / benchmarks

Several benchmarks on sentence splitting and language identification

benchmarks

Language identification tests

See /langid/README.md

Sentence splitting tests

See /sentence_splitting/README.md

About

License: Creative Commons Attribution 4.0 International


Languages

Mathematica 17.9%, Roff 13.3%, Bikeshed 13.3%, Smalltalk 11.9%, Emacs Lisp 11.9%, Makefile 11.9%, Slash 11.9%, JavaScript 7.7%, Python 0.3%, Shell 0.1%