dudds4's starred repositories
hebrew_tokenizer
A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word expression extraction.
fast_align
Simple, fast unsupervised word aligner
TED-Multilingual-Parallel-Corpus
TED Parallel Corpora is a growing collection of bilingual parallel corpora, multilingual parallel corpora, and monolingual corpora extracted from TED talks (www.ted.com) for 109 world languages.
FacebookPostsScraper
Scraper for posts in Facebook user profiles, pages and groups
Facebook_Marketplace_Monitor
Scrape FB marketplace for local items matching search criteria
QL_Dice10000
Using Q-learning to solve the Dice 10,000 game
socket.io-poco
Cross-platform C++ socket.io client written using the POCO libraries