Márton Miháltz's repositories
word2vec-GoogleNews-vectors
word2vec Google News model
trendminer-hunlp
Hungarian NLP pipeline for social media text analysis (TrendMiner project)
hunlp-pipeline
Hungarian NLP pipelne for tokenization, pos-tagging and stemming using open source tools
trendminer-hutools
Various tools used by TrendMiner/hu (Facebook data download, Java NooJ import/export format conversion)
cnn-text-classification-tf
Convolutional Neural Network for Text Classification in Tensorflow with word2vec embeddings (switch to branch dev-mmihaltz!)
biralat
Bírálat sablon a BME-s dolgozatokhoz
dedupe
:id: A python library for accurate and scaleable fuzzy matching, record deduplication and entity-resolution.
huwn-util
Miscellaneous utilities for Hungarian WordNet data files
keep-a-changelog
If you build software, keep a changelog.
libWNXML
C++ API for querying Hungarian WordNet XML files
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
opener-setup
Script to download and install OpeNER and all dependencies in one go (Ubuntu)
pytimeout
Python module to enable timeout on python code