Sean Miller's starred repositories
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
crfsuite-rs
Rust binding to crfsuite
indic_nlp_library
Resources and tools for Indian language Natural Language Processing
hashtag_master
HashtagMaster: Segmentation tool for hashtags
char-rnn-text-generation
Character Embeddings Recurrent Neural Network Text Generation Models
nyt-first-said
Tweets when words are published for the first time in the NYT
handwritten-text-recognition-for-apache-mxnet
This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet deep learning frameworks on the IAM Dataset.