Masatoshi Suzuki's repositories
WikiEntVec
Distributed representations of words and named entities trained on Wikipedia.
wikipedia-utils
Utility scripts for preprocessing Wikipedia texts for NLP
japanese-bert
BERT models with tokenization for Japanese texts.
aio2-soseki-baseline
Baseline QA system for the AIO2 competition using BPR (Binary Passage Retriever)
aio2-tfidf-baseline
TF-IDF-based QA system for the AIO2 competition
closed-book-qa
Quizbowl as a testbed of entity knowledge
dotfiles
Dotfiles for Vim, Zsh, and tmux
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
luke
LUKE -- Language Understanding with Knowledge-based Embeddings
singletongue.github.io
Personal pages.
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
wikipedia2vec
A tool for learning vector representations of words and entities from Wikipedia