Hirokazu Kiyomaru's repositories
pu-learning
A collection of notebooks that implement algorithms introduced in "Learning from positive and unlabeled data: a survey"
tf2-seq2seq
Scripts to train a seq2seq model using TensorFlow 2
diversity-aware-event-prediction
Diversity-aware Event Prediction based on a Conditional Variational Autoencoder with Reconstruction (COIN2019)
acl-trend-visualizer
A tool to count the number of ACL papers containing specific words.
neural-pepper
Control Pepper and perform neural captioning
BERT-related-papers
BERT-related papers
bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
hirokazukiyomaru.com
My web page
HojiChar
A robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
react-wordle
A fun Wordle clone made using React, TypeScript, and Tailwind
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
trl
Train transformer language models with reinforcement learning.