Masahiro Suzuki's repositories
language-pretraining
Pre-training Language Models for Japanese
jptranstokenizer
Japanese Tokenizer for transformers library
aclanthology-translate
Translate abstracts on ACL anthology into Japanese
Language:PythonMIT000
Language:Python000
MIT000
mpt-lora-patch
Patch for MPT-7B which allows using and training a LoRA
Language:Python000
llm-japanese-dataset
LLM構築用の日本語チャットデータセット
000
000
Language:JavaScript000
retarfilib
Frequently used functions
Language:Python000
Language:PythonMIT000
winjumantokenizer
Juman Tokenizer for Windows which has compability with jptranstokenizer
Language:PythonMIT000