InitialBug's repositories
MarCo-Dialog
The code of ACL 2020 paper "Multi-Domain Dialogue Acts and Response Co-Generation"
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
how-to-train-tokenizer
怎么训练一个LLM分词器
Language:Python000
python-pinyin
汉字转拼音(pypinyin)
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.