Dod-o's starred repositories
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
wukong-robot
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
torchscale
Foundation Architecture for (M)LLMs
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
python-docx-template
Use a docx as a jinja2 template
Awesome-Code-LLM
👨💻 An awesome and curated list of best code-LLM for research.
text-dedup
All-in-one text de-duplication
mathpix-markdown-it
Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
ModelCenter
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
mobile_monitor_android_simple
一款轻量级的监听Android手机短信电话通知->推送到微信、企业微信、自定义接口、邮箱、Bark的软件
arxiv-tools
Tools to bulk download arxiv data
bigcode-analysis
Repository for analysis and experiments in the BigCode project.
chatgpt-gzh
一个基于公众号的chatpgt项目
CyrillicHandwritingPOC
Repository for contributions for Data Generation for Post-OCR correction of Cyrillic handwriting paper
arxiv-compiler
Service to compile LaTeX source packages into PDF, PostScript, and other formats