Alan's repositories
13_languages_detection_XLM-R
you can download the finetuned model for language detection
albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Background-Matting
Background Matting: The World is Your Green Screen
douban-comments-similarity
豆瓣影评数据集 word2vec+LSH相似评论分析
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
FastBERT
对ACL2020 FastBERT论文的复现,论文地址:https://arxiv.org/pdf/2004.02178.pdf
fastText
Library for fast text representation and classification.
Firefly
Firefly: 大模型训练工具,支持训练MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
geektime-ai-course
Jupyter Notebooks for Geektime AI Course
Google-Mirrors
Google谷歌、Wikipedia维基百科、谷歌学术镜像2023最新 新增各种镜像站
llama
Inference code for LLaMA models
ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
nlp_demo
bilibili-nlp
openai-cookbook
Examples and guides for using the OpenAI API
pytorch-tutorial
PyTorch深度学习快速入门教程(绝对通俗易懂!)
seq2seq_translation
seq2seq_translation
SoftMaskedBert-PyTorch
🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers.
Text_Corrector
英文和中文文本纠错
train_custom_LLM
Train your custom LLMs like Llama, baichuan-7b, GPT
UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo