spectre's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
llama_index
LlamaIndex is a data framework for your LLM applications
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
api-for-open-llm
Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc. 开源大模型的统一后端接口
awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
pretrain-gnns
Strategies for Pre-training Graph Neural Networks
Awesome-RSPapers
Recommender System Papers
Awesome-Code-LLM
A curated list of language modeling researches for code and related datasets.
recommendation_model
练习下用pytorch来复现下经典的推荐系统模型, 如MF, FM, DeepConn, MMOE, PLE, DeepFM, NFM, DCN, AFM, AutoInt, ONN, FiBiNET, DCN-v2, AFN, DCAP等
Finetune-ChatGLM2-6B
ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。
Recommender-System-Pytorch
基于 Pytorch 实现推荐系统相关的算法
awesome-all-you-need-papers
A list of all "all you need" papers. Updated daily using the arXiv API.
copy_paste_aug_detectron2
Copy-paste augmentation in detectron2 pipeline