Young's starred repositories
albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
Chinese-Mixtral-8x7B
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism etc. 🎉🎉
computer-science
🎓 Path to a free self-taught education in Computer Science!
LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
promptbase
All things prompt engineering
flash-attention
Fast and memory-efficient exact attention
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
ChatGPT-AutoExpert
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
Llama-Chinese
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
text-generation-inference
Large Language Model Text Generation Inference
text-generation-webui
A Gradio web UI for Large Language Models.