Dr. Yong CHENG's repositories
awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
blog
Public repo for HF blog posts
ChatAlpaca
A Multi-Turn Dialogue Corpus based on Alpaca Instructions
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily search and find personal or work documents by asking questions in everyday language.
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU部署 (Chinese LLaMA & Alpaca LLMs)
Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
determined
Determined: Deep Learning Training Platform
document.ai
基于向量数据库与GPT3.5的通用本地知识库方案(A universal local knowledge base solution based on vector database and GPT3.5)
evaluate
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
flash-attention
Fast and memory-efficient exact attention
hai-platform
一种以任务级分时调度GPU算力的高性能深度学习训练平台
HugNLP
HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊
langchain-ray
Examples on how to use LangChain and Ray
llama_index
LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data.
llm-foundry
LLM training code for MosaicML foundation models
make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
mlflow
Open source platform for the machine learning lifecycle
openai-cookbook
Examples and guides for using the OpenAI API
scikit-llm
Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks.
sd-prompt-translator
Stable Diffusion extension for prompt translation
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
text-generation-inference
Large Language Model Text Generation Inference
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.