Pumpkin's starred repositories
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
LLM-Conversation-Safety
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
llama_parse
Parse files for optimal RAG
ring-flash-attention
Ring attention implementation with flash attention
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
plugins-quickstart
Get a ChatGPT plugin up and running in under 5 minutes!
ringattention
Transformers with Arbitrarily Large Context
yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.