Yao Zhou's starred repositories
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
generative-models
Generative Models by Stability AI
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
SearchEngine
Search engine principles
multimodal-prompt-learning
[CVPR 2023] Official repository of the paper "MaPLe: Multi-modal Prompt Learning".
text-dedup
All-in-one text de-duplication
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
vattention
Dynamic Memory Management for Serving LLMs without PagedAttention
Full-Segment-Anything
PyTorch implementation that adds new features to the Segment-Anything code. It supports batched input with the full-grid prompt (automatic mask generation) and post-processing that removes duplicated or small regions and holes, under flexible input image sizes.
the-stack-v2
Code for the curation of The Stack v2 and StarCoder2 training data
GeoReasoner
GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model