MaybeShewill-CV's starred repositories
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
what-happens-when-zh_CN
What-happens-when 的中文翻译,原仓库 https://github.com/alex/what-happens-when
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
work-stealing-queue
A fast work-stealing queue template in C++
Cascade-CLIP
Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation