MaybeShewill-CV's starred repositories
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs
llama3-from-scratch
Llama 3 implementation, one matrix multiplication at a time
what-happens-when-zh_CN
Chinese translation of What-happens-when; original repository: https://github.com/alex/what-happens-when
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
work-stealing-queue
A fast work-stealing queue template in C++