MaybeShewill-CV

MaybeShewill-CV

Geek Repo

Company:Baidu

Location:Tong Ji University

Home Page:https://maybeshewill-cv.github.io

Github PK Tool:Github PK Tool


Organizations
baidu

MaybeShewill-CV's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:48831Issues:545Issues:195

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:23284Issues:159Issues:3633

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:21269Issues:168Issues:158

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:10704Issues:74Issues:423

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:9296Issues:59Issues:8

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8424Issues:79Issues:31

what-happens-when-zh_CN

What-happens-when 的中文翻译,原仓库 https://github.com/alex/what-happens-when

Language:PythonLicense:Apache-2.0Stargazers:6911Issues:67Issues:64

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:5959Issues:44Issues:163

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5602Issues:38Issues:68

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5077Issues:39Issues:34

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonLicense:MITStargazers:3757Issues:86Issues:86

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:3625Issues:34Issues:312

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2329Issues:41Issues:0

c-style

My favorite C programming practices.

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

VMamba

VMamba: Visual State Space Models,code is based on mamba

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language:PythonLicense:Apache-2.0Stargazers:1504Issues:32Issues:216

FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Language:PythonLicense:NOASSERTIONStargazers:997Issues:34Issues:127

GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Language:PythonLicense:MITStargazers:931Issues:42Issues:26

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language:PythonLicense:Apache-2.0Stargazers:834Issues:12Issues:16

accelerated_features

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:633Issues:21Issues:16

meta

Header-only, non-intrusive and macro-free runtime reflection system in C++

Language:C++License:MITStargazers:560Issues:24Issues:6

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language:PythonLicense:Apache-2.0Stargazers:538Issues:16Issues:5

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:455Issues:8Issues:11

work-stealing-queue

A fast work-stealing queue template in C++

Language:C++License:NOASSERTIONStargazers:279Issues:10Issues:1

D-LIOM

Tightly-coupled Direct LiDAR-Inertial Odometry and Mapping Based on Cartographer3D.

B-LoRA

Implicit Style-Content Separation using B-LoRA

Language:Jupyter NotebookLicense:MITStargazers:167Issues:6Issues:10

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Language:PythonLicense:Apache-2.0Stargazers:155Issues:22Issues:6

TinyCLIP

[ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance

Language:PythonLicense:NOASSERTIONStargazers:50Issues:4Issues:2