MaybeShewill-CV

Company: Baidu

Location: Tong Ji University

Home Page: https://maybeshewill-cv.github.io


Organizations
baidu

MaybeShewill-CV's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 60941 | Issues: 514 | Issues: 0

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 48279 | Issues: 530 | Issues: 191

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 21363 | Issues: 149 | Issues: 3335

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 19604 | Issues: 162 | Issues: 136
Language: Python | License: Apache-2.0 | Stargazers: 9712 | Issues: 100 | Issues: 285

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 7992 | Issues: 79 | Issues: 28

what-happens-when-zh_CN

Chinese translation of What-happens-when; original repository: https://github.com/alex/what-happens-when

Language: Python | License: Apache-2.0 | Stargazers: 6853 | Issues: 66 | Issues: 63

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 5770 | Issues: 41 | Issues: 152

gemma.cpp

Lightweight, standalone C++ inference engine for Google's Gemma models.

Language: C++ | License: Apache-2.0 | Stargazers: 5545 | Issues: 38 | Issues: 65

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python | License: Apache-2.0 | Stargazers: 5046 | Issues: 39 | Issues: 33

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language: Python | License: MIT | Stargazers: 3711 | Issues: 83 | Issues: 80

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python | License: GPL-3.0 | Stargazers: 3461 | Issues: 32 | Issues: 253

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python | License: AGPL-3.0 | Stargazers: 2237 | Issues: 39 | Issues: 0

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1786 | Issues: 22 | Issues: 62

VMamba

VMamba: Visual State Space Models; code is based on mamba

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 1447 | Issues: 32 | Issues: 206

stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language: C++ | License: Apache-2.0 | Stargazers: 1270 | Issues: 48 | Issues: 512

GLEE

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python | License: MIT | Stargazers: 910 | Issues: 42 | Issues: 24

sam-pt

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Language: Python | License: Apache-2.0 | Stargazers: 905 | Issues: 43 | Issues: 33

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language: Python | License: Apache-2.0 | Stargazers: 818 | Issues: 12 | Issues: 16

meta

Header-only, non-intrusive and macro-free runtime reflection system in C++

Language: C++ | License: MIT | Stargazers: 555 | Issues: 24 | Issues: 6

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language: Python | License: Apache-2.0 | Stargazers: 516 | Issues: 0 | Issues: 0

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language: Python | License: MIT | Stargazers: 439 | Issues: 8 | Issues: 10

D-LIOM

Tightly-coupled Direct LiDAR-Inertial Odometry and Mapping Based on Cartographer3D.

work-stealing-queue

A fast work-stealing queue template in C++

Language: C++ | License: NOASSERTION | Stargazers: 276 | Issues: 10 | Issues: 1

B-LoRA

Implicit Style-Content Separation using B-LoRA

Language: Jupyter Notebook | License: MIT | Stargazers: 134 | Issues: 6 | Issues: 8

TinyCLIP

[ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance

Language: Python | License: NOASSERTION | Stargazers: 47 | Issues: 2 | Issues: 0