MaybeShewill-CV

Baidu

Tong Ji University

https://maybeshewill-cv.github.io

Organizations

baidu

MaybeShewill-CV's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | 31088 197 4830

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | 26196 217 237

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | 15773 104 812

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | 13115 93 16

LivePortrait

Bring portraits to life!

Language: Python | License: NOASSERTION | 11821 108 335

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | 10870 64 244

HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI tool for making ID photos.

Language: Python | License: Apache-2.0 | 9809 40 75

duix.ai

Language: C++ | License: NOASSERTION | 4460 214 41

GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Language: Python | 3717 5 11

automq

AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x cost-effective. Autoscale in seconds. Single-digit ms latency.

Language: Java | License: NOASSERTION | 3699 35 460

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | 3330 30 138

OpenGlass

Turn any glasses into AI-powered smart glasses

Language: C | License: MIT | 3258 54 38

LyCORIS

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Language: Python | License: Apache-2.0 | 2152 21 137

c-style

My favorite C programming practices.

License: NOASSERTION | 1971 48 7

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | License: Apache-2.0 | 1801 26 118

FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

Language: Python | License: NOASSERTION | 144100

FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Language: Python | License: NOASSERTION | 1352 29 218

nano-llama31

nanoGPT style version of Llama 3.1

Language: Python | 1173 21 5

GLEE

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python | License: MIT | 1029 47 40

accelerated_features

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Language: Jupyter Notebook | License: Apache-2.0 | 899 20 55

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language: Python | License: Apache-2.0 | 595 18 7

meta

Header-only, non-intrusive and macro-free runtime reflection system in C++

Language: C++ | License: MIT | 579 24 6

cppguidebook

A Chinese-language encyclopedia of modern C++, with writing led by 小彭老师 (Teacher Xiao Peng).

Language: Typst | License: NOASSERTION | 569 47 19

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language: Python | License: NOASSERTION | 435 9 14

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Language: Python | License: Apache-2.0 | 362 21 27

asio-grpc

Asynchronous gRPC with Asio/unified executors

Language: C++ | License: Apache-2.0 | 359 11 86

Tianji

Learn from scratch how to build a large language model that understands social etiquette and the ways of the world.

Language: Python | License: Apache-2.0 | 352 5 3

B-LoRA

Implicit Style-Content Separation using B-LoRA

Language: Jupyter Notebook | License: MIT | 282 8 21

OmniTokenizer

OmniTokenizer: one model and one weight for image-video joint tokenization.

Language: Python | License: MIT | 230 4 19

Cascade-CLIP

Official implementation of ICML 2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Language: Python | License: MIT | 32 2 3