MaybeShewill-CV

MaybeShewill-CV

Geek Repo

Company:Baidu

Location:Tong Ji University

Home Page:https://maybeshewill-cv.github.io

Github PK Tool:Github PK Tool


Organizations
baidu

MaybeShewill-CV's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31088Issues:197Issues:4830

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26196Issues:217Issues:237

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:15773Issues:104Issues:812

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:13115Issues:93Issues:16

LivePortrait

Bring portraits to life!

Language:PythonLicense:NOASSERTIONStargazers:11821Issues:108Issues:335

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10870Issues:64Issues:244

HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Language:PythonLicense:Apache-2.0Stargazers:9809Issues:40Issues:75
Language:C++License:NOASSERTIONStargazers:4460Issues:214Issues:41

GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

automq

AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x cost-effective. Autoscale in seconds. Single-digit ms latency.

Language:JavaLicense:NOASSERTIONStargazers:3699Issues:35Issues:460

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:3330Issues:30Issues:138

OpenGlass

Turn any glasses into AI-powered smart glasses

LyCORIS

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Language:PythonLicense:Apache-2.0Stargazers:2152Issues:21Issues:137

c-style

My favorite C programming practices.

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:1801Issues:26Issues:118

FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

Language:PythonLicense:NOASSERTIONStargazers:1441Issues:0Issues:0

FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Language:PythonLicense:NOASSERTIONStargazers:1352Issues:29Issues:218

nano-llama31

nanoGPT style version of Llama 3.1

GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Language:PythonLicense:MITStargazers:1029Issues:47Issues:40

accelerated_features

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:899Issues:20Issues:55

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language:PythonLicense:Apache-2.0Stargazers:595Issues:18Issues:7

meta

Header-only, non-intrusive and macro-free runtime reflection system in C++

Language:C++License:MITStargazers:579Issues:24Issues:6

cppguidebook

小彭老师领衔编写,现代C++的中文百科全书

Language:TypstLicense:NOASSERTIONStargazers:569Issues:47Issues:19

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language:PythonLicense:NOASSERTIONStargazers:435Issues:9Issues:14

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Language:PythonLicense:Apache-2.0Stargazers:362Issues:21Issues:27

asio-grpc

Asynchronous gRPC with Asio/unified executors

Language:C++License:Apache-2.0Stargazers:359Issues:11Issues:86

Tianji

从零学习,制作懂人情世故的大语言模型

Language:PythonLicense:Apache-2.0Stargazers:352Issues:5Issues:3

B-LoRA

Implicit Style-Content Separation using B-LoRA

Language:Jupyter NotebookLicense:MITStargazers:282Issues:8Issues:21

OmniTokenizer

OmniTokenizer: one model and one weight for image-video joint tokenization.

Language:PythonLicense:MITStargazers:230Issues:4Issues:19

Cascade-CLIP

Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Language:PythonLicense:MITStargazers:32Issues:2Issues:3