sirius541's starred repositories

TextRecognitionDataGenerator

A synthetic data generator for text recognition

Language:PythonLicense:MITStargazers:3212Issues:0Issues:0

florence2-finetuning

Quick exploration into fine tuning florence 2

Language:Jupyter NotebookLicense:MITStargazers:244Issues:0Issues:0

baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Language:PythonLicense:MITStargazers:2426Issues:0Issues:0

MINI_LLM

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Language:PythonStargazers:324Issues:0Issues:0

DocRes

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Language:PythonLicense:MITStargazers:271Issues:0Issues:0

pytorch-cifar100

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)

Language:PythonStargazers:4171Issues:0Issues:0

Transformer

Transformer seq2seq model, program that can build a language translator from parallel corpus

Language:PythonLicense:Apache-2.0Stargazers:1332Issues:0Issues:0

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language:PythonLicense:Apache-2.0Stargazers:3271Issues:0Issues:0

lscm

Least squares conformal mapping implemented in C++

Language:C++License:MITStargazers:107Issues:0Issues:0

PoissonRecon

Poisson Surface Reconstruction

Language:C++License:MITStargazers:1537Issues:0Issues:0

DocTrPP

DocTr++ in PaddlePaddle

Language:PythonStargazers:36Issues:0Issues:0

doc3D-dataset

A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)

Language:ShellLicense:MITStargazers:159Issues:0Issues:0

doc3D-renderer

Blender rendering codes for doc3D-dataset (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)

Language:PythonStargazers:110Issues:0Issues:0

sd-face-editor

Face Editor for Stable Diffusion

Language:PythonLicense:MITStargazers:1014Issues:0Issues:0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:4252Issues:0Issues:0

InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10795Issues:0Issues:0

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14660Issues:0Issues:0

Document-Dewarping-with-Control-Points

Document Dewarping with Control Points

Language:PythonLicense:MITStargazers:150Issues:0Issues:0

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5696Issues:0Issues:0

PaperEdge

The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)

Language:PythonLicense:MITStargazers:115Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36344Issues:0Issues:0

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4191Issues:0Issues:0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonLicense:Apache-2.0Stargazers:6755Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11573Issues:0Issues:0

Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:PythonStargazers:1698Issues:0Issues:0

shap-e

Generate 3D objects conditioned on text or images

Language:PythonLicense:MITStargazers:11543Issues:0Issues:0

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonLicense:Apache-2.0Stargazers:9431Issues:0Issues:0

KAIR

Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR

Language:PythonLicense:MITStargazers:2867Issues:0Issues:0

PromptSR

PyTorch code for our paper "Image Super-Resolution with Text Prompt Diffusion"

Stargazers:101Issues:0Issues:0

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonLicense:MITStargazers:3907Issues:0Issues:0