ykshi's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | License: MIT | Stargazers: 163691 | Issues: 1558 | Issues: 2288

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 45444 | Issues: 302 | Issues: 658

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 33743 | Issues: 340 | Issues: 2661

ControlNet

Let us control diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 28959 | Issues: 213 | Issues: 526

ChatPaper

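Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow: uses ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.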

Language: Python | License: NOASSERTION | Stargazers: 17924 | Issues: 89 | Issues: 214

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language: Python | License: NOASSERTION | Stargazers: 14189 | Issues: 286 | Issues: 329

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5867 | Issues: 51 | Issues: 139

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Language: Python | License: MIT | Stargazers: 2474 | Issues: 37 | Issues: 68

ConvNeXt-V2

Code release for ConvNeXt V2 model

Language: Python | License: NOASSERTION | Stargazers: 1403 | Issues: 8 | Issues: 68

pix2pix-zero

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Language: Python | License: MIT | Stargazers: 1025 | Issues: 34 | Issues: 28

ICCV-2023-Papers

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Language: Python | License: MIT | Stargazers: 894 | Issues: 12 | Issues: 10

SceneDreamer

[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Language: Python | License: NOASSERTION | Stargazers: 583 | Issues: 34 | Issues: 10

NaveGo

NaveGo: an open-source MATLAB/GNU Octave toolbox for processing integrated navigation systems and performing inertial sensors analysis.

Language: MATLAB | License: NOASSERTION | Stargazers: 565 | Issues: 51 | Issues: 75

MNAD

An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.

Ultralight-SimplePose

Ultra-lightweight CNN model for human pose keypoint detection. Model size: 2.3 MB. HUAWEI P40 NCNN benchmark: 6 ms/img,

DCLS-SR

Official PyTorch implementation of the paper "Deep Constrained Least Squares for Blind Image Super-Resolution", CVPR 2022.

Language: Python | License: MIT | Stargazers: 217 | Issues: 9 | Issues: 60

CoBEVT

[CoRL2022] CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers

Language: Python | License: Apache-2.0 | Stargazers: 196 | Issues: 9 | Issues: 16

MobileR2L

[CVPR 2023] Real-Time Neural Light Field on Mobile Devices

Language: Python | License: NOASSERTION | Stargazers: 190 | Issues: 42 | Issues: 10

TarDAL

CVPR 2022 | Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection.

Language: Python | License: GPL-3.0 | Stargazers: 177 | Issues: 3 | Issues: 0

awesome-diffusion-low-level-vision

Awesome Diffusion Models in Low-Level Vision

License: MIT | Stargazers: 172 | Issues: 10 | Issues: 0

wire

Wavelet implicit neural representations

Language: Python | License: MIT | Stargazers: 126 | Issues: 7 | Issues: 24

CoAlign

[ICRA2023] CoAlign: Robust Collaborative 3D Object Detection in Presence of Pose Errors

Language: Python | License: NOASSERTION | Stargazers: 124 | Issues: 6 | Issues: 31

DOC-VTON

Official code for DOC-VTON. We provide visualization results for Awesome Virtual Try-on, as well as auxiliary data for VITON and VITON-HD for training and testing.

VehicleMAE

[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang

DDistill-SR

DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution (TMM 2023)

Language: Python | License: Apache-2.0 | Stargazers: 16 | Issues: 2 | Issues: 0

DnSwin

Code for our paper "DnSwin: Toward Real-World Denoising via Continuous Wavelet Sliding-Transformer" (KBS 2022)

Language: Python | Stargazers: 13 | Issues: 2 | Issues: 0

NegVSR

NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-world Video Super-Resolution. Real-world, video super-resolution, image processing, data augmentation, video, resolution-resolution

Language: Python | Stargazers: 9 | Issues: 0 | Issues: 0

ADASR

Official implementation of ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data Fusion

Language: Python | Stargazers: 8 | Issues: 0 | Issues: 0
Language: HTML | Stargazers: 1 | Issues: 0 | Issues: 0