ykshi

ykshi's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonMIT163691 1558 2288

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.045444 302 658

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.033743 340 2661

ControlNet

Let us control diffusion models!

Language:PythonApache-2.028959 213 526

ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Language:PythonNOASSERTION17924 89 214

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonNOASSERTION14189 286 329

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookApache-2.05867 51 139

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Language:PythonMIT2474 37 68

ConvNeXt-V2

Code release for ConvNeXt V2 model

Language:PythonNOASSERTION1403 8 68

pix2pix-zero

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Language:PythonMIT1025 34 28

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Language:PythonMIT894 12 10

SceneDreamer

[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Language:PythonNOASSERTION583 34 10

NaveGo

NaveGo: an open-source MATLAB/GNU Octave toolbox for processing integrated navigation systems and performing inertial sensors analysis.

Language:MATLABNOASSERTION565 51 75

MNAD

An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.

Language:Python329 12 58

Ultralight-SimplePose

Ultra-lightweight human body posture key point CNN model. ModelSize:2.3MB HUAWEI P40 NCNN benchmark: 6ms/img,

Language:Python274 13 10

DCLS-SR

Official PyTorch implementation of the paper "Deep Constrained Least Squares for Blind Image Super-Resolution", CVPR 2022.

Language:PythonMIT217 9 60

CoBEVT

[CoRL2022] CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers

Language:PythonApache-2.0196 9 16

MobileR2L

[CVPR 2023] Real-Time Neural Light Field on Mobile Devices

Language:PythonNOASSERTION190 42 10

TarDAL

CVPR 2022 | Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection.

Language:PythonGPL-3.0177 30

awesome-diffusion-low-level-vision

Awesome Diffusion Models in Low-Level Vision

MIT172 100

wire

wavelet implicit neural representations

Language:PythonMIT126 7 24

CoAlign

[ICRA2023] CoAlign: Robust Collaborative 3D Object Detection in Presence of Pose Errors

Language:PythonNOASSERTION124 6 31

DOC-VTON

Official code for DOC-VTON. We provide visualization results of Awesome Virtual Tryon. Besides, we provide auxiliary data of VITON and VITON-HD for training and testing.

Language:Python55 5 3

VehicleMAE

[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang

Language:Python17 5 3

DDistill-SR

DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution (TMM 2023)

Language:PythonApache-2.016 20

DnSwin

Codes for our paper "DnSwin: Toward Real-World Denoising via Continuous Wavelet Sliding-Transformer" (KBS 2022)

Language:Python13 20

NegVSR

NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-world Video Super-Resolution. Real-world, video super-resolution, image processing, data augmentation, video, resolution-resolution

Language:Python900

ADASR

ADASR official implementation ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data Fusion

Language:Python800

pix2pixzero.github.io

website

Language:HTML100