ljdang's starred repositories

opencv

Open Source Computer Vision Library

Language:C++License:Apache-2.0Stargazers:78293Issues:2656Issues:10759

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:24952Issues:323Issues:394

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19614Issues:301Issues:1356

instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Language:CudaLicense:NOASSERTIONStargazers:15862Issues:204Issues:1017

External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Language:PythonLicense:MITStargazers:11327Issues:104Issues:81

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language:PythonLicense:MITStargazers:11078Issues:120Issues:210

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonLicense:Apache-2.0Stargazers:7781Issues:118Issues:288

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7755Issues:97Issues:1580

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonLicense:NOASSERTIONStargazers:7198Issues:56Issues:191

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6458Issues:112Issues:294

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language:PythonLicense:Apache-2.0Stargazers:6242Issues:67Issues:247

LeetCode-Py

⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。

Language:PythonLicense:NOASSERTIONStargazers:5775Issues:39Issues:16

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5724Issues:32Issues:130

ByteTrack

[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Language:PythonLicense:MITStargazers:4683Issues:44Issues:366

deit

Official DeiT repository

Language:PythonLicense:Apache-2.0Stargazers:4018Issues:48Issues:197

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language:PythonLicense:Apache-2.0Stargazers:3266Issues:69Issues:263

aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Language:PythonLicense:NOASSERTIONStargazers:2106Issues:50Issues:1373

torchrec

Pytorch domain library for recommendation systems

Language:PythonLicense:BSD-3-ClauseStargazers:1886Issues:32Issues:170

EasyCV

An all-in-one toolkit for computer vision

Language:PythonLicense:Apache-2.0Stargazers:1773Issues:31Issues:75

EasyRec

A framework for large scale recommendation algorithms.

Language:PythonLicense:Apache-2.0Stargazers:1736Issues:50Issues:117

ClassyVision

An end-to-end PyTorch framework for image and video classification

Language:PythonLicense:MITStargazers:1589Issues:70Issues:77

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

E2FGVI

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

Language:PythonLicense:NOASSERTIONStargazers:1020Issues:16Issues:74

EfficientFormer

EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]

Language:PythonLicense:NOASSERTIONStargazers:978Issues:37Issues:58

MIRNetv2

[TPAMI 2022] Learning Enriched Features for Fast Image Restoration and Enhancement. Results on Defocus Deblurring, Denoising, Super-resolution, and image enhancement

Language:PythonLicense:NOASSERTIONStargazers:409Issues:5Issues:21

mvit

Code Release for MViTv2 on Image Recognition.

Language:PythonLicense:Apache-2.0Stargazers:389Issues:13Issues:20
Language:PythonLicense:Apache-2.0Stargazers:306Issues:17Issues:23

P3M

[ACM MM 2021] Privacy-Preserving Portrait Matting

Language:PythonLicense:MITStargazers:289Issues:20Issues:13

IFRNet

IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation (CVPR 2022)

Language:PythonLicense:MITStargazers:266Issues:10Issues:40