Guodong Xu (memoiry)

Company: Apple <- Zhejiang University, CAD&CG

Location: Beijing

Home Page: person.zjulearning.org.cn/guodongxu/

Organizations
ZJULearning

Guodong Xu's starred repositories

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language: Python | License: Apache-2.0 | Stargazers: 822 | Issues: 0

fast-DiT

Fast Diffusion Models with Transformers

Language: Python | License: NOASSERTION | Stargazers: 569 | Issues: 0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | License: NOASSERTION | Stargazers: 5293 | Issues: 0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8360 | Issues: 0

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language: Python | License: Apache-2.0 | Stargazers: 599 | Issues: 0

UniHOI

Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models"

Stargazers: 22 | Issues: 0

PGDiff

[NeurIPS 2023] PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

Language: Python | License: NOASSERTION | Stargazers: 120 | Issues: 0

StableSR

Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language: Python | License: NOASSERTION | Stargazers: 1883 | Issues: 0

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language: Python | License: NOASSERTION | Stargazers: 4734 | Issues: 0

daclip-uir

[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.

Language: Python | License: MIT | Stargazers: 559 | Issues: 0

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Stargazers: 693 | Issues: 0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language: Python | License: NOASSERTION | Stargazers: 35258 | Issues: 0

sd-dino

Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"

Language: Jupyter Notebook | Stargazers: 216 | Issues: 0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | Stargazers: 7752 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 44629 | Issues: 0

UniDetector

Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".

Language: Python | License: Apache-2.0 | Stargazers: 494 | Issues: 0

shell_gpt

A command-line productivity tool, powered by AI large language models like GPT-4, that helps you accomplish your tasks faster and more efficiently.

Language: Python | License: MIT | Stargazers: 8499 | Issues: 0

XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Language: Python | License: MIT | Stargazers: 1617 | Issues: 0

DenseTeacher

DenseTeacher: Dense Pseudo-Label for Semi-supervised Object Detection

Language: Python | License: Apache-2.0 | Stargazers: 116 | Issues: 0

memray

Memray is a memory profiler for Python

Language: Python | License: Apache-2.0 | Stargazers: 12652 | Issues: 0

MODNet

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Language: Python | License: Apache-2.0 | Stargazers: 3620 | Issues: 0

HowToLiveLonger

A programmer's guide to living longer

License: Unlicense | Stargazers: 29268 | Issues: 0

Relative_Human

Relative Human dataset, CVPR 2022

Language: Python | Stargazers: 135 | Issues: 0

pi-consistency-activity-detection

End-to-End Semi-Supervised Learning for Video Action Detection [CVPR 2022]

Language: Python | License: MIT | Stargazers: 33 | Issues: 0

MonoDTR

MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer (CVPR 2022)

Language: Python | License: MIT | Stargazers: 123 | Issues: 0

MixFormer

[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention

Language: Python | License: MIT | Stargazers: 428 | Issues: 0

SparseInst

[CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance Segmentation

Language: Python | License: MIT | Stargazers: 563 | Issues: 0

bigdetection

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 381 | Issues: 0

DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 2017 | Issues: 0

openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

Language: TeX | Stargazers: 3720 | Issues: 0