Zhan Tong (yztongzhan)

Company: Ant Research

Location: Shanghai, China

Home Page: https://scholar.google.com/citations?user=6FsgWBMAAAAJ

Zhan Tong's starred repositories

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 43390 | Issues: 297 | Issues: 606

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language: Python | License: NOASSERTION | Stargazers: 34814 | Issues: 1005 | Issues: 183

codellama

Inference code for CodeLlama models

Language: Python | License: NOASSERTION | Stargazers: 13860 | Issues: 159 | Issues: 169

open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

Language: C | License: NOASSERTION | Stargazers: 13825 | Issues: 174 | Issues: 288

CoDeF

[CVPR 2024] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language: Python | License: NOASSERTION | Stargazers: 4726 | Issues: 73 | Issues: 76

InternGPT

InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python | License: Apache-2.0 | Stargazers: 3089 | Issues: 43 | Issues: 49

TigerBot

TigerBot: A multi-language multi-task LLM

Language: Python | License: Apache-2.0 | Stargazers: 2168 | Issues: 31 | Issues: 121

DiffusionDet

[ICCV 2023 Oral] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language: Python | License: NOASSERTION | Stargazers: 1958 | Issues: 17 | Issues: 103

MAT

MAT: Mask-Aware Transformer for Large Hole Image Inpainting

Language: Python | License: NOASSERTION | Stargazers: 659 | Issues: 10 | Issues: 107

VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language: Python | License: MIT | Stargazers: 370 | Issues: 6 | Issues: 43

hmr-survey

[TPAMI 2023] Recovering 3D Human Mesh from Monocular Images: A Survey

AdaptFormer

[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"

Language: Python | License: MIT | Stargazers: 285 | Issues: 7 | Issues: 35

SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Language: Python | License: MIT | Stargazers: 252 | Issues: 9 | Issues: 62

Occupancy-MAE

Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders

Language: Python | License: Apache-2.0 | Stargazers: 231 | Issues: 7 | Issues: 30

UM-MAE

Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 229 | Issues: 5 | Issues: 22

EDT

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

SportsMOT

[ICCV 2023] SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes

AVION

Code release for "Training a Large Video Model on a Single Machine in a Day"

Language: Python | License: MIT | Stargazers: 88 | Issues: 1 | Issues: 4

TeSTra

Code for ECCV 2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"

Language: Python | License: Apache-2.0 | Stargazers: 87 | Issues: 2 | Issues: 9

MetaBEV

MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation

sparseformer

The official SparseFormer repository

Language: Python | License: MIT | Stargazers: 55 | Issues: 9 | Issues: 2

DDM

[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

Language: Python | License: MIT | Stargazers: 46 | Issues: 2 | Issues: 10

STMixer

[CVPR 2023] STMixer: A One-Stage Sparse Action Detector

MSPN

Multi-Stage Pose Network

VideoMAE-Action-Detection

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Language: Python | License: NOASSERTION | Stargazers: 37 | Issues: 2 | Issues: 5

EVAD

[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement

Language: Python | License: NOASSERTION | Stargazers: 17 | Issues: 2 | Issues: 3

SNCLR

[ICLR 2023] Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning

Language: Python | Stargazers: 10 | Issues: 0 | Issues: 0

ZeroI2V

Official implementation of "ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video"

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 4 | Issues: 4

chatgpt_mini_helper

My customized GPT-3.5 helper

Language: Python | Stargazers: 7 | Issues: 2 | Issues: 0

VideoMAE

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0