Kecheng Zheng (zkcys001)

Company: Ant Research

Location: Hangzhou

Home Page: https://zkcys001.github.io/

Kecheng Zheng's starred repositories

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 20392 | Issues: 177 | Issues: 375

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 14131 | Issues: 116 | Issues: 373

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | Stargazers: 9151 | Issues: 75 | Issues: 448

CoDeF

[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language: Python | License: NOASSERTION | Stargazers: 4790 | Issues: 73 | Issues: 79

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3814 | Issues: 112 | Issues: 70

FlagAI

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.

Language: Python | License: Apache-2.0 | Stargazers: 3808 | Issues: 43 | Issues: 210

interfacegan

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

Language: Python | License: MIT | Stargazers: 1480 | Issues: 42 | Issues: 104

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language: Python | License: Apache-2.0 | Stargazers: 1453 | Issues: 28 | Issues: 81

HuggingFace-Download-Accelerator

Uses HuggingFace's official download tool to download at high speed from a mirror site.

SpaTracker

[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space

Language: Python | License: NOASSERTION | Stargazers: 572 | Issues: 61 | Issues: 24

Long-CLIP

[ECCV 2024] Official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Awesome-Open-Vocabulary-Semantic-Segmentation

A curated list of publications and resources on open-vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).

prompt-pretraining

Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"

Language: Python | License: Apache-2.0 | Stargazers: 249 | Issues: 5 | Issues: 13

LaCLIP

[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"

Language: Python | License: BSD-2-Clause | Stargazers: 239 | Issues: 8 | Issues: 8

CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Language: Python | License: NOASSERTION | Stargazers: 149 | Issues: 6 | Issues: 24

PLOT

[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models

Language: Python | License: MIT | Stargazers: 125 | Issues: 3 | Issues: 9

ddae

[ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"

RLIPv2

[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 100 | Issues: 2 | Issues: 17

ALIP

[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption

Language: Python | License: MIT | Stargazers: 87 | Issues: 3 | Issues: 1

SynthCLIP

Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.

Aurora

Official implementation of Aurora

Language: Python | License: NOASSERTION | Stargazers: 80 | Issues: 7 | Issues: 1

Ant-Multi-Modal-Framework

Research Code for Multimodal-Cognition Team in Ant Group

Language: Python | License: CC-BY-4.0 | Stargazers: 60 | Issues: 3 | Issues: 13

DreamLIP

[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions

AGAP

Learning Naturally Aggregated Appearance for Efficient 3D Editing

Language: Python | License: GPL-3.0 | Stargazers: 32 | Issues: 2 | Issues: 1

TagAlign

Official implementation of TagAlign

Language: Python | Stargazers: 13 | Issues: 1 | Issues: 0

CoReS

Code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"