Bing Li (bing-li-ai)

bing-li-ai

Geek Repo

Github PK Tool:Github PK Tool

Bing Li's starred repositories

DiffSF

Official repository for paper: DiffSF: Diffusion Models for Scene Flow Estimation

Language:PythonStargazers:14Issues:0Issues:0

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:PythonLicense:MITStargazers:889Issues:0Issues:0

ttt-lm-jax

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:PythonStargazers:318Issues:0Issues:0
License:Apache-2.0Stargazers:9Issues:0Issues:0

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:646Issues:0Issues:0

kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Language:PythonLicense:MITStargazers:674Issues:0Issues:0

OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

Stargazers:693Issues:0Issues:0

omni3d

Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"

Language:PythonLicense:NOASSERTIONStargazers:696Issues:0Issues:0

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language:PythonLicense:MITStargazers:2665Issues:0Issues:0

xlstm

Official repository of the xLSTM.

Language:PythonLicense:AGPL-3.0Stargazers:1093Issues:0Issues:0

muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language:PythonLicense:MITStargazers:840Issues:0Issues:0

Diffusion4D

"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei

Language:PythonStargazers:188Issues:0Issues:0

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Stargazers:332Issues:0Issues:0

MeshAnything

From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Language:PythonLicense:NOASSERTIONStargazers:1856Issues:0Issues:0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:1108Issues:0Issues:0

CorDA

CorDA: Context-Oriented Decomposition Adaptation of Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:25Issues:0Issues:0
Stargazers:30Issues:0Issues:0

MVEdit

[WIP] Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Language:JavaScriptLicense:MITStargazers:187Issues:0Issues:0

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language:Jupyter NotebookLicense:MITStargazers:863Issues:0Issues:0

DragAPart

[ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.

Language:PythonStargazers:54Issues:0Issues:0

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language:PythonLicense:AGPL-3.0Stargazers:1532Issues:0Issues:0

tc4d

TC4D: Trajectory-Conditioned Text-to-4D Generation

Language:PythonLicense:Apache-2.0Stargazers:156Issues:0Issues:0
License:MITStargazers:8Issues:0Issues:0

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonLicense:MITStargazers:319Issues:0Issues:0

attribute-control

Fine-Grained Subject-Specific Attribute Expression Control in T2I Models

Language:Jupyter NotebookLicense:MITStargazers:101Issues:0Issues:0

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024

Language:PythonLicense:Apache-2.0Stargazers:428Issues:0Issues:0

LaVie

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:798Issues:0Issues:0

OpenTAD

OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:124Issues:0Issues:0

spad

Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024

Language:PythonStargazers:117Issues:0Issues:0

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Stargazers:335Issues:0Issues:0