Beast code in Giters

Bing Li's starred repositories

DiffSF

Official repository for paper: DiffSF: Diffusion Models for Scene Flow Estimation

Language:Python1400

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:PythonMIT88900

ttt-lm-jax

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:Python31800

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonMIT64600

kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Language:PythonMIT67400

OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

69300

omni3d

Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"

Language:PythonNOASSERTION69600

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language:PythonMIT266500

xlstm

Official repository of the xLSTM.

Language:PythonAGPL-3.0109300

muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language:PythonMIT84000

"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei

Language:Python18800

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

33200

MeshAnything

From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Language:PythonNOASSERTION185600

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonMIT110800

CorDA

CorDA: Context-Oriented Decomposition Adaptation of Large Language Models

Language:PythonApache-2.02500

vividzoo

3000

MVEdit

[WIP] Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Language:JavaScriptMIT18700

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language:Jupyter NotebookMIT86300

DragAPart

[ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.

Language:Python5400

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language:PythonAGPL-3.0153200

tc4d

TC4D: Trajectory-Conditioned Text-to-4D Generation

Language:PythonApache-2.015600

Pascal-EA

MIT800

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonMIT31900

attribute-control

Fine-Grained Subject-Specific Attribute Expression Control in T2I Models

Language:Jupyter NotebookMIT10100

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024

Language:PythonApache-2.042800

LaVie

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Language:PythonApache-2.079800

OpenTAD

OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.

Language:PythonApache-2.012400

spad

Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024

Language:Python11700

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

33500

bing-li-ai