Xiong Lin (bruinxiong)

Company: SenseTime, Xi'an, China

Location: China

Home Page: https://bruinxiong.github.io/xionglin.github.io/

Xiong Lin's repositories

License: Apache-2.0 | Stargazers: 0 | Issues: 0

awesome-3d-diffusion

A collection of papers on diffusion models for 3D generation.

License: MIT | Stargazers: 0 | Issues: 0

bark

🔊 Text-Prompted Generative Audio Model

License: MIT | Stargazers: 0 | Issues: 0

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

License: Apache-2.0 | Stargazers: 0 | Issues: 0

CuMo

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

License: Apache-2.0 | Stargazers: 0 | Issues: 0

FinePOSE_CVPR2024

FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models

License: MIT | Stargazers: 0 | Issues: 0

FunClip

Open-source, accurate and easy-to-use video clipping tool

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

HPT

HPT - Open Multimodal LLMs from HyperGAI

License: Apache-2.0 | Stargazers: 0 | Issues: 0

HunyuanDiT

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

License: NOASSERTION | Stargazers: 0 | Issues: 0

IC-Light

More relighting!

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

leptonai

A Pythonic framework to simplify AI service building

License: Apache-2.0 | Stargazers: 0 | Issues: 0

lerobot

🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

MagicDance

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Stargazers: 0 | Issues: 0

OpenLRM

An open-source implementation of Large Reconstruction Models

License: Apache-2.0 | Stargazers: 0 | Issues: 0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

License: NOASSERTION | Stargazers: 0 | Issues: 0

Rip-NeRF

Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids

Language: Python | Stargazers: 0 | Issues: 0

RoHM

The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.

License: NOASSERTION | Stargazers: 0 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

SEED-X

Multimodal Models in Real World

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0

spad

Code for SPAD: Spatially Aware Multiview Diffusers, CVPR 2024

Stargazers: 0 | Issues: 0

StoryDiffusion

Create Magic Story!

License: Apache-2.0 | Stargazers: 0 | Issues: 0

SwissArmyTransformer

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

License: NOASSERTION | Stargazers: 0 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

License: NOASSERTION | Stargazers: 0 | Issues: 0

VideoMV

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

License: MIT | Stargazers: 0 | Issues: 0