vincentliuheyang

vincentliuheyang's starred repositories

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonNOASSERTION485500

openlrc

Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT，Claude等)来转录、翻译你的音频为字幕文件。

Language:PythonMIT37900

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language:PythonApache-2.0795700

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language:PythonMIT1438200

latent-nerf

Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"

Language:PythonMIT68600

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookBSD-3-Clause442400

threestudio

A unified framework for 3D content generation.

Language:PythonApache-2.0586900

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT633000

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Language:PythonNOASSERTION112500

Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Language:PythonGPL-3.018800

tpdm

Official code for "Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models" (TPDM)

Language:Python3600

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language:PythonNOASSERTION202200

text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Language:PythonMIT98300

Speech2Lip

[ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video

Language:Python5500

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language:Jupyter NotebookNOASSERTION163000

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Language:PythonApache-2.0134700

rich-text-to-image

Rich-Text-to-Image Generation

Language:PythonMIT74300

Versatile-Diffusion

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Language:PythonMIT130100

PointGPT

[NeurIPS 2023] PointGPT: Auto-regressively Generative Pre-training from Point Clouds

Language:PythonMIT17200

Animal3D

MIT2000

EAMM

Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'

Language:PythonMIT17900

galai

Model API for GALACTICA

Language:Jupyter NotebookApache-2.0266000

ivid

PyTorch implementation of the ICCV paper "3D-aware Image Generation using 2D Diffusion Models"

Language:PythonMIT29400

embedchain

Memory for AI agents

Language:PythonApache-2.0886600

SDFusion

Language:C++MIT37600

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonAGPL-3.02537700

ControlNet-for-Diffusers

Transfer the ControlNet with any basemodel in diffusers🔥

Language:PythonMIT77400

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION803300

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Language:PythonApache-2.0196000

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language:PythonApache-2.0498500