vincentliuheyang's starred repositories

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language: Python | License: NOASSERTION | Stargazers: 4855 | Issues: 0

openlrc

Transcribe and translate voice into LRC files using Whisper and LLMs (GPT, Claude, etc.).

Language: Python | License: MIT | Stargazers: 379 | Issues: 0

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language: Python | License: Apache-2.0 | Stargazers: 7957 | Issues: 0

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language: Python | License: MIT | Stargazers: 14382 | Issues: 0

latent-nerf

Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"

Language: Python | License: MIT | Stargazers: 686 | Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 4424 | Issues: 0

threestudio

A unified framework for 3D content generation.

Language: Python | License: Apache-2.0 | Stargazers: 5869 | Issues: 0

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6330 | Issues: 0

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Language: Python | License: NOASSERTION | Stargazers: 1125 | Issues: 0

Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Language: Python | License: GPL-3.0 | Stargazers: 188 | Issues: 0

tpdm

Official code for "Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models" (TPDM)

Language: Python | Stargazers: 36 | Issues: 0

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language: Python | License: NOASSERTION | Stargazers: 2022 | Issues: 0

text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Language: Python | License: MIT | Stargazers: 983 | Issues: 0

Speech2Lip

[ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video

Language: Python | Stargazers: 55 | Issues: 0

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1630 | Issues: 0

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Language: Python | License: Apache-2.0 | Stargazers: 1347 | Issues: 0

rich-text-to-image

Rich-Text-to-Image Generation

Language: Python | License: MIT | Stargazers: 743 | Issues: 0

Versatile-Diffusion

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Language: Python | License: MIT | Stargazers: 1301 | Issues: 0

PointGPT

[NeurIPS 2023] PointGPT: Auto-regressively Generative Pre-training from Point Clouds

Language: Python | License: MIT | Stargazers: 172 | Issues: 0

License: MIT | Stargazers: 20 | Issues: 0

EAMM

Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'

Language: Python | License: MIT | Stargazers: 179 | Issues: 0

galai

Model API for GALACTICA

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2660 | Issues: 0

ivid

PyTorch implementation of the ICCV paper "3D-aware Image Generation using 2D Diffusion Models"

Language: Python | License: MIT | Stargazers: 294 | Issues: 0

embedchain

Memory for AI agents

Language: Python | License: Apache-2.0 | Stargazers: 8866 | Issues: 0

Language: C++ | License: MIT | Stargazers: 376 | Issues: 0

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language: Python | License: AGPL-3.0 | Stargazers: 25377 | Issues: 0

ControlNet-for-Diffusers

Transfer the ControlNet with any basemodel in diffusers🔥

Language: Python | License: MIT | Stargazers: 774 | Issues: 0

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python | License: NOASSERTION | Stargazers: 8033 | Issues: 0

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Language: Python | License: Apache-2.0 | Stargazers: 1960 | Issues: 0

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language: Python | License: Apache-2.0 | Stargazers: 4985 | Issues: 0