ThanhPham1987

Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations", ICLR 2024

MIT000

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Apache-2.0000

grokfast-pytorch

Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"

MIT000

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

MIT000

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Apache-2.0000

MultiPly

MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild (CVPR2024 Oral)

000

MV-VTON

MV-VTON: Multi-View Virtual Try-On with Diffusion Models

000

OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

000

Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method in literature.

000

RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

000

SMILE-Dataset

[NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models"

000

top-cvpr-2024-papers

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

CC0-1.0000

typer

Typer, build great CLIs. Easy to code. Based on Python type hints.

MIT000

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

000

ThanhPham1987

PeterPham's repositories

applied-llm

Awesome-Diffusion-Models

awesome-talking-head-generation

DiffSynth-Studio

garfield

HSIConvKAN

LLM101n

LongVA

MimicBrush

practice-auto-label

textgrad

transformers.js

videollm-online

BentoBLIP

CosmicMan

DDMI

Depth-Anything-V2

finetune-your-clone

grokfast-pytorch

hallo

MagicTime

MultiPly

MV-VTON

OpenCLAY

OpenYOLO3D

RAG-Survey

SMILE-Dataset

top-cvpr-2024-papers

typer

VGen