Adam's repositories
threestudio
A unified framework for 3D content generation.
4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
generative-models
Generative Models by Stability AI
gorilla
Gorilla: An API store for LLMs
gsplat
CUDA-accelerated rasterization of Gaussian splatting
Gym-LLaVA
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Gym-Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
insightface
State-of-the-art 2D and 3D Face Analysis Project
instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
mujoco
Multi-Joint dynamics with Contact. A general-purpose physics simulator.
MVDream
Multi-view Diffusion for 3D Generation
PHALP
Code repository for the paper "Tracking People by Predicting 3D Appearance, Location & Pose". (CVPR 2022 Oral)
pytube
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube videos.
Spot-MuJoCo-ROS2
Simulation environment for the SpotMini model in MuJoCo, based on ROS2
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
V3D
V3D: Video Diffusion Models are Effective 3D Generators
Video-LLaMA
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
vivid123
[CVPR 2024] ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models
zero123
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)