RWL's repositories
carla
Open-source simulator for autonomous driving research.
chess_gpt_eval
A repo to evaluate the chess-playing abilities of various LLMs.
diffseg
DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements the main DiffSeg algorithm and additionally includes an experimental feature to add semantic labels to the masks based on a generated caption.
DreamMat
[SIGGRAPH2024] DreamMat: High-quality PBR Material Generation with Geometry- and Light-aware Diffusion Models
dspy
DSPy: The framework for programming with foundation models
frida
Clone this repo to build Frida
GPS-Gaussian
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
JaxMARL
Multi-Agent Reinforcement Learning with JAX
LoG
Level of Gaussians
long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models."
MA-LMM
[CVPR2024] MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Marigold
Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
Octopus
🐙 Octopus, an embodied vision-language model trained with RLEF that excels at embodied visual planning and programming.
OpenVoice
Instant voice cloning by MyShell.
precisenumbers
For when you want your numbers to be precise
sdk-examples
Spectacular AI SDK examples
segmenteverygrain
A SAM-based model for instance segmentation of images of grains
streaming-llm
Efficient Streaming Language Models with Attention Sinks
vec2text
Utilities for converting deep representations (like sentence embeddings) back to text
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
volatility3
Volatility 3.0 development
Wonder3D
A cross-domain diffusion model for 3D reconstruction from a single image
yolo_tracking
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models