笔移云误's starred repositories
osu-dreamer
a diffusion-based ML model for generating osu! maps from raw audio
HarmonyDream
Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
rrt-algorithms
n-dimensional RRT, RRT* (RRT-Star)
BEVFormer_tensorrt
BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).
BreezeShane.github.io
My own private blog.
large-scale-curiosity
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
AnimateLCM
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
weighted-likelihood-filter
Code for the paper "Outlier-robust Kalman Filtering through Generalised Bayes" presented at ICML 2024
GaussianSplats3D
Three.js-based implementation of 3D Gaussian splatting