Stefano Ferraro's starred repositories

genrl

[GenRL] Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state sequences can be decoded using the decoder of the model, allowing visualization of the expected behavior, before training the agent to execute it.

Language: Python | License: MIT | Stargazers: 38 | Issues: 0

3d-gaussian-splatting

Implementation of 3D Gaussian splatting

Language: Python | License: MIT | Stargazers: 313 | Issues: 0

RVT

Official Code for RVT-2 and RVT

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 246 | Issues: 0

catalyst-rl-tutorial

Using Catalyst.RL to train a robot to perform peg-in-hole insertion in simulation.

Language: Python | License: MIT | Stargazers: 149 | Issues: 0

choreographer

[ICLR 2023] Choreographer: a model-based agent that discovers and learns unsupervised skills in latent imagination and can efficiently coordinate and adapt those skills to solve downstream tasks.

Language: Python | License: MIT | Stargazers: 31 | Issues: 0