There are 8 repositories under embodied-ai topic.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
A library for differentiable nonlinear optimization
DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!
Unified Reinforcement Learning Framework
SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark
[Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
A universal summary of current robotics simulators
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
[ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"
[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"
We release a general framework for prompting LLMs to manipulate software in a closed-loop manner.
《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞
[ICCV 2021] Official implementation of "The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation"
📣 [IEEE IROS 2023] Official Repository of IROS 23 paper "Uncertainty-Aware Lidar Place Recognition in Novel Environments"
[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
Official GitHub Repository for paper "Visual Graph Memory with Unsupervised Representation for Visual Navigation", ICCV 2021
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
An benchmark for evaluating the capabilities of large vision-language models (LVLMs)
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"