Nur Muhammad "Mahi" Shafiullah's starred repositories
Open-Teach
A Versatile Teleoperation framework for Robotic Manipulation using Meta Quest3
epub2sphinx
epub2sphinx is a tool to convert epub files to ReST for Sphinx
vq_bet_official
Official code for "Behavior Generation with Latent Actions"
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
DepthStreamCompression
Depth Stream Compression using RVL and Temporal RVL
EmbodiedScan
[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
stretch_visual_servoing
Example code for visual servoing using Stretch 3's gripper camera
stretch_dex_teleop
Dexterous teleoperation for the Stretch mobile manipulators from Hello Robot Inc.
diff_history
[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
godot_rl_agents
An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
nlp-phd-global-equality
A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP
RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
lang-segment-anything
SAM with text prompt
pyAvroPhonetic
Python implementation of Avro Phonetic
see-to-touch
Code base for See to Touch project: https://see-to-touch.github.io/