nahidalam's repositories
anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
apple_pie
robot foundation models
Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
imm
Official implementation of Inductive Moment Matching
Isaac-GR00T
NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
kokoro
https://hf.co/hexgrad/Kokoro-82M
lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
mllms_know
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
open-r1
Fully open reproduction of DeepSeek-R1
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
open_clip
An open source implementation of CLIP.
smolagents
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
smollm
Everything about the SmolLM2 and SmolVLM family of models
tidybot2
TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning
video-generation-survey
A reading list of video generation
VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks