nahidalam's repositories
annotated_deep_learning_paper_implementations
60 implementations/tutorials of deep learning papers with side-by-side notes, including transformers (original, XL, Switch, Feedback, ViT, ...), optimizers (Adam, AdaBelief, Sophia, ...), GANs (CycleGAN, StyleGAN2, ...), reinforcement learning (PPO, DQN), CapsNet, distillation, ...
datacomp
DataComp: In search of the next generation of multimodal datasets
DHS-LLM-Workshop
DHS 2023 LLM Workshop by Sourab Mangrulkar
groundingLMM
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].
huggingface.js
Utilities to use the Hugging Face Hub API
imp
A family of multimodal small language models
inspect_ai
Inspect: A framework for large language model evaluations
llama.cpp
LLM inference in C/C++
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
llm.c
LLM training in simple, raw C/CUDA
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
matmulfreellm
Implementation of the MatMul-free LM.
ml-tic-clip
Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".
MMFM-Challenge
Official repository for the MMFM challenge
MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
MobiLlama
MobiLlama: a small language model tailored for edge devices
ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
open_clip
An open-source implementation of CLIP.
OpenMoE
A family of open-source Mixture-of-Experts (MoE) large language models
PLLaVA
Official repository for the PLLaVA paper
StoryDiffusion
Create Magic Story!
ultralytics
NEW - YOLOv8 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Video-ChatGPT
[ACL 2024] Video-ChatGPT is a video conversation model capable of generating meaningful conversations about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Yi
A series of large language models trained from scratch by developers @01-ai