Jian's repositories
act-plus-plus
Imitation Learning algorithms with Co-traing for Mobile ALOHA: ACT, Diffusion Policy, VINN
Awesome-LLM-3D
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
barkour_robot
Barkour Robot: Agile Quadruped Robots by Google DeepMind
BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
ComfyUI
The most powerful and modular stable diffusion GUI with a graph/nodes interface.
EmbodiedAIxLLMPapers
Papers on integrating large language models with embodied AI
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
gpt4all
gpt4all: an ecosystem of open-source chatbots trained on a massive collections of clean assistant data including code, stories and dialogue
llm_multiagent_debate
Code for Improving Factuality and Reasoning in Language Models through Multiagent Debate
llmtune
4-Bit Finetuning of Large Language Models on One Consumer GPU
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
MotionGPT
MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
ollama-voice-mac
Mac compatible Ollama Voice
open-interpreter
A natural language interface for computers
PantoMatrix
PantoMatrix: Co-Speech Talking Head and Gestures Generation
Retrieval-QA-Benchmark
Benchmark baseline for retrieval qa applications
roop
one-click deepfake (face swap)
tidybot
TidyBot: Personalized Robot Assistance with Large Language Models
ToolBench
An open platform for training, serving, and evaluating large language model for tool learning.
ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
universal_manipulation_interface
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
WizardLM
WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions
XrayGPT
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.