Jian's repositories

act-plus-plus

Imitation Learning algorithms with Co-traing for Mobile ALOHA: ACT, Diffusion Policy, VINN

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

barkour_robot

Barkour Robot: Agile Quadruped Robots by Google DeepMind

License:NOASSERTIONStargazers:0Issues:0Issues:0

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

License:Apache-2.0Stargazers:0Issues:0Issues:0

ComfyUI

The most powerful and modular stable diffusion GUI with a graph/nodes interface.

License:GPL-3.0Stargazers:0Issues:0Issues:0

EmbodiedAIxLLMPapers

Papers on integrating large language models with embodied AI

Stargazers:0Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

License:MITStargazers:0Issues:0Issues:0

gpt4all

gpt4all: an ecosystem of open-source chatbots trained on a massive collections of clean assistant data including code, stories and dialogue

License:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

llm_multiagent_debate

Code for Improving Factuality and Reasoning in Language Models through Multiagent Debate

Stargazers:0Issues:0Issues:0

llmtune

4-Bit Finetuning of Large Language Models on One Consumer GPU

Stargazers:0Issues:0Issues:0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

License:MITStargazers:0Issues:0Issues:0

MobileAgent

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

License:MITStargazers:0Issues:0Issues:0

MotionGPT

MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

License:MITStargazers:0Issues:0Issues:0

ollama-voice-mac

Mac compatible Ollama Voice

License:AGPL-3.0Stargazers:0Issues:0Issues:0

open-interpreter

A natural language interface for computers

License:AGPL-3.0Stargazers:0Issues:0Issues:0

PantoMatrix

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Retrieval-QA-Benchmark

Benchmark baseline for retrieval qa applications

License:GPL-3.0Stargazers:0Issues:0Issues:0

roop

one-click deepfake (face swap)

License:GPL-3.0Stargazers:0Issues:0Issues:0

tidybot

TidyBot: Personalized Robot Assistance with Large Language Models

Stargazers:0Issues:0Issues:0

ToolBench

An open platform for training, serving, and evaluating large language model for tool learning.

License:Apache-2.0Stargazers:0Issues:0Issues:0

ToolkenGPT

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings

Stargazers:0Issues:0Issues:0

universal_manipulation_interface

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

License:MITStargazers:0Issues:0Issues:0

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

License:Apache-2.0Stargazers:0Issues:0Issues:0

WizardLM

WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions

Stargazers:0Issues:0Issues:0

XrayGPT

XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.

Stargazers:0Issues:0Issues:0