Cherryjingyao's starred repositories

VLN-GOAT

Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 28 | Issues: 0

Dream2Real

[ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models

Language: Python | Stargazers: 53 | Issues: 0
Language: C++ | License: MIT | Stargazers: 12 | Issues: 0

accelerated_features

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 932 | Issues: 0

NaviLLM

[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'

Language: Python | License: MIT | Stargazers: 111 | Issues: 0

PHC

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Language: Python | License: NOASSERTION | Stargazers: 449 | Issues: 0

PULSE

Official Implementation of the ICLR 2024 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control

Language: Python | Stargazers: 131 | Issues: 0
Language: Python | Stargazers: 50 | Issues: 0

Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 8831 | Issues: 0

visualnav-transformer

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Language: Python | License: MIT | Stargazers: 551 | Issues: 0

SuperPrompt

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

Stargazers: 4562 | Issues: 0

Qwen-Agent

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python | License: NOASSERTION | Stargazers: 3277 | Issues: 0

Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 2587 | Issues: 0

JARVIS-1

JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

Language: Java | Stargazers: 333 | Issues: 0

Agent-Smith

[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Language: Python | License: MIT | Stargazers: 83 | Issues: 0

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language: Python | License: MIT | Stargazers: 219 | Issues: 0

Multi-Agent-GPT

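Multi-Agent-GPT: a multimodal expert-assistant GPT built on RAG and agents. It integrates tools for modalities such as text, image, and audio, and supports local deployment and private database construction.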

Language: Python | License: MIT | Stargazers: 218 | Issues: 0

Chatglm_lora_multi-gpu

ChatGLM on multiple GPUs using DeepSpeed and

Language: Python | Stargazers: 397 | Issues: 0

alfworld

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Language: Python | License: MIT | Stargazers: 346 | Issues: 0

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language: Python | License: MIT | Stargazers: 23603 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 34528 | Issues: 0

FireAct

FireAct: Toward Language Agent Fine-tuning

Language: Python | License: MIT | Stargazers: 248 | Issues: 0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 12233 | Issues: 0

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language: Python | License: Apache-2.0 | Stargazers: 2664 | Issues: 0

autogen

A programming framework for agentic AI 🤖

Language: C# | License: CC-BY-4.0 | Stargazers: 31887 | Issues: 0

agentUniverse

agentUniverse is an LLM multi-agent framework that allows developers to easily build multi-agent applications.

Language: Python | License: Apache-2.0 | Stargazers: 813 | Issues: 0
Language: Python | Stargazers: 96 | Issues: 0

mem0

The Memory layer for your AI apps

Language: Python | License: Apache-2.0 | Stargazers: 22290 | Issues: 0

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT | Stargazers: 3020 | Issues: 0