Cherryjingyao's starred repositories
Dream2Real
[ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models
accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
visualnav-transformer
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Agent-Smith
[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
visualwebarena
VisualWebArena is a benchmark for multimodal agents.
Multi-Agent-GPT
Multi-Agent-GPT: 一款基于RAG和agent构建的多模态专家助手GPT。它集成了文本、图像和音频等模态工具。支持本地部署和私有数据库建设。
Chatglm_lora_multi-gpu
chatglm多gpu用deepspeed和
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agentUniverse
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.