vaxin's starred repositories
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
BetterChatGPT
An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux)
llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
RealChar
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
RehabilitationGuide
颈椎病腰突康复指南,为程序员群体提供简单可靠的康复指南。
MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
LLMAgentPapers
Must-read Papers on LLM Agents.
alpaca_chinese_dataset
人工精调的中文对话数据集和一段chatglm的微调代码
llm-reasoners
A library for advanced large language model reasoning
gptstore-prompts
Here are the Top 100 prompts on GPTStore, which we can use to learn and improve prompt engineering.
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
agent-studio
An open toolkit for building and benchmarking general virtual agents in the wild
Formal-LLM
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
codeinterpreter-codebox
Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.