michael's repositories
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
bosszp-selenium
使用python+selenium完成对boss互联网相关岗位的数据爬取
chatdevIDE
ChatDev IDE is an tools for building your ai agent, Whether it's NPCs in games or powerful agent tools, you can design what you want for this platform.
chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs
coze-discord-proxy
代理Discord对话Coze-Bot,实现以API形式请求GPT4模型,提供对话、文生图、图生文、知识库检索等功能。
dataherald
Interact with your SQL database, Natural Language to SQL using LLMs
equibot
Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".
facefusion
Next generation face swapper and enhancer
FastGPT
FastGPT is a knowledge-based platform built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization!
Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
Fooocus
Focus on prompting and generating
GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
kimi-free-api
🚀 KIMI AI 长文本大模型白嫖服务,支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
LLaMA-Factory
Unify Efficient Fine-tuning of 100+ LLMs
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
mergekit
Tools for merging pretrained large language models.
MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
ollama
Get up and running with Llama 2, Mistral, Gemma, and other large language models.
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
so-large-lm
大模型理论基础
SoraWebui
SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.
tree2retriever
Recursive Abstractive Processing for Tree-Organized Retrieval
VisionProTeleop
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision