Wing Joe's repositories
Anima
33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU
api-for-open-llm
Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc. 开源大模型的统一后端接口
Bert-VITS2-Integration-package
vits2 backbone with bert
chatgpt-plus
AI 助手全套开源解决方案,自带运营管理后台,开箱即用。集成了 ChatGPT, Azure, ChatGLM,讯飞星火,文心一言等多个平台的大语言模型。支持 MJ AI 绘画,Stable Diffusion AI 绘画,微博热搜等插件工具。采用 Go + Vue3 + element-plus 实现。
clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
DB-GPT
Revolutionizing Database Interactions with Private LLM Technology
DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
LangSegment
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练、有监督微调、RLHF(奖励建模、强化学习训练)和DPO(直接偏好优化)。
new-api
基于One API的二次开发版本,支持Midjourney,仅供个人管理渠道使用,请勿用于商业API分发!
OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
OpenVoice
Instant voice cloning by MyShell
palworld-server-docker
A Docker Container to easily run a Palworld dedicated server.
PhotoMaker
PhotoMaker
QAnything
Question and Answer based on Anything.
TonyColab
Colab script collection for various amazing projects! 各种牛逼项目的Colab脚本集合!
VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
WebGAL
A brand new web Visual Novel engine | 全新的网页端视觉小说引擎
whispering
Whispering Tiger - OpenAI's whisper with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications
WhisperLive
A nearly-live implementation of OpenAI's Whisper.
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)