firefly600's repositories
AI-
A high-performance inference system for large language models, designed for production environments.
bpy_triangle
A Blender add-on for Triangle.
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU部署 (Chinese LLaMA & Alpaca LLMs)
Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
ComfyTextures
Unreal Engine ⚔️ ComfyUI - Automatic texturing using generative diffusion models
deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
face-alignment
:fire: 2D and 3D Face alignment library build using pytorch
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
faster-whisper
Faster Whisper transcription with CTranslate2 ASR 快速的语音识别
Fay--
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
Fooocus
Focus on prompting and generating
FunASR--
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
generative-models
Generative Models by Stability AI
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
gpt4all
gpt4all: open-source LLM chatbots that you can run anywhere
gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
gunicorn--flask-WSGI
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
llama.cpp
LLM inference in C/C++
OpenVoice
Instant voice cloning by MyShell.
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
sd-webui-prompt-all-in-one
This is an extension based on sd-webui, aimed at improving the user experience of the prompt/negative prompt input box. It has a more intuitive and powerful input interface function, and provides automatic translation, history record, and bookmarking functions. 这是一个基于 sd-webui 的扩展,旨在提高提示词/反向提示词输入框的使用体验。它拥有更直观、强大的输入界面功能,它提供了自动翻译、历史记录和收藏等功能。
stable-diffusion-webui-wd14-tagger
Labeling extension for Automatic1111's Web UI
StoryDiffusion
Create Magic Story!
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
whisper_ASR_-
Robust Speech Recognition via Large-Scale Weak Supervision ASR 语音识别
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Yi
A series of large language models trained from scratch by developers @01-ai