loama's repositories
AlistClient
AList Client for iOS and Android. / 基于 AList api 开发的 Android 和 iOS 客户端
audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
automatic-theater
利用 Docker 打造自动化家庭影院,开箱即用
Bert-VITS2-ext
基于Bert-VITS2做的表情、动画测试
ChatGLM-6B-Engineering
ChatGLM-6B Prompt Engineering Project
ChatGLM-MNN
Pure C++, Easy Deploy ChatGLM-6B.
dify
One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications.
Digital_Life_Server
Yet another voice assistant, but alive.
esp32-s3-wifiCam
ESP32通过WiFi传输图像,使用QT6.3实现服务器以及客户端
faster-whisper-GUI
faster_whisper GUI with PySide6
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs
lenis
How smooth scroll should be
lizzie
Lizzie - Leela Zero Interface
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
MINI_LLM
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
PlayEdu
PlayEdu 是一款适用于搭建内部培训平台的开源系统,旨在为企业/机构打造自己品牌的内部培训平台。
pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
stt
Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式
Umi-OCR
OCR图片转文字识别软件,完全离线。截屏/批量导入图片,支持多国语言、合并段落、竖排文字。可排除水印区域,提取干净的文本。基于 PaddleOCR 。
video-edit-demo
基于ffmpeg.js的web简版视频编辑器
wire-pod
Fully-featured server software for the Anki (now Digital Dream Labs) Vector robot.
XunFeiTTS
XunFei text-to-speech intergration for unreal engine 5.