gwbw's starred repositories

anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

Language:JavaScriptLicense:MITStargazers:24436Issues:0Issues:0

video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language:PythonLicense:Apache-2.0Stargazers:5867Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:61572Issues:0Issues:0

video2x

A machine learning-based lossless video super resolution framework. Est. Hack the Valley II, 2018.

Language:C++License:AGPL-3.0Stargazers:10171Issues:0Issues:0

NudeNet

Lightweight nudity detection

Language:PythonLicense:AGPL-3.0Stargazers:1765Issues:0Issues:0

Kolors

Kolors Team

Language:PythonLicense:Apache-2.0Stargazers:3698Issues:0Issues:0

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language:PythonLicense:AGPL-3.0Stargazers:38305Issues:0Issues:0

text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:4423Issues:0Issues:0

sqlite-vec

A vector search SQLite extension that runs anywhere!

Language:CLicense:Apache-2.0Stargazers:3937Issues:0Issues:0

Mangio-RVC-Fork

*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other f0 methods, along with a hybrid f0 nanmedian method.

Language:PythonLicense:MITStargazers:1001Issues:0Issues:0

SakuraLLM

适配轻小说/Galgame的日中翻译大模型

Language:PythonLicense:GPL-3.0Stargazers:2319Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10837Issues:0Issues:0

Nekomoekissaten-Subs

Subtitle source files from Nekomoe Kissaten. Should there be any issues, please create them in this main repository first.

Stargazers:2111Issues:0Issues:0

gallery-dl

Command-line program to download image galleries and collections from several image hosting sites

Language:PythonLicense:GPL-2.0Stargazers:11562Issues:0Issues:0

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:52924Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:338Issues:0Issues:0

open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Language:SvelteLicense:MITStargazers:42449Issues:0Issues:0

How-to-use-Transformers

Transformers 库快速入门教程

Language:PythonLicense:Apache-2.0Stargazers:1031Issues:0Issues:0

LivePortrait

Bring portraits to life!

Language:PythonLicense:NOASSERTIONStargazers:12263Issues:0Issues:0

N_m3u8DL-RE

Cross-Platform, modern and powerful stream downloader for MPD/M3U8/ISM. English/简体中文/繁體中文.

Language:C#License:MITStargazers:4449Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:8732Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:2905Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:5443Issues:0Issues:0

simple-one-api

OpenAI 接口接入适配,支持千帆大模型平台、讯飞星火大模型、腾讯混元以及MiniMax、Deep-Seek,等兼容OpenAI接口,仅单可执行文件,配置超级简单,一键部署,开箱即用. Seamlessly integrate with OpenAI and compatible APIs using a single executable for quick setup and deployment.

Language:GoLicense:MITStargazers:1275Issues:0Issues:0

FluentRead

拥有基于上下文语境的人工智能翻译引擎,为网站提供更加友好的翻译,让所有人都能够拥有基于母语般的阅读体验。

Language:JavaScriptLicense:GPL-3.0Stargazers:1316Issues:0Issues:0

eat

I'm a CPU and memory eating monster. 一个吃 CPU 内存的怪兽。

Language:GoLicense:GPL-3.0Stargazers:274Issues:0Issues:0

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:13231Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:11856Issues:0Issues:0

xiaoju-survey

XIAOJUSURVEY is an enterprises form builder and analytics platform that allows users to create questionnaires, exams, polls, quizzes, and analyze data online.

Language:TypeScriptLicense:Apache-2.0Stargazers:2130Issues:0Issues:0

bloop

bloop is a fast code search engine written in Rust.

Language:RustLicense:Apache-2.0Stargazers:9420Issues:0Issues:0