beilyLan

followers

following

stars

James Wilson's repositories

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonMIT000

bark

🔊 Text-Prompted Generative Audio Model

NOASSERTION000

Blog

博客上相关的代码

000

CHRLINE

LINE Chrome API

BSD-3-Clause000

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

MPL-2.0000

evpp

A modern C++ network library for developing high performance network services in TCP/UDP/HTTP protocols.

BSD-3-Clause000

explorerplusplus

Explorer++ is a lightweight and fast file manager for Windows

GPL-3.0000

fakeyou.py

Language:PythonGPL-3.0000

Gorgeous-Whatsapp

The WhatsApp lib for java

Language:JavaGPL-3.0010

gruut

A tokenizer, text cleaner, and phonemizer for many human languages.

MIT000

lama-cleaner

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Apache-2.0000

multidiffusion-upscaler-for-automatic1111

Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0

NOASSERTION000

OpenVoice

Instant voice cloning by MyShell.

Language:PythonNOASSERTION000

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonApache-2.0000

perspective-api-client

Node.js client for the Perspective API

MIT000

pinyin

:cn: 汉字拼音 ➜ hàn zì pīn yīn

000

platform_system_core

NOASSERTION000

polyphone

Chinese polyphone disambiguation for Text-to-Speech application

000

python-pinyin

汉字转拼音(pypinyin)

MIT000

Recorder

html5 js 录音 mp3 wav ogg webm amr 格式，支持pc和Android、iOS部分浏览器、Hybrid App（提供Android iOS App源码）、微信，提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码

MIT000

SantanderEscaped

A new, enhanced File Manager for iOS devices

Language:SwiftMIT000

shotcut

cross-platform (Qt), open-source (GPLv3) video editor

Language:C++GPL-3.0010

so-vits-svc

SoftVC VITS Singing Voice Conversion

BSD-3-Clause000

tesseract

Tesseract Open Source OCR Engine (main repository)

Apache-2.0000

TTS-Voice-Wizard

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)

Language:C#MIT000

Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

MIT000

Uploader

A JavaScript library providing multiple simultaneous, stable, fault-tolerant and resumable/restartable file uploads via the HTML5 File API.

Language:JavaScriptNOASSERTION010

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

MIT000

weekly

前端食堂技术周刊，每周发布。🌰

000

yt-dlp

A youtube-dl fork with additional features and fixes

Unlicense000