James Wilson's repositories
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
bark
🔊 Text-Prompted Generative Audio Model
Blog
博客上相关的代码
CHRLINE
LINE Chrome API
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
evpp
A modern C++ network library for developing high performance network services in TCP/UDP/HTTP protocols.
explorerplusplus
Explorer++ is a lightweight and fast file manager for Windows
Gorgeous-Whatsapp
The WhatsApp lib for java
gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
lama-cleaner
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
OpenVoice
Instant voice cloning by MyShell.
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
perspective-api-client
Node.js client for the Perspective API
pinyin
:cn: 汉字拼音 ➜ hàn zì pīn yīn
polyphone
Chinese polyphone disambiguation for Text-to-Speech application
python-pinyin
汉字转拼音(pypinyin)
Recorder
html5 js 录音 mp3 wav ogg webm amr 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
SantanderEscaped
A new, enhanced File Manager for iOS devices
so-vits-svc
SoftVC VITS Singing Voice Conversion
tesseract
Tesseract Open Source OCR Engine (main repository)
TTS-Voice-Wizard
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)
Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
weekly
前端食堂技术周刊,每周发布。🌰
yt-dlp
A youtube-dl fork with additional features and fixes