Weiren's starred repositories

chat-macOS

Making the community's best AI chat models available to everyone.

Language:SwiftLicense:Apache-2.0Stargazers:1554Issues:0Issues:0

firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Language:TypeScriptLicense:AGPL-3.0Stargazers:18614Issues:0Issues:0

pipecat

Open Source framework for voice and multimodal conversational AI

Language:PythonLicense:BSD-2-ClauseStargazers:3378Issues:0Issues:0

phonemizer

Simple text to phones converter for multiple languages

Language:PythonLicense:GPL-3.0Stargazers:1231Issues:0Issues:0

professional-programming

A collection of learning resources for curious software engineers

Language:PythonLicense:MITStargazers:46740Issues:0Issues:0
Language:PythonStargazers:34Issues:0Issues:0

Juce-Plugins

Audio Plugins created using C++ and Juce Framework

Language:C++Stargazers:72Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:9185Issues:0Issues:0

Diff-HierVC

Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Language:PythonStargazers:198Issues:0Issues:0

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Language:PythonLicense:MITStargazers:459Issues:0Issues:0

WhisperKit

On-device Speech Recognition for Apple Silicon

Language:SwiftLicense:MITStargazers:3913Issues:0Issues:0

Applio

A simple, high-quality voice conversion tool focused on ease of use and performance

Language:PythonLicense:MITStargazers:1791Issues:0Issues:0

PaSST

Efficient Training of Audio Transformers with Patchout

Language:PythonLicense:Apache-2.0Stargazers:303Issues:0Issues:0

wavmark

AI-based Audio Watermarking Tool

Language:PythonLicense:MITStargazers:226Issues:0Issues:0

Stochastic-Restoration-GAN

Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks in Pytorch

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

APO

Some random notes about Windows Audio Processing Objects (APOs).

Stargazers:66Issues:0Issues:0

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Stargazers:1024Issues:0Issues:0

AEC3

AEC3 Extracted From WebRTC

Stargazers:1Issues:0Issues:0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:4946Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:Jupyter NotebookLicense:MITStargazers:7608Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1021Issues:0Issues:0

AICoverGen

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Language:PythonLicense:MITStargazers:1110Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:40Issues:0Issues:0

Speech-Prompts-Adapters

This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.

Stargazers:103Issues:0Issues:0

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3613Issues:0Issues:0

nnAudio

Audio processing by using pytorch 1D convolution network

Language:PythonLicense:MITStargazers:1032Issues:0Issues:0

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:2714Issues:0Issues:0

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language:PythonLicense:MITStargazers:16894Issues:0Issues:0

EasyVC

A toolkit for any-to-any encoder-decoder voice conversion systems

Language:PythonLicense:Apache-2.0Stargazers:81Issues:0Issues:0