Cyril Lv (IMYBo)

Company: NWPU

Location: China

Cyril Lv's starred repositories

UniAudio

The official source code of UniAudio

Language: Python | Stargazers: 78 | Issues: 0

coder2gwy

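The internet's first guide to the civil service exam for programmers, presented jointly by three former big-tech programmers who now work in the public sector.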

Stargazers: 25734 | Issues: 0

BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Language: Python | License: MIT | Stargazers: 340 | Issues: 0

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language: Python | License: MIT | Stargazers: 996 | Issues: 0

Language: Python | Stargazers: 45 | Issues: 0

cocopilot

You can call it: the joint copilot.

Language: Shell | License: GPL-2.0 | Stargazers: 3316 | Issues: 0

padasip

Python Adaptive Signal Processing

Language: Python | License: MIT | Stargazers: 297 | Issues: 0

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License: MIT | Stargazers: 617 | Issues: 0

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language: Python | License: MIT | Stargazers: 263 | Issues: 0

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language: Jupyter Notebook | License: BSD-2-Clause | Stargazers: 2474 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 12472 | Issues: 0

chatgpt-on-wechat

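A chatbot built on large models that can be connected to WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. Selectable models include GPT3.5/GPT-4o/GPT4.0/Claude/ERNIE Bot (Wenxin Yiyan)/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI. It can process text, voice, and images, access the operating system and the internet, and supports building a customized enterprise customer-service assistant on your own knowledge base.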

Language: Python | License: MIT | Stargazers: 28438 | Issues: 0

torchcrepe

PyTorch implementation of the CREPE pitch tracker

Language: Python | License: MIT | Stargazers: 390 | Issues: 0

AIGC-progress

Follow the rapid development of AIGC models and applications. 🚀

Stargazers: 81 | Issues: 0

MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

Language: Cuda | Stargazers: 25 | Issues: 0

Language: Perl | Stargazers: 13 | Issues: 0

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 601 | Issues: 0

Speaker-Diarization

Speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Language: Python | License: Apache-2.0 | Stargazers: 458 | Issues: 0

wesignal

Production first, nn-based on-device signal processing toolkit.

License: Apache-2.0 | Stargazers: 63 | Issues: 0

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language: Python | Stargazers: 536 | Issues: 0

deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Language: Python | License: MIT | Stargazers: 897 | Issues: 0

RBN

The official repo of the CVPR2021 oral paper: Representative Batch Normalization with Feature Calibration

Language: Python | Stargazers: 85 | Issues: 0

Audio-Effects

Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.

Language: C++ | Stargazers: 698 | Issues: 0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language: Python | License: NOASSERTION | Stargazers: 9907 | Issues: 0

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language: Python | License: Apache-2.0 | Stargazers: 948 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8280 | Issues: 0

Language: Python | Stargazers: 23 | Issues: 0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in PyTorch

Language: Python | License: MIT | Stargazers: 2198 | Issues: 0

IntelNeuromorphicDNSChallenge

Intel Neuromorphic DNS Challenge

Language: Jupyter Notebook | License: MIT | Stargazers: 120 | Issues: 0

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

Stargazers: 416 | Issues: 0