Cyril Lv (IMYBo)

IMYBo

Geek Repo

Company:NWPU

Location:China

Github PK Tool:Github PK Tool

Cyril Lv's starred repositories

ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

Language:PythonLicense:NOASSERTIONStargazers:88Issues:0Issues:0

brouhaha-vad

Predicts the level of noise and reverberation on your audiofiles

Language:Jupyter NotebookLicense:MITStargazers:122Issues:0Issues:0

Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

Stargazers:150Issues:0Issues:0

awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

License:MITStargazers:235Issues:0Issues:0

S4M

Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models

Language:PythonLicense:MITStargazers:16Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:Apache-2.0Stargazers:10855Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20047Issues:0Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10824Issues:0Issues:0

dover-lap

Python package for combining diarization system outputs.

Language:PythonLicense:MITStargazers:74Issues:0Issues:0

VBx

Variational Bayes HMM over x-vectors diarization

Language:PythonStargazers:242Issues:0Issues:0

s4

Structured state space sequence models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2235Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:27Issues:0Issues:0

aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Language:PythonLicense:MITStargazers:188Issues:0Issues:0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License:Apache-2.0Stargazers:1500Issues:0Issues:0

VB_diarization

VB Diarization with Eigenvoice and HMM Priors, refactored

Language:PythonStargazers:15Issues:0Issues:0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:1012Issues:0Issues:0

NSD-MS2S

CHIME-7 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Language:ShellStargazers:55Issues:0Issues:0

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Language:PythonLicense:Apache-2.0Stargazers:197Issues:0Issues:0

MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

Language:HTMLLicense:MITStargazers:455Issues:0Issues:0

BAE-Net

BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION

Stargazers:44Issues:0Issues:0

NeXt_TDNN_ASV

Official repository of NeXt-TDNN for speaker verification

Language:PythonStargazers:40Issues:0Issues:0

kmeans_pytorch

kmeans using PyTorch

Language:Jupyter NotebookLicense:MITStargazers:449Issues:0Issues:0

WeChatMsg

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Language:PythonLicense:GPL-3.0Stargazers:31031Issues:0Issues:0

nider

Python package to add text to images, textures and different backgrounds

Language:PythonLicense:MITStargazers:149Issues:0Issues:0

unfoldNd

(N=1,2,3)-dimensional unfold (im2col) and fold (col2im) in PyTorch

Language:PythonLicense:MITStargazers:80Issues:0Issues:0

Yi

A series of large language models trained from scratch by developers @01-ai

Language:PythonLicense:Apache-2.0Stargazers:7427Issues:0Issues:0
Language:ShellStargazers:43Issues:0Issues:0

BlueLM

BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab

Language:PythonLicense:NOASSERTIONStargazers:810Issues:0Issues:0

UniAudio

The official source code of UniAudio

Language:PythonStargazers:76Issues:0Issues:0