ieyniie

0

followers

0

following

stars

ieyniie's repositories

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

MIT000

stable-diffusion-webui

Stable Diffusion web UI

AGPL-3.0000

slidev

Presentation Slides for Developers

MIT000

jetson-containers

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

MIT000

ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

MIT000

C8DASR-Baseline-NeMo

NeMo: a toolkit for conversational AI

Apache-2.0000

OpenVoice

Instant voice cloning by MyShell.

MIT000

SRP-DNN

A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]

MIT000

speechbrain

A PyTorch-based Speech Toolkit

Apache-2.0000

TTS_coqui

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

MPL-2.0000

faceswap

Deepfakes Software For All

GPL-3.0000

VoiceprintRecognition-Pytorch

本项目使用了EcapaTdnn模型实现的声纹识别

Language:PythonApache-2.0000

pygsound

Impulse response generation based on state-of-the-art geometric sound propagation engine.

Language:C++NOASSERTION000

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT000

VITS-Pytorch

本项目是基于Pytorch的语音合成项目，使用的是VITS，VITS是一种语音合成方法，这种时端到端的模型使用起来非常简单，不需要文本对齐等太复杂的流程，直接一键训练和生成，大大降低了学习门槛。

Apache-2.0000

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Apache-2.0000

TAC

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Language:Python000

NBSS

The official repo of NBC & SpatialNet

Language:Python000

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

MIT000

DeepShip

000

SSSfastMNMF

The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.

NOASSERTION000

Beam-Guided-TasNet

Beam-guided TasNet

BSD-3-Clause000

odas

ODAS: Open embeddeD Audition System

GPL-3.0000

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

AGPL-3.0000

IR-GAN

Augmenting Room Impulse Response

MIT000

Beamforming-for-speech-enhancement

simple delaysum, MVDR and CGMM-MVDR

000

nn-gev

Neural network supported GEV beamformer

NOASSERTION000