ieyniie

ieyniie's repositories

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

License: MIT | Stargazers: 0 | Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

slidev

Presentation Slides for Developers

License: MIT | Stargazers: 0 | Issues: 0

jetson-containers

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

License: MIT | Stargazers: 0 | Issues: 0

ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

License: MIT | Stargazers: 0 | Issues: 0

C8DASR-Baseline-NeMo

NeMo: a toolkit for conversational AI

License: Apache-2.0 | Stargazers: 0 | Issues: 0

OpenVoice

Instant voice cloning by MyShell.

License: MIT | Stargazers: 0 | Issues: 0

SRP-DNN

A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]

License: MIT | Stargazers: 0 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

TTS_coqui

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License: MPL-2.0 | Stargazers: 0 | Issues: 0

faceswap

Deepfakes Software For All

License: GPL-3.0 | Stargazers: 0 | Issues: 0

VoiceprintRecognition-Pytorch

This project implements voiceprint recognition using the EcapaTdnn model.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

pygsound

Impulse response generation based on state-of-the-art geometric sound propagation engine.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

VITS-Pytorch

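This is a PyTorch-based speech synthesis project built on VITS, a speech synthesis method. The end-to-end model is very simple to use: it needs no complicated steps such as text alignment and supports one-click training and generation, which greatly lowers the barrier to entry.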

License: Apache-2.0 | Stargazers: 0 | Issues: 0

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

License: Apache-2.0 | Stargazers: 0 | Issues: 0

TAC

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Language: Python | Stargazers: 0 | Issues: 0

NBSS

The official repo of NBC & SpatialNet

Language: Python | Stargazers: 0 | Issues: 0

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

License: MIT | Stargazers: 0 | Issues: 0

SSSfastMNMF

The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.

License: NOASSERTION | Stargazers: 0 | Issues: 0

Beam-Guided-TasNet

Beam-guided TasNet

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

odas

ODAS: Open embeddeD Audition System

License: GPL-3.0 | Stargazers: 0 | Issues: 0

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

IR-GAN

Augmenting Room Impulse Response

License: MIT | Stargazers: 0 | Issues: 0

Beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR, and CGMM-MVDR beamforming

Stargazers: 0 | Issues: 0

nn-gev

Neural network supported GEV beamformer

License: NOASSERTION | Stargazers: 0 | Issues: 0