户建坤 (ruclion)

ruclion

Geek Repo

Company:Tsinghua University

Location:深圳

Home Page:https://blog.csdn.net/u013625492

Github PK Tool:Github PK Tool

户建坤's starred repositories

tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:182411Issues:7640Issues:39113

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:2818Issues:39Issues:181

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CLicense:NOASSERTIONStargazers:1872Issues:48Issues:80

cnpy

library to read/write .npy and .npz files in C/C++

Language:C++License:MITStargazers:1249Issues:29Issues:64

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:1002Issues:20Issues:124

VNN

VNN是由欢聚集团(Joyy Inc.)推出的高性能、轻量级神经网络部署框架。目前已为Hago、VOO、VFly、马克相机等App提供20余种AI能力的支持,覆盖直播、短视频、视频编辑等泛娱乐场景和工程场景

Language:CLicense:NOASSERTIONStargazers:957Issues:30Issues:33

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

cppflow

Run TensorFlow models in C++ without installation and without Bazel

Language:C++License:MITStargazers:760Issues:25Issues:187

Focal-Loss-Pytorch

全中文注释.(The loss function of retinanet based on pytorch).(You can use it on one-stage detection task or classifical task, to solve data imbalance influence).用于one-stage目标检测算法,提升检测效果.你也可以在分类任务中使用该损失函数,解决数据不平衡问题.

Language:Jupyter NotebookStargazers:422Issues:5Issues:19

voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

Language:PythonLicense:Apache-2.0Stargazers:367Issues:25Issues:25

streamlit-audio-recorder

Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)

Language:TypeScriptLicense:MITStargazers:327Issues:1Issues:17

kaldiio

A pure python module for reading and writing kaldi ark files

Language:PythonLicense:NOASSERTIONStargazers:243Issues:12Issues:16

Speech-enhancement

Deep neural network based speech enhancement toolkit

Language:MATLABLicense:GPL-2.0Stargazers:209Issues:8Issues:28

dscore

Diarization scoring tools.

Language:PythonLicense:BSD-2-ClauseStargazers:194Issues:8Issues:4

GPV

Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper

Language:PythonLicense:GPL-3.0Stargazers:140Issues:5Issues:9

psla

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Language:PythonLicense:BSD-3-ClauseStargazers:124Issues:1Issues:12

Datadriven-GPVAD

The codebase for Data-driven general-purpose voice activity detection.

Language:PythonLicense:MITStargazers:89Issues:8Issues:15

AI_beatmap_generator

尝试使用神经网络生成音乐游戏Malody的谱面。

Language:Jupyter NotebookLicense:MITStargazers:43Issues:2Issues:1

ram_modified

"Recurrent Models of Visual Attention" in TensorFlow

sound_event_detection

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

Language:PythonLicense:Apache-2.0Stargazers:35Issues:1Issues:1

DiViMe

ACLEW Diarization Virtual Machine

Language:ShellLicense:Apache-2.0Stargazers:30Issues:13Issues:152

mica-speech-activity-detection

Robust Speech Activity Detection (SAD) in movie audio

DomainAdversarialVoiceActivityDetection

Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"

Language:Jupyter NotebookLicense:MITStargazers:23Issues:4Issues:2

audio_augment

A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN

musan_investigation_cnn_rnn

Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN corpus.

Language:PythonLicense:MITStargazers:14Issues:3Issues:2

MultiTarget_VAD

Representation of Paper: On training targets for noise-robust voice activity detection.

Language:Jupyter NotebookLicense:MITStargazers:4Issues:1Issues:1