RicherMans

followers

following

stars

Xiaomi

China, Beijing

richermans.github.io

Heinrich Dinkel's repositories

GPV

Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper

Language:PythonGPL-3.0140 5 9

Datadriven-GPVAD

The codebase for Data-driven general-purpose voice activity detection.

Language:PythonMIT90 8 15

AudioCaption

Dataset and baseline for the first Audiocaption task

Language:PythonMIT77 7 1

CED

Source code for Consistent ensemble distillation for audio tagging

Language:PythonGPL-3.061 3 6

text_based_depression

Source code for the paper "Text-based Depression Detection: What Triggers An Alert"

Language:Python45 3 5

SAT

Streaming Audiotransformers for online Audio tagging

Language:PythonGPL-3.035 4 3

PSL

Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"

Language:PythonGPL-3.030 4 3

CDur

Repository for the paper "Towards duration robust weakly supervised sound event detection"

Language:PythonGPL-3.022 1 4

UIT_Mobile

Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"

Language:PythonGPL-3.022 3 2

Speaker-Anti-Spoofing-Classifiers

Baselines and Classifiers for speaker anti-spoofing detection

Language:Python18 30

Dcase2018_pooling

Repo for our pooling approach on the DCASE2018 task4

Language:PythonApache-2.015 20

HEAR2021_EfficientLatent

Submission to the HEAR2021 Challenge

Language:PythonApache-2.015 2 1

SpokenLanguageClassifiers

Pretrained spoken language classifiers from audio.

Language:PythonMIT8 3 2

HEAR_CED

Hear evaluation for CED models.

Language:PythonGPL-3.05 10

ImageNet21K

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper

Language:PythonMIT2 10

coc-pyright

Pyright extension for coc.nvim

Language:TypeScriptMIT1 10

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT1 10

kaldi-io-for-python

Python functions for reading kaldi data formats. Useful for rapid prototyping with python.

Language:Python1 20

Nanopi-R4S

My NanoPi R4S builds

Language:Shell1 20

richermans.github.io

My Blog / Jekyll Themes / PWA

Language:CSSApache-2.01 20

audioset_tagging_cnn

Language:PythonMIT010

hearbenchmark.com

HEAR Benchmark website and leaderboard submissions

Apache-2.0000

ignite

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

Language:PythonBSD-3-Clause010

models

Models and examples built with TensorFlow

Language:PythonApache-2.0020

nanopi-openwrt

Openwrt for Nanopi R4S

Language:Shell000

pretorched-x

Pretrained Image & Video ConvNets for PyTorch: NASNet, ResNeXt (2D + 3D), ResNet (2D + 3D), InceptionV4, InceptionResnetV2, Xception, DPN, NonLocalNets, R(2+1)D nets, MultiView CNNs, Temporal Relation Networks, etc.

Language:PythonMIT030

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonNOASSERTION030

tensorboard-pytorch

tensorboard for pytorch (and chainer, mxnet, numpy, ...)

Language:PythonMIT020

torchaudio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonBSD-2-Clause000

torchlibrosa

Language:PythonISC010