hbwu-ntu

Company: National Taiwan University

Location: Seattle, WA, US

Home Page: https://hbwu-ntu.github.io/

Organizations
s3prl

hbwu-ntu's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 64982 | Issues: 542 | Issues: 0

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 33486 | Issues: 748 | Issues: 1226

ChatGPT

Reverse engineered ChatGPT API

Language: Python | License: GPL-2.0 | Stargazers: 27989 | Issues: 290 | Issues: 810

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

From-0-to-Research-Scientist-resources-guide

Detailed and tailored guide for undergraduate students or anybody who wants to dig deep into the field of AI with a solid foundation.

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | License: Apache-2.0 | Stargazers: 7418 | Issues: 97 | Issues: 1483

RepDistiller

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods

Language: Python | License: BSD-2-Clause | Stargazers: 2119 | Issues: 17 | Issues: 56

awesome_lists

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (Survival guide for assistant professors and PhD students)

Language: Python | License: MIT | Stargazers: 1393 | Issues: 33 | Issues: 1

sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn, without an Internet connection. Supports iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A, etc.

Language: C++ | License: Apache-2.0 | Stargazers: 915 | Issues: 36 | Issues: 136

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language: Python | License: MIT | Stargazers: 908 | Issues: 11 | Issues: 105

PromptCLUE

PromptCLUE, a zero-shot learning model supporting all Chinese-language tasks

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 645 | Issues: 9 | Issues: 19

pytorch-domain-adaptation

A collection of implementations of adversarial domain adaptation algorithms

Language: Python | License: MIT | Stargazers: 597 | Issues: 12 | Issues: 11

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Language: Cuda | License: AGPL-3.0 | Stargazers: 466 | Issues: 10 | Issues: 51

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Language: Python | License: BSD-3-Clause | Stargazers: 358 | Issues: 7 | Issues: 34

AudioClassification-Pytorch

A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.

Language: Python | License: Apache-2.0 | Stargazers: 352 | Issues: 6 | Issues: 27

PaSST

Efficient Training of Audio Transformers with Patchout

Language: Python | License: Apache-2.0 | Stargazers: 287 | Issues: 4 | Issues: 46

beamformers

Easy to use Beamformers for multi-channel speech separation/enhancement

Language: Python | License: MIT | Stargazers: 171 | Issues: 4 | Issues: 4

GradTTS

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

Language: Python | License: MIT | Stargazers: 169 | Issues: 5 | Issues: 3

MSMC-TTS

Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS

Language: Python | License: MIT | Stargazers: 158 | Issues: 15 | Issues: 9

audioset-processing

Toolkit for downloading and processing Google's AudioSet dataset.

Language: Jupyter Notebook | License: MIT | Stargazers: 153 | Issues: 3 | Issues: 6

psla

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Language: Python | License: BSD-3-Clause | Stargazers: 131 | Issues: 1 | Issues: 12

DCCRN-with-various-loss-functions

DCCRN with various loss functions

Language: Python | License: MIT | Stargazers: 89 | Issues: 1 | Issues: 8

Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

Language: Python | License: MIT | Stargazers: 52 | Issues: 5 | Issues: 2

SpeakerGuard

A PyTorch library for security research on speaker recognition, released with "Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition", accepted by TDSC

SpecAugment-plus

A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"

AVCleanse

ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 4 | Issues: 6

neiwen.github.io

Neiwen's homepage

Language: JavaScript | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0