kjw11

kjw11

Geek Repo

Location:Shatin, HK

Github PK Tool:Github PK Tool

kjw11's starred repositories

shap

A game theoretic approach to explain the output of any machine learning model.

Language:Jupyter NotebookLicense:MITStargazers:22045Issues:0Issues:0

CSEnet-ASR

Cross-Speaker Encoding Network for Multi-talker Speech Recognition

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

classifier-balancing

This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020

Language:PythonLicense:NOASSERTIONStargazers:930Issues:0Issues:0

RIDE-LongTailRecognition

[ICLR 2021 Spotlight] Code release for "Long-tailed Recognition by Routing Diverse Distribution-Aware Experts."

Language:PythonLicense:MITStargazers:254Issues:0Issues:0

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language:PythonStargazers:488Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:31635Issues:0Issues:0

SRILM

Mirror of srilm source code :-)

Stargazers:6Issues:0Issues:0

whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Language:PythonLicense:BSD-2-ClauseStargazers:286Issues:0Issues:0

randomized_positional_encodings

Randomized Positional Encodings Boost Length Generalization of Transformers

Language:PythonLicense:Apache-2.0Stargazers:70Issues:0Issues:0

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License:MITStargazers:606Issues:0Issues:0

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37393Issues:0Issues:0

Visualizer

assistant tools for attention visualization in deep learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:899Issues:0Issues:0

whisper-finetuning

[WIP] Scripts for fine-tuning Whisper

Language:PythonLicense:MITStargazers:199Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14830Issues:0Issues:0

MyChatGPT

A casual and simple ChatGPT Python script that can run using terminal (as long as you have an API). Support Azure API.

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:25696Issues:0Issues:0

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

License:MITStargazers:1237Issues:0Issues:0

LayerCAM-jittor

The official code for our TIP paper 'LayerCAM: Exploring Hierarchical Class Activation Maps for Localization'

Language:PythonStargazers:108Issues:0Issues:0

Speech-Resources

语音方向实验室/公司/资源/实习等,欢迎推荐或自荐

Stargazers:447Issues:0Issues:0

AliMeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

Language:PythonStargazers:107Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:824Issues:0Issues:0

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonLicense:Apache-2.0Stargazers:898Issues:0Issues:0

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language:CudaLicense:Apache-2.0Stargazers:1068Issues:0Issues:0
License:NOASSERTIONStargazers:9Issues:0Issues:0

LibriMix

An open source dataset for source separation

Language:PythonLicense:MITStargazers:344Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:29Issues:0Issues:0

RIR-Generator

Generating room impulse responses

Language:C++License:MITStargazers:409Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8072Issues:0Issues:0

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

License:MITStargazers:2546Issues:0Issues:0

ReduNet

ReduNet

Language:PythonStargazers:534Issues:0Issues:0