Huang-Cheng, Chou's starred repositories

bitcoin

Bitcoin Core integration/staging tree

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 35920 | Issues: 329 | Issues: 441

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 20065 | Issues: 309 | Issues: 1369

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | Stargazers: 14249 | Issues: 693 | Issues: 1650

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 13052 | Issues: 98 | Issues: 539

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10899 | Issues: 140 | Issues: 355

flower

Flower: A Friendly Federated AI Framework

Language: Python | License: Apache-2.0 | Stargazers: 5056 | Issues: 42 | Issues: 584

promptbench

A unified evaluation framework for large language models

Language: Python | License: MIT | Stargazers: 2440 | Issues: 20 | Issues: 54

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 2254 | Issues: 46 | Issues: 398

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language: Python | License: CC-BY-4.0 | Stargazers: 1096 | Issues: 49 | Issues: 150

conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Language: Python | License: Apache-2.0 | Stargazers: 954 | Issues: 9 | Issues: 37

acl-style-files

Official style files for papers submitted to venues of the Association for Computational Linguistics

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language: HTML | License: MIT | Stargazers: 486 | Issues: 16 | Issues: 35

BitcoinArmory

Python-Based Bitcoin Software

Language: C++ | License: NOASSERTION | Stargazers: 470 | Issues: 59 | Issues: 335

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language: Python | License: Apache-2.0 | Stargazers: 466 | Issues: 16 | Issues: 19

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Language: Jupyter Notebook | License: MIT | Stargazers: 453 | Issues: 9 | Issues: 16

Gemini

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Language: Python | License: MIT | Stargazers: 424 | Issues: 12 | Issues: 8

CREMA-D

Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)

Language: R | License: NOASSERTION | Stargazers: 362 | Issues: 10 | Issues: 7

SONAR

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Language: Python | License: NOASSERTION | Stargazers: 337 | Issues: 14 | Issues: 19

pfl-research

Simulation framework for accelerating research in Private Federated Learning

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 291 | Issues: 22 | Issues: 13

AdamW-and-SGDW

Decoupled Weight Decay Regularization (ICLR 2019)

Language: Lua | License: BSD-3-Clause | Stargazers: 264 | Issues: 7 | Issues: 3

MARLIN

[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg

Language: Python | License: NOASSERTION | Stargazers: 227 | Issues: 9 | Issues: 24

Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

dynamic-superb

The official repository of Dynamic-SUPERB.

LibreFace

[WACV 2024] LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis

Language: Python | License: NOASSERTION | Stargazers: 94 | Issues: 3 | Issues: 4

calibration_library

PyTorch library for model calibration metrics and visualizations, as well as recalibration methods. In progress!

imbalanced-DL

A Python Package for Deep Imbalanced Learning

Language: Python | License: BSD-2-Clause | Stargazers: 52 | Issues: 6 | Issues: 0

bdl-rul-svgd

Bayesian deep learning for remaining useful life estimation via Stein variational gradient descent

Language: Python | License: Apache-2.0 | Stargazers: 18 | Issues: 2 | Issues: 0

Anxiety-Detection-from-free-form-audio-journals

Repository for CS224S project: Detecting anxiety from short clips of free-form speech

Language: Jupyter Notebook | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0