SLP-RL@HUJI

SLP-RL@HUJI's repositories

aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Language: Python | License: MIT | 219 5 30

slamkit

SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

Language: Python | License: MIT | 218 8 3

HebTTS

The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"

Language: Python | 101 2 7

salmon

The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)

Language: Python | 47 10

SC-PhASE

This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)

Language: Python | License: NOASSERTION | 28 10

SLM-Discrete-Representations

This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Modeling" (ICASSP 2023)

Language: Python | 17 10

SpokenStoryCloze

A spoken version of the textual story cloze benchmark

License: MIT | 1401

budget-realloc

The official repo of the COLM 2024 paper "The Larger the Better? Improved LLM Code-Generation via Budget Reallocation"

License: MIT | 800

AudioToken

This repo is a fork from the official PyTorch implementation of "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation" (Interspeech 2023)

Language: Python | License: MIT | 500

DISSC

This is a fork of the official repository of "Speaking Style Conversion With Discrete Self-Supervised Units"

Language: Python | License: MIT | 100

TempoTokens

This repo is a fork containing the official PyTorch implementation of "Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation"

Language: Python | License: MIT | 100

DSVAE-NES

This is a fork from the official PyTorch implementation of the paper: "Learning Discrete Structured VAE using NES" (ICLR 2022)

Language: Python | 000

im2wav

This is a fork from the official implementation of the pipeline presented in "I hear your true colors: Image Guided Audio Generation" (ICASSP 2023)

Language: Python | License: MIT | 000