SLP-RL@HUJI (slp-rl)

SLP-RL@HUJI

slp-rl

Organization data from Github https://github.com/slp-rl

The Spoken Language Processing Research Lab (SLP-RL) at the Hebrew University of Jerusalem Israel (HUJI) official repository.

GitHub:@slp-rl

SLP-RL@HUJI's repositories

aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Language:PythonLicense:MITStargazers:219Issues:5Issues:30

slamkit

SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

Language:PythonLicense:MITStargazers:218Issues:8Issues:3

HebTTS

The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"

salmon

The official code for the SALMonšŸ£ benchmark (ICASSP 2025 - Oral)

Language:PythonStargazers:47Issues:1Issues:0

SC-PhASE

This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)

Language:PythonLicense:NOASSERTIONStargazers:28Issues:1Issues:0

SLM-Discrete-Representations

This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Modeling" (ICASSP 2023)

Language:PythonStargazers:17Issues:1Issues:0

SpokenStoryCloze

A spoken version of the textual story cloze benchmark

License:MITStargazers:14Issues:0Issues:1

budget-realloc

The official repo of the COLM 2024 paper: The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

License:MITStargazers:8Issues:0Issues:0

AudioToken

This repo is a fork from the official PyTorch implementation of "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation" (Interspeech 2023)

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

DISSC

This is a from from the official repository of "Speaking Style Conversion With Discrete Self-Supervised Units"

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

TempoTokens

This repo is a fork, containing the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

DSVAE-NES

This is a fork from the official PyTorch implementation of the paper: "Learning Discrete Structured VAE using NES" (ICLR 2022)

Language:PythonStargazers:0Issues:0Issues:0

im2wav

This is a fork from the official implementation of the pipeline presented in "I hear your true colors: Image Guided Audio Generation" (ICASSP 2023)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0