SLP-RL@HUJI's repositories
SLM-Discrete-Representations
This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Modeling" (ICASSP 2023)
SpokenStoryCloze
A spoken version of the textual Story Cloze benchmark
budget-realloc
The official repo of the COLM 2024 paper "The Larger the Better? Improved LLM Code-Generation via Budget Reallocation"
AudioToken
This repo is a fork of the official PyTorch implementation of "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation" (Interspeech 2023)
TempoTokens
This repo is a fork containing the official PyTorch implementation of "Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation"
DSVAE-NES
This is a fork of the official PyTorch implementation of the paper "Learning Discrete Structured VAE using NES" (ICLR 2022)
im2wav
This is a fork of the official implementation of the pipeline presented in "I hear your true colors: Image Guided Audio Generation" (ICASSP 2023)