AirLabUR

Audio Information Research Lab's repositories

GenerativeSourceSeparation

Open source code for the paper 'Music Source Separation with Generative Flow'

Language:Jupyter NotebookMIT100

AIR-ASVspoof

Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"

Language:PythonMIT000

amt-tools

Machine learning tools and framework for automatic music transcription.

Language:PythonMIT000

ASVspoof2021_AIR

Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"

Language:PythonMIT000

DyViSE

Official implementation of our MMSP 2022 paper, "Dynamic vision-guided speaker embedding for audio-visual speaker diarization"

Language:Python000

emotalkingface

The code for the TMM paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"

Language:PythonMIT000

Filler-semi-CRF

Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]

Language:PythonMIT000

gss

Demo page

Language:SCSSMIT000

hrtf_field

Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"

Language:PythonMIT000

HRTF_field_norm

Official Implementation of our WASPAA 2023 paper "Mitigating Cross-Database Differences for Learning Unified HRTF Representation"

Language:PythonBSD-3-Clause000

InvitedTalk

Invited talk at group meeting of AIR lab

000

SASV_PR

Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"

Language:PythonMIT000

guitar-transcription-with-inhibition

Code for the paper "A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablature Transcription Systems".

MIT000

HBAS_chapter_voice3

Official implementation of the handbook chapter "Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation"

MIT000

samo

Official Implementation of our ICASSP 2023 paper "SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING"

MIT000

sparse-analytic-filters

Code for the paper "Learning Sparse Analytic Filters for Piano Transcription".

MIT000

Y-vector

Y-vector: Multiscale Waveform Encoder for Speaker Embedding

000