Audio Information Research Lab (AirLabUR)

Audio Information Research Lab

AirLabUR

Geek Repo

We work on computer audition, i.e., designing computational systems that can analyze and understand sounds including music, speech, and environmental sounds.

Location:Rochester, New York

Home Page:https://labsites.rochester.edu/air/index.html

Github PK Tool:Github PK Tool

Audio Information Research Lab's repositories

GenerativeSourceSeparation

Open source code for the paper 'Music Source Separation with Generative Flow'

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

AIR-ASVspoof

Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

amt-tools

Machine learning tools and framework for automatic music transcription.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ASVspoof2021_AIR

Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DyViSE

Official implementation of our MMSP 2022 paper, "Dynamic vision-guided speaker embedding for audio-visual speaker diarization"

Language:PythonStargazers:0Issues:0Issues:0

emotalkingface

The code for the TMM paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Filler-semi-CRF

Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gss

Demo page

Language:SCSSLicense:MITStargazers:0Issues:0Issues:0

hrtf_field

Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

HRTF_field_norm

Official Implementation of our WASPAA 2023 paper "Mitigating Cross-Database Differences for Learning Unified HRTF Representation"

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

InvitedTalk

Invited talk at group meeting of AIR lab

Stargazers:0Issues:0Issues:0

SASV_PR

Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

guitar-transcription-with-inhibition

Code for the paper "A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablature Transcription Systems".

License:MITStargazers:0Issues:0Issues:0

HBAS_chapter_voice3

Official implementation of the handbook chapter "Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation"

License:MITStargazers:0Issues:0Issues:0

samo

Official Implementation of our ICASSP 2023 paper "SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING"

License:MITStargazers:0Issues:0Issues:0

sparse-analytic-filters

Code for the paper "Learning Sparse Analytic Filters for Piano Transcription".

License:MITStargazers:0Issues:0Issues:0

Y-vector

Y-vector: Multiscale Waveform Encoder for Speaker Embedding

Stargazers:0Issues:0Issues:0