ta603

followers

0

following

stars

ta603's starred repositories

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:Python26100

RefinPaint

Language:PythonNOASSERTION900

CHAD

Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)

Language:PythonMIT1200

PyMusicLooper

A python program for repeating music endlessly and creating seamless music loops, with play/export/tagging support.

Language:PythonMIT22800

geomloss

Geometric loss functions between point clouds, images and volumes

Language:PythonMIT57700

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language:PythonNOASSERTION44500

symusic

A swift and unified toolkit for symbolic music processing

Language:C++MIT11300

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonMIT102000

Muskits

An opensource music processing toolkit

Language:PythonApache-2.030600

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

Language:Python5200

phoneme-informed-note-level-singing-transcription

A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023

Language:PythonMIT2400

icassp2022-vocal-transcription

Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music"

Language:Python13700

CSD_reannotation

Re-annotation for CSD dataset for singing transcription

NOASSERTION500

Dance2Music

Automatic Dance-driven Music Generation

Language:Python1600

video-bgm-generation

Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)

Language:PythonMIT28200

CDCD

[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

Language:Python15400

D2M-GAN

[ECCV2022] D2M-GAN for music generation from dance videos

Language:Python8500

ismir2017-deepsalience

Companion code for ISMIR 2017 paper "Deep Salience Representations for $F_0$ Estimation in Polyphonic Music"

Language:Jupyter NotebookMIT8300

Melody-extraction-with-melodic-segnet

The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"

Language:PythonMIT6900

hFT-Transformer

Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).

Language:PythonMIT6800

Bailando

Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"

Language:PythonNOASSERTION38100

EDGE

Official PyTorch Implementation of EDGE (CVPR 2023)

Language:PythonMIT42200

CQTdiff

Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23

Language:Jupyter NotebookMIT10100

jamendolyrics

Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation

Language:PythonNOASSERTION7100

lyrics-melody

Lyrics and Vocal Melody Generation conditioned on Accompaniment

Language:PythonGPL-3.02700

anticipation

Anticipatory Autoregressive Models

Language:PythonApache-2.013900

Self-supervised_Metric_Learning

Language:PythonNOASSERTION400

genmusic_demo_list

a list of demo websites for automatic music generation research

open-musiclm

Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.

Language:PythonMIT50400

charsiu

Charsiu: A neural phonetic aligner.

Language:Jupyter NotebookMIT26400