ta603

ta603

Geek Repo

Github PK Tool:Github PK Tool

ta603's starred repositories

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:PythonStargazers:261Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:9Issues:0Issues:0

CHAD

Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

PyMusicLooper

A python program for repeating music endlessly and creating seamless music loops, with play/export/tagging support.

Language:PythonLicense:MITStargazers:228Issues:0Issues:0

geomloss

Geometric loss functions between point clouds, images and volumes

Language:PythonLicense:MITStargazers:577Issues:0Issues:0

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language:PythonLicense:NOASSERTIONStargazers:445Issues:0Issues:0

symusic

A swift and unified toolkit for symbolic music processing

Language:C++License:MITStargazers:113Issues:0Issues:0

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonLicense:MITStargazers:1020Issues:0Issues:0

Muskits

An opensource music processing toolkit

Language:PythonLicense:Apache-2.0Stargazers:306Issues:0Issues:0

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

Language:PythonStargazers:52Issues:0Issues:0

phoneme-informed-note-level-singing-transcription

A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

icassp2022-vocal-transcription

Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music"

Language:PythonStargazers:137Issues:0Issues:0

CSD_reannotation

Re-annotation for CSD dataset for singing transcription

License:NOASSERTIONStargazers:5Issues:0Issues:0

Dance2Music

Automatic Dance-driven Music Generation

Language:PythonStargazers:16Issues:0Issues:0

video-bgm-generation

Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)

Language:PythonLicense:MITStargazers:282Issues:0Issues:0

CDCD

[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

Language:PythonStargazers:154Issues:0Issues:0

D2M-GAN

[ECCV2022] D2M-GAN for music generation from dance videos

Language:PythonStargazers:85Issues:0Issues:0

ismir2017-deepsalience

Companion code for ISMIR 2017 paper "Deep Salience Representations for $F_0$ Estimation in Polyphonic Music"

Language:Jupyter NotebookLicense:MITStargazers:83Issues:0Issues:0

Melody-extraction-with-melodic-segnet

The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"

Language:PythonLicense:MITStargazers:69Issues:0Issues:0

hFT-Transformer

Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).

Language:PythonLicense:MITStargazers:68Issues:0Issues:0

Bailando

Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"

Language:PythonLicense:NOASSERTIONStargazers:381Issues:0Issues:0

EDGE

Official PyTorch Implementation of EDGE (CVPR 2023)

Language:PythonLicense:MITStargazers:422Issues:0Issues:0

CQTdiff

Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23

Language:Jupyter NotebookLicense:MITStargazers:101Issues:0Issues:0

jamendolyrics

Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation

Language:PythonLicense:NOASSERTIONStargazers:71Issues:0Issues:0

lyrics-melody

Lyrics and Vocal Melody Generation conditioned on Accompaniment

Language:PythonLicense:GPL-3.0Stargazers:27Issues:0Issues:0

anticipation

Anticipatory Autoregressive Models

Language:PythonLicense:Apache-2.0Stargazers:139Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:4Issues:0Issues:0

genmusic_demo_list

a list of demo websites for automatic music generation research

Stargazers:586Issues:0Issues:0

open-musiclm

Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.

Language:PythonLicense:MITStargazers:504Issues:0Issues:0

charsiu

Charsiu: A neural phonetic aligner.

Language:Jupyter NotebookLicense:MITStargazers:264Issues:0Issues:0