ta603's starred repositories
VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
PyMusicLooper
A python program for repeating music endlessly and creating seamless music loops, with play/export/tagging support.
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
phoneme-informed-note-level-singing-transcription
A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023
icassp2022-vocal-transcription
Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music"
CSD_reannotation
Re-annotation for CSD dataset for singing transcription
Dance2Music
Automatic Dance-driven Music Generation
video-bgm-generation
Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)
ismir2017-deepsalience
Companion code for ISMIR 2017 paper "Deep Salience Representations for $F_0$ Estimation in Polyphonic Music"
Melody-extraction-with-melodic-segnet
The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"
hFT-Transformer
Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
jamendolyrics
Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
lyrics-melody
Lyrics and Vocal Melody Generation conditioned on Accompaniment
anticipation
Anticipatory Autoregressive Models
genmusic_demo_list
a list of demo websites for automatic music generation research
open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.