Ian Shih's repositories
ThemeTransformer
The official implementation of Theme Transformer: theme-based music generation. IEEE TMM
SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
SSL_Interface
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech 2024
MusicChain
🎹🎵🎶 A platform to make Original and Cover Visible and Valuable.
midi2Tiles
A tool for creating a Synthesia-like piano tiles effect from MIDI files.
sensor_your_music
A tool for transmitting phone sensor data to Pure Data with WebRTC
theme_extraction
A tool for extracting musical themes
visual_midi
Converts a pretty_midi sequence to a Bokeh plot.
av_hubert_revised
A self-supervised learning framework for audio-visual speech
cafe
Deploy your own Notion-powered website in minutes with Next.js and Vercel.
devise
A fast, minimal, responsive Hugo theme for blogs.
INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
LLaMA-Adapter
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
zerospeech2021
Zerospeech Challenge 2021: validation and evaluation software