wyj1996's repositories
k-diffusion
Karras et al. (2022) diffusion models for PyTorch
Huawei-Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
2d-slice-set-networks
code for the 2D slice set networks
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
av_hubert
A self-supervised learning framework for audio-visual speech
Brain-TokenGT
"Beyond the Snapshot: Brain Tokenized Graph Transformer for Longitudinal Brain Functional Connectome Embedding" (MICCAI 2023)
BrainBERT
[ICLR 2023] Code for BrainBERT
BrainGB
Officially Accepted to IEEE Transactions on Medical Imaging (TMI, IF: 11.037) - Special Issue on Geometric Deep Learning in Medical Imaging.
BrainLM
https://huggingface.co/ahof1704/brainlm/tree/main for ckpt download
BraVL
Code and Data for "Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features"
Com-BrainTF
The official Pytorch implementation of paper "Community-Aware Transformer for Autism Prediction in fMRI Connectome" accepted by MICCAI 2023
CUHK-PhD-Thesis-Template
Latex template for CUHK PhD Thesis
DISSC
Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units". https://arxiv.org/abs/2212.09730
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
fMRI-reconstruction-NSD
fMRI-to-image reconstruction on the NSD dataset.
learning-from-brains
Self-supervised learning techniques for neuroimaging data inspired by prominent learning frameworks in natural language processing + One of the broadest neuroimaging datasets used for pre-training to date.
llama
Inference code for LLaMA models
mind-vis
Code base for MinD-Vis
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
netrep
Some methods for comparing network representations in deep learning and neuroscience.
OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
soft-vc
Soft speech units for voice conversion
Unit-DSR-demo
Unit-DSR demo page
vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Video-LLaMA
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
xai-brain-decoding-benchmark
Benchmarking explanation methods for mental state decoding with deep learning models.