wyj1996

followers

following

stars

wyj1996's repositories

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language:PythonMIT200

Huawei-Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language:Jupyter Notebook100

ppg-vc

PPG-Based Voice Conversion

Language:PythonApache-2.0100

2d-slice-set-networks

code for the 2D slice set networks

Language:Python000

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:Python000

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonNOASSERTION000

Brain-TokenGT

"Beyond the Snapshot: Brain Tokenized Graph Transformer for Longitudinal Brain Functional Connectome Embedding" (MICCAI 2023)

000

BrainBERT

[ICLR 2023] Code for BrainBERT

Language:Jupyter Notebook000

BrainGB

Officially Accepted to IEEE Transactions on Medical Imaging (TMI, IF: 11.037) - Special Issue on Geometric Deep Learning in Medical Imaging.

Language:MATLABMIT000

BrainLM

https://huggingface.co/ahof1704/brainlm/tree/main for ckpt download

Language:Jupyter NotebookApache-2.0000

BraVL

Code and Data for "Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features"

MIT000

Com-BrainTF

The official Pytorch implementation of paper "Community-Aware Transformer for Autism Prediction in fMRI Connectome" accepted by MICCAI 2023

000

CUHK-PhD-Thesis-Template

Latex template for CUHK PhD Thesis

Language:TeX000

DISSC

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units". https://arxiv.org/abs/2212.09730

Language:PythonMIT000

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

MIT000

fMRI-reconstruction-NSD

fMRI-to-image reconstruction on the NSD dataset.

Language:Jupyter NotebookMIT000

learning-from-brains

Self-supervised learning techniques for neuroimaging data inspired by prominent learning frameworks in natural language processing + One of the broadest neuroimaging datasets used for pre-training to date.

000

llama

Inference code for LLaMA models

Language:PythonNOASSERTION000

mind-vis

Code base for MinD-Vis

Language:PythonMIT000

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

MIT000

netrep

Some methods for comparing network representations in deep learning and neuroscience.

Language:PythonMIT000

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Apache-2.0000

phoneme_segmentation

Language:Python000

semantic-decoding

000

soft-vc

Soft speech units for voice conversion

Language:Jupyter NotebookMIT000

speech-recognition-for-people-with-dysarthria

000

Unit-DSR-demo

Unit-DSR demo page

Language:HTML000

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonApache-2.0000

Video-LLaMA

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause000

xai-brain-decoding-benchmark

Benchmarking explanation methods for mental state decoding with deep learning models.

Language:Python000