wyj1996's repositories

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language: Python | License: MIT | Stargazers: 2 | Issues: 0

Huawei-Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

ppg-vc

PPG-Based Voice Conversion

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0

2d-slice-set-networks

Code for the 2D slice set networks

Language: Python | Stargazers: 0 | Issues: 0

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language: Python | Stargazers: 0 | Issues: 0

av_hubert

A self-supervised learning framework for audio-visual speech

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

Brain-TokenGT

"Beyond the Snapshot: Brain Tokenized Graph Transformer for Longitudinal Brain Functional Connectome Embedding" (MICCAI 2023)

Stargazers: 0 | Issues: 0

BrainBERT

[ICLR 2023] Code for BrainBERT

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

BrainGB

Officially Accepted to IEEE Transactions on Medical Imaging (TMI, IF: 11.037) - Special Issue on Geometric Deep Learning in Medical Imaging.

Language: MATLAB | License: MIT | Stargazers: 0 | Issues: 0

BrainLM

Checkpoint download: https://huggingface.co/ahof1704/brainlm/tree/main

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0

BraVL

Code and Data for "Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features"

License: MIT | Stargazers: 0 | Issues: 0

Com-BrainTF

The official PyTorch implementation of the paper "Community-Aware Transformer for Autism Prediction in fMRI Connectome", accepted at MICCAI 2023

Stargazers: 0 | Issues: 0

CUHK-PhD-Thesis-Template

LaTeX template for CUHK PhD Thesis

Language: TeX | Stargazers: 0 | Issues: 0

DISSC

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units". https://arxiv.org/abs/2212.09730

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

License: MIT | Stargazers: 0 | Issues: 0

fMRI-reconstruction-NSD

fMRI-to-image reconstruction on the NSD dataset.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

learning-from-brains

Self-supervised learning techniques for neuroimaging data inspired by prominent learning frameworks in natural language processing + One of the broadest neuroimaging datasets used for pre-training to date.

Stargazers: 0 | Issues: 0

llama

Inference code for LLaMA models

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

mind-vis

Code base for MinD-Vis

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

License: MIT | Stargazers: 0 | Issues: 0

netrep

Some methods for comparing network representations in deep learning and neuroscience.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

soft-vc

Soft speech units for voice conversion

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

Unit-DSR-demo

Unit-DSR demo page

Language: HTML | Stargazers: 0 | Issues: 0

vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

Video-LLaMA

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0

xai-brain-decoding-benchmark

Benchmarking explanation methods for mental state decoding with deep learning models.

Language: Python | Stargazers: 0 | Issues: 0