chevalierNoir's starred repositories
markdown-here
Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
speechbrain
A PyTorch-based Speech Toolkit
awesome-self-supervised-learning
A curated list of awesome self-supervised methods
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
pytorch-openpose
pytorch implementation of openpose including Hand and Body Pose Estimation.
TimeSformer
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
genmusic_demo_list
a list of demo websites for automatic music generation research
SpeechTransProgress
Tracking the progress in end-to-end speech translation
Multi-Modal-Transformer
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.
FS-Detection
Code for paper "Fingerspelling detection in American Sign Language"
asl-iter-attn
ASL Fingerspelling recognition in the wild
A2W-Segmental
Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings (SLT'2021)