chevalierNoir

followers

0

following

stars

chevalierNoir's starred repositories

markdown-here

Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.

Language:JavaScriptMIT59649 1014 621

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT20606 154 266

dask

Parallel computing with task scheduling

Language:PythonBSD-3-Clause12599 212 5211

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonApache-2.08926 135 1101

awesome-self-supervised-learning

A curated list of awesome self-supervised methods

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonApache-2.02268 46 398

pytorch-openpose

pytorch implementation of openpose including Hand and Body Pose Estimation.

Language:Jupyter Notebook2111 25 78

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonNOASSERTION1555 27 129

mt3

MT3: Multi-Task Multitrack Music Transcription

Language:PythonApache-2.01440 27 91

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonNOASSERTION848 15 111

genmusic_demo_list

a list of demo websites for automatic music generation research

Ego4d

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Language:Jupyter NotebookMIT359 22 167

muavic

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Language:PythonNOASSERTION359 13 23

SpeechTransProgress

Tracking the progress in end-to-end speech translation

CC0-1.0254 27 2

Multi-Modal-Transformer

The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.

audio-visual

Language:CMIT59 10 10

OpenASL

A Large-Scale Open-Domain Sign Language Translation Dataset (ASL-English)

Language:PythonNOASSERTION54 6 8

FS-Detection

Code for paper "Fingerspelling detection in American Sign Language"

Language:Python18 2 1

asl-iter-attn

ASL Fingerspelling recognition in the wild

Language:Python12 20

A2W-Segmental

Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings (SLT'2021)

Language:Python2 20