chevalierNoir

chevalierNoir

Geek Repo

Github PK Tool:Github PK Tool

chevalierNoir's starred repositories

markdown-here

Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.

Language:JavaScriptLicense:MITStargazers:59649Issues:1014Issues:621

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:20606Issues:154Issues:266

dask

Parallel computing with task scheduling

Language:PythonLicense:BSD-3-ClauseStargazers:12599Issues:212Issues:5211

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8926Issues:135Issues:1101

awesome-self-supervised-learning

A curated list of awesome self-supervised methods

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2268Issues:46Issues:398

pytorch-openpose

pytorch implementation of openpose including Hand and Body Pose Estimation.

Language:Jupyter NotebookStargazers:2111Issues:25Issues:78

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Language:PythonLicense:NOASSERTIONStargazers:1555Issues:27Issues:129

mt3

MT3: Multi-Task Multitrack Music Transcription

Language:PythonLicense:Apache-2.0Stargazers:1440Issues:27Issues:91

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonLicense:NOASSERTIONStargazers:848Issues:15Issues:111

genmusic_demo_list

a list of demo websites for automatic music generation research

Ego4d

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Language:Jupyter NotebookLicense:MITStargazers:359Issues:22Issues:167

muavic

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Language:PythonLicense:NOASSERTIONStargazers:359Issues:13Issues:23

SpeechTransProgress

Tracking the progress in end-to-end speech translation

Multi-Modal-Transformer

The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.

OpenASL

A Large-Scale Open-Domain Sign Language Translation Dataset (ASL-English)

Language:PythonLicense:NOASSERTIONStargazers:54Issues:6Issues:8

FS-Detection

Code for paper "Fingerspelling detection in American Sign Language"

asl-iter-attn

ASL Fingerspelling recognition in the wild

Language:PythonStargazers:12Issues:2Issues:0

A2W-Segmental

Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings (SLT'2021)

Language:PythonStargazers:2Issues:2Issues:0