jc-hou

jc-hou

Geek Repo

Github PK Tool:Github PK Tool

jc-hou's starred repositories

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1337Issues:0Issues:0

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Language:PythonLicense:Apache-2.0Stargazers:1991Issues:0Issues:0

graph-neural-pde

Graph Neural PDEs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:314Issues:0Issues:0

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

Language:PythonLicense:NOASSERTIONStargazers:219Issues:0Issues:0

BSSE-SE

Boosting Self-Supervised Embeddings for Speech Enhancement

Language:PythonLicense:MITStargazers:44Issues:0Issues:0
Language:CLicense:MITStargazers:57Issues:0Issues:0

Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

Stargazers:1078Issues:0Issues:0

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

Stargazers:202Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:1916Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19613Issues:0Issues:0

pytorch-domain-adaptation

A collection of implementations of adversarial domain adaptation algorithms

Language:PythonLicense:MITStargazers:605Issues:0Issues:0

salad

A toolbox for domain adaptation and semi-supervised learning. Contributions welcome.

Language:HTMLLicense:MPL-2.0Stargazers:333Issues:0Issues:0

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Language:PythonStargazers:2590Issues:0Issues:0

code2pdf

Convert your source code to PDF

Language:RubyLicense:NOASSERTIONStargazers:117Issues:0Issues:0
License:MITStargazers:3389Issues:0Issues:0

UniVL

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Language:PythonLicense:MITStargazers:336Issues:0Issues:0

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language:PythonLicense:MITStargazers:357Issues:0Issues:0

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonLicense:MITStargazers:149Issues:0Issues:0

AtSNE

Anchor-t-SNE for large-scale and high-dimension vector visualization

Language:CudaStargazers:55Issues:0Issues:0

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:11874Issues:0Issues:0

C2D

PyTorch implementation of "Contrast to Divide: self-supervised pre-training for learning with noisy labels"

Language:PythonLicense:MITStargazers:68Issues:0Issues:0

pytorch-VideoDataset

Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.

Language:PythonLicense:MITStargazers:70Issues:0Issues:0

OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Language:PythonLicense:MITStargazers:6726Issues:0Issues:0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:31646Issues:0Issues:0

jean-zay-doc

Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation: http://www.idris.fr/eng/jean-zay/

License:MITStargazers:108Issues:0Issues:0

PlotNeuralNet

Latex code for making neural networks diagrams

Language:TeXLicense:MITStargazers:21899Issues:0Issues:0

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonLicense:NOASSERTIONStargazers:5485Issues:0Issues:0

ZeroShotVideoClassification

Zero-shot video classification by end-to-end training of 3D convolutional neural networks

Language:PythonLicense:Apache-2.0Stargazers:145Issues:0Issues:0

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language:PythonLicense:MITStargazers:2750Issues:0Issues:0

attention-transfer

Improving Convolutional Networks via Attention Transfer (ICLR 2017)

Language:Jupyter NotebookStargazers:1438Issues:0Issues:0