jc-hou

jc-hou

Geek Repo

Github PK Tool:Github PK Tool

jc-hou's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:30833Issues:315Issues:890

PlotNeuralNet

Latex code for making neural networks diagrams

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19204Issues:297Issues:1337

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:11108Issues:89Issues:341

OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Language:PythonLicense:MITStargazers:6677Issues:176Issues:1443

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonLicense:NOASSERTIONStargazers:5452Issues:115Issues:652

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language:PythonLicense:MITStargazers:2717Issues:24Issues:187

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Language:PythonLicense:Apache-2.0Stargazers:1941Issues:41Issues:78

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:1844Issues:32Issues:160

attention-transfer

Improving Convolutional Networks via Attention Transfer (ICLR 2017)

Language:Jupyter NotebookStargazers:1429Issues:50Issues:27

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1271Issues:16Issues:118

Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

pytorch-domain-adaptation

A collection of implementations of adversarial domain adaptation algorithms

Language:PythonLicense:MITStargazers:596Issues:12Issues:11

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language:PythonLicense:MITStargazers:355Issues:10Issues:34

salad

A toolbox for domain adaptation and semi-supervised learning. Contributions welcome.

Language:HTMLLicense:MPL-2.0Stargazers:332Issues:16Issues:36

UniVL

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Language:PythonLicense:MITStargazers:332Issues:10Issues:44

graph-neural-pde

Graph Neural PDEs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:310Issues:12Issues:13

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

Language:PythonLicense:NOASSERTIONStargazers:210Issues:10Issues:31

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

ZeroShotVideoClassification

Zero-shot video classification by end-to-end training of 3D convolutional neural networks

Language:PythonLicense:Apache-2.0Stargazers:144Issues:10Issues:7

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonLicense:MITStargazers:142Issues:4Issues:0

code2pdf

Convert your source code to PDF

Language:RubyLicense:NOASSERTIONStargazers:116Issues:2Issues:17

jean-zay-doc

Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation: http://www.idris.fr/eng/jean-zay/

pytorch-VideoDataset

Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.

Language:PythonLicense:MITStargazers:68Issues:3Issues:2

C2D

PyTorch implementation of "Contrast to Divide: self-supervised pre-training for learning with noisy labels"

Language:PythonLicense:MITStargazers:67Issues:3Issues:6

AtSNE

Anchor-t-SNE for large-scale and high-dimension vector visualization

Language:CudaStargazers:55Issues:2Issues:0

BSSE-SE

Boosting Self-Supervised Embeddings for Speech Enhancement

Language:PythonLicense:MITStargazers:37Issues:1Issues:1