jc-hou's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | Apache-2.0 | 30833 315 890

PlotNeuralNet

Latex code for making neural networks diagrams

Language: TeX | MIT | 21605 227 118

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | MIT | 19204 297 1337

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | MIT | 11108 89 341

OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Language: Python | MIT | 6677 176 1443

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language: Python | NOASSERTION | 5452 115 652

awesome-tips

MIT | 3336 98 4

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language: Python | MIT | 2717 24 187

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Language: Python | 2565 24 96

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Language: Python | Apache-2.0 | 1941 41 78

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | MIT | 1844 32 160

attention-transfer

Improving Convolutional Networks via Attention Transfer (ICLR 2017)

Language: Jupyter Notebook | 1429 50 27

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python | NOASSERTION | 1271 16 118

Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

1014 36 5

pytorch-domain-adaptation

A collection of implementations of adversarial domain adaptation algorithms

Language: Python | MIT | 596 12 11

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language: Python | MIT | 355 10 34

salad

A toolbox for domain adaptation and semi-supervised learning. Contributions welcome.

Language: HTML | MPL-2.0 | 332 16 36

UniVL

An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Language: Python | MIT | 332 10 44

graph-neural-pde

Graph Neural PDEs

Language: Jupyter Notebook | Apache-2.0 | 310 12 13

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

Language: Python | NOASSERTION | 210 10 31

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

200 13 2

ZeroShotVideoClassification

Zero-shot video classification by end-to-end training of 3D convolutional neural networks

Language: Python | Apache-2.0 | 144 10 7

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language: Python | MIT | 142 40

code2pdf

Convert your source code to PDF

Language: Ruby | NOASSERTION | 116 2 17

jean-zay-doc

Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation: http://www.idris.fr/eng/jean-zay/

MIT | 100 13 40

pytorch-VideoDataset

Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.

Language: Python | MIT | 68 3 2

C2D

PyTorch implementation of "Contrast to Divide: self-supervised pre-training for learning with noisy labels"

Language: Python | MIT | 67 3 6

audio-visual

Language: C | MIT | 56 11 10

AtSNE

Anchor-t-SNE for large-scale and high-dimension vector visualization

Language: Cuda | 55 20

BSSE-SE

Boosting Self-Supervised Embeddings for Speech Enhancement

Language: Python | MIT | 37 1 1