Ahsen Khaliq's repositories
projected_gan
[NeurIPS'21] Projected GANs Converge Faster
ABINet
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
chm
Official PyTorch Implementation of Convolutional Hough Matching Networks, CVPR 2021 (oral)
DeepLearningExamples
Deep Learning Examples
fastai
The fastai deep learning library
frame-interpolation
FILM: Frame Interpolation for Large Motion, In arXiv 2022.
IBN-Net
Instance-Batch Normalization Networks (ECCV2018)
it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
keras-io
Keras documentation, hosted live at keras.io
magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language
manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
MEAL-V2
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
poolformer
PoolFormer: MetaFormer is Actually What You Need for Vision
Pytorch-HarDNet
35% faster than ResNet: Harmonic DenseNet, A low memory traffic network
RealBasicVSR
Official repository of "RealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution"
start-machine-learning
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2022 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
StyleNeRF
This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"
StyleSwin
[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation
TokenCut
pytorch implementation of "Self-supervised transformers for unsupervised object discovery using normalized cut"
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
YOLOP
You Only Look Once for Panopitic Driving Perception.(https://arxiv.org/abs/2108.11250)
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/