Ahsen Khaliq's repositories
projected_gan
[NeurIPS'21] Projected GANs Converge Faster
ABINet
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
DeepLearningExamples
Deep Learning Examples
fastai
The fastai deep learning library
frame-interpolation
FILM: Frame Interpolation for Large Motion, In arXiv 2022.
IBN-Net
Instance-Batch Normalization Networks (ECCV2018)
it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
keras-io
Keras documentation, hosted live at keras.io
magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language
manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
MEAL-V2
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
poolformer
PoolFormer: MetaFormer is Actually What You Need for Vision
Pytorch-HarDNet
35% faster than ResNet: Harmonic DenseNet, A low memory traffic network
RealBasicVSR
Official repository of "RealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution"
start-machine-learning
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2022 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
StyleNeRF
This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"
StyleSwin
[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation
TokenCut
pytorch implementation of "Self-supervised transformers for unsupervised object discovery using normalized cut"
UniFormer
[ICLR2022] official implementation of UniFormer
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
YOLOP
You Only Look Once for Panopitic Driving Perception.(https://arxiv.org/abs/2108.11250)
yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/