Khawar Islam's repositories
FPVT_BMVC22
Code of Pyramid Vision Transformer at BMVC 2022
khawar-islam.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods.
DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Face-Transformer
Face Transformer for Recognition
khawarislam.github.io
:globe_with_meridians: Jekyll is a blog-aware static site generator in Ruby
MogaNet
Code release for MogaNet: Efficient Multi-order Gated Aggregation Network
pytorch-image-classification-OOD
WOrking on OOD
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
YOCO
Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022.
yolov8_tracking
Real-time multi-object tracking and segmentation using YOLOv8 with DeepOCSORT and OSNet