Shashank Krishna Vempati's repositories
Group-Chat-Video-and-Audio-call
This application integrates three main features: group chat, video calling, and audio calling. Read the documentation before running the files.
All-Language-OCRs
Model checkpoints are uploaded here
Awesome-Transformer-Attention
A comprehensive list of Vision Transformer/Attention papers, with code and related websites
Book-Understanding-Deep-Learning
Understanding Deep Learning - Simon J.D. Prince
COL-783-Digital-Image-Processing-2023
All assignments along with reports
COL780-Computer-Vision-Assignments
IIT Delhi, 2022 Semester 2, taught by Anurag Mittal (IITM Professor)
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods.
doc3D-dataset
A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)
EasyOCR-Reference
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
HLExt-via-IS-LineDetection
Line Extraction in Handwritten Documents via Instance Segmentation
indic-gen-bench
IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing generation tasks across a diverse set of 29 Indic languages covering 13 scripts and 4 language families.
ml-papers
My collection of machine learning papers
New_York_CitiBike-Tableau-challenge
New York CitiBike Tableau
OCR-V4-IIITH
Indian Language OCR
OCRDatasets
A collection of OCR-related datasets
open_clip
An open source implementation of CLIP.
PaperEdge
The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)
PlotNeuralNet
LaTeX code for drawing neural network diagrams
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
ResNeSt
ResNeSt: Split-Attention Networks
sanskrit-ocr
An OCR for classical Sanskrit document images
Scene-Text-Recognition-Recommendations
Papers, datasets, algorithms, and SOTA results for scene text recognition (STR). Maintained long-term.
TEXTRON
Data Programming for Text Detection in Documents using SPEAR
urdu-synth
High-quality synthetic text data generation for Urdu Text Recognition
UTRNet-High-Resolution-Urdu-Text-Recognition
UTRNet: High Resolution Multi-scale Feature Maps For Accurate Recognition Of Printed Urdu Text (ICDAR'23)
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch