ShashankKrishnaV

Shashank Krishna Vempati's repositories

COL-783-Digital-Image-Processing-2023

All assignments along with reports

Language: Python

awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

License: CC0-1.0

Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

License: MIT

MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing generation tasks across a diverse set of 29 Indic languages covering 13 scripts and 4 language families.

License: NOASSERTION

Book-Understanding-Deep-Learning

Understanding Deep Learning - Simon J.D. Prince

License: NOASSERTION

open_clip

An open source implementation of CLIP.

License: NOASSERTION

All-Language-OCRs

Model checkpoints are uploaded here

Language: Python; License: MIT

Group-Chat-Video-and-Audio-call

This application integrates three main features: group chat, video calling, and audio calling. Read the documentation before running the files.

Language: Python; License: MIT

TEXTRON

Data Programming for Text Detection in Documents using SPEAR

License: GPL-3.0

hiertext

The HierText dataset contains ~12k images from the Open Images dataset v6 with a large number of text entities. We provide word-, line-, and paragraph-level annotations.

License: CC-BY-SA-4.0

CLIPOCR

urdu-synth

High-quality synthetic text data generation for Urdu Text Recognition

License: Apache-2.0

Scene-Text-Detection

bbocr

License: BSD-3-Clause

PlotNeuralNet

LaTeX code for drawing neural network diagrams

License: MIT

UTRNet-High-Resolution-Urdu-Text-Recognition

UTRNet: High Resolution Multi-scale Feature Maps For Accurate Recognition Of Printed Urdu Text (ICDAR'23)

License: NOASSERTION

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

License: MIT

Scene-Text-Recognition-Recommendations

Papers, datasets, algorithms, and SOTA results for scene text recognition (STR). Maintained long-term.

License: MIT

ml-papers

My collection of machine learning papers

License: MIT

Awesome-Transformer-Attention

A comprehensive paper list for Vision Transformer/Attention, including papers, code, and related websites

OCR-V4-IIITH

Indian Language OCR

License: MIT

New_York_CitiBike-Tableau-challenge

New York CitiBike Tableau

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

License: Apache-2.0

STR_benchmark_cleansed

HLExt-via-IS-LineDetection

Line Extraction in Handwritten Documents via Instance Segmentation

EasyOCR-Reference

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

License: Apache-2.0
