Ruthvik Vaila's repositories
Audio-Classification-HF
Audio emotion recognition using Huggingface Library
CenterPose
Single-stage Keypoint-based Category-level Object Pose Estimation from an RGB Image
DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Deep-Learning-Experiments
Videos, notes and experiments to understand deep learning
ECE-697-Fall-2022
ECE-697-Fall-2022
glasses
High-quality Neural Networks for Computer Vision 😎
hiertext
The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and paragraph level annotations.
instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
langchain-tutorials
Overview and tutorial of the LangChain Library
logpiles_segmentation
Code repository for paper Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
mlforecast
Scalable machine learning based time series forecasting.
mvts_transformer
Multivariate Time Series Transformer, public version
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
pan_pp.pytorch
Official implementations of PSENet, PAN and PAN++.
parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
python-patterns
A collection of design patterns/idioms in Python
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Scene-Text-Recognition-Recommendations
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
SF-Mask-RCNN
Synthetic RGB-D Fusion (SF) Mask R-CNN for Unseen Object Instance Segmentation
text-generation-webui
A Gradio web UI for Large Language Models.
Total-Text-Dataset
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.