voithru's repositories
voice-activity-detection
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
wav2vec2_finetune
Wav2Vec2 finetune and inference code for IITP AI Grand Challenge
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
AI_dubbing
ai dubbing
PPOCRLabel
PPOCRLabel is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PPOCR model to automatically detect and re-recognize data. It is written in python3 and pyqt5, supporting rectangular box annotation and four-point annotation modes. Annotations can be directly used for the training of PPOCR detection and recognition models.
Seq2Seq-PyTorch
Sequence to Sequence Models with PyTorch
TextFuseNet
A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
vecalign
Improved Sentence Alignment in Linear Time and Space
autoscaler
Autoscaling components for Kubernetes
ChainForge
An open-source visual programming environment for battle-testing prompts to LLMs.