iredescentone's starred repositories
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
FastWhisper
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
Relation-Networks-for-Object-Detection
Relation Networks for Object Detection
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
PAN.pytorch
An unofficial PyTorch implementation of PAN (PSENet2): Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
pytorch-lamb
Implementation of https://arxiv.org/abs/1904.00962
ChatIE
The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set a default OpenAI key. If the keys exceed the plan and become invalid, please tell us. The response speed depends on OpenAI (sometimes the official service is too crowded and slow).
RefineMask
RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features (CVPR 2021)
chargrid-pytorch
PyTorch implementation of the Chargrid paper (https://arxiv.org/abs/1809.08799)
Medical-table
Table parsing for medical examination reports, with table recognition done via k-means
DocumentInformationExtraction
Key Information Extraction From Documents: Evaluation And Generator