fireae's repositories
ApraPipes
A pipeline framework for developing video and image processing applications. Supports multiple GPUs and machine learning toolkits.
CAN
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
Chinese-Minority-PLM
CINO: Pre-trained Language Models for Chinese Minority Languages
clipsmm
C++ Binding for CLIPS Rules Engine
Dango-Translator
Dango Translator: an OCR-based translator built as a personal hobby project
devstream
DevStream: the open-source DevOps toolchain manager (DTM).
diffgram
Training Data (Data Labeling, Annotation, Workflow) for all Data Types (Image, Video, 3D, Text, Geo, Audio, more) at scale.
docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
dolt
Dolt – It's Git for Data
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
fauxpilot
FauxPilot - an open-source GitHub Copilot server
flameshot
Powerful yet simple to use screenshot software 🖥️ 📸
geogebra
GeoGebra apps (mirror)
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
laf
laf.js is an open source BaaS framework, which provides cloud database, cloud functions, file storage and other capabilities. Front-end developers become full-stack developers in seconds.
lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
OpenBoard
OpenBoard is a cross-platform interactive whiteboard application intended for use in a classroom setting.
SAN
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
Sphere
Web-scale retrieval for knowledge-intensive NLP
Stark
[ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking
TextBPN-Plus-Plus
Arbitrary Shape Text Detection via Boundary Transformer
TurboTransformers
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, Decoders, etc.) on CPU and GPU.
UIE
Unified Structure Generation for Universal Information Extraction
ViBERTgrid-PyTorch
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
YOLOv4-pytorch
A PyTorch implementation of YOLOv4, attentive YOLOv4 and MobileNet YOLOv4 with PASCAL VOC and COCO
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors