ALIOSKUPER's starred repositories
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
mmdetection
OpenMMLab Detection Toolbox and Benchmark
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
ImageMagick
🧙♂️ ImageMagick 7
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
transformers-code
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
RapidStructure
版面分析 | 表格识别 | 文档方向分类
OpenCV-Projects
OpenCV projects using python
Data-for-LaTeX_OCR
LaTeX OCR 的数据仓库
ExtractRect
find the largest rectangle inscribed in a non-convex polygon
SuperCLUE-Math6
SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅
Simple-LaTeX-OCR
Large scale training of Latex formula recognition model, currently being organized and open source
LaTeX-OCR_Helper
用于文献阅读时提取Latex表达式的小工具