WeiHaoran's starred repositories
SMT-plusplus
Official implementation of the Sheet Music Transformer ++
olimpic-icdar24
Practical End-to-End Optical Music Recognition for Pianoform Music
UnrealText
Synthetic Scene Text from 3D Engines
Vary-tiny-600k
Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)
mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
CornerAffinity
[IJCAI2022] Corner Affinity: A Robust Grouping Algorithm to Make Corner-guided Detector Great Again
HumanLiker
[NeurIPS2022 spotlight]HumanLiker: A Human-like Object Detector to Model the Manual Labeling Process
CCNet-Pure-Pytorch
Criss-Cross Attention (2d&3d) for Semantic Segmentation in pure Pytorch with a faster and more precise implementation.
Aircraft-KP
Keypoint dataset for airplane
Object-Detection-Metrics
Most popular metrics used to evaluate object detection algorithms.