a1004123217's repositories
hclust-cpp
C++ fast hierarchical clustering algorithms
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
ANTs
Advanced Normalization Tools (ANTs)
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
daily_fudan
For legacy reasons, automatic updates are hard to implement, so this project is no longer maintained. The new project is at https://github.com/Limour-dev/daily_fudan_actions
daily_fudan_actions
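Runner for daily_fudan that automatically executes the code at https://github.com/Limour-dev/daily_fudan_core on a schedule. Configure FUDAN as described in the daily_fudan_core README, and GH_PAT as described in this repository's README. Both secrets are configured in this repository; take care not to configure them in the core repository.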
dbnet-
An implementation of the mobilenetv3 backbone for dbnet++
DBNet.pytorch
A PyTorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods.
FastSAM
Fast Segment Anything
madmom
Detect beats and downbeats
MeDIT
Medical and Digital Image Toolbox. This project includes some functions for processing medical and digital images.
music
music
pythonlearning
ecnu
pytorch-mobile
mobile vision
TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023)
trocr-chinese
Transformers OCR for Chinese
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Yolo_mark
GUI for marking bounding boxes of objects in images for training Yolo v3 and v2 neural networks