MichaelHL-ai's repositories
OCR_EVALUATION
Modified from the ICDAR code for evaluating OCR results (detection or end-to-end recognition). For more details, please see my blog:
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
mmyolo
OpenMMLab YOLO series toolbox and benchmark
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
DECA
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
allennlp
An open-source NLP research library, built on PyTorch.
Oscar
Oscar and VinVL
jieba
Jieba Chinese word segmentation
Deep3DFaceReconstruction
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
SEAN
SEAN: Image Synthesis with Semantic Region-Adaptive Normalization (CVPR 2020, Oral)
face-parsing.PyTorch
Using modified BiSeNet for face parsing in PyTorch
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
BFM_to_FLAME
Convert from Basel Face Model (BFM) to the FLAME head model
setup
Set up a new machine without sudo!
Dynamic-convolution-Pytorch
PyTorch implementation of dynamic 3D/2D convolution, with accuracy results for some models.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
DBNet.pytorch
A PyTorch re-implementation of "Real-time Scene Text Detection with Differentiable Binarization"
cocoapi
COCO API - Dataset @ http://cocodataset.org/
Alexnet-tensorflow
TensorFlow implementation of AlexNet (resnetv1, mobilenet), with training and testing on the Kaggle Dogs vs. Cats dataset
tensorflow_PSENet
A TensorFlow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network. My blog:
opencv
Open Source Computer Vision Library
Stronger-yolo
🔥 Improve YOLO with the latest papers