sinpy戴's repositories
swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
TexTeller-latex
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
GlyphDraw
Text-To-Image Generation with Chinese Characters
GlyphControl
[NeurIPS2023] This is the official inference code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"
LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
trocr-latex
transformers ocr for chinese
JavaGuide
「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
fairscale
PyTorch extensions for high performance and large scale training.
SOD100K
The official repo of the TPAMI 2021/ECCV 2020 work CSNet: A Highly Efficient Model with 100K Parameters to Study the Semantics of Salient Object Detection
pymavlink
python MAVLink interface and utilities
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
ardupilot
ArduPlane, ArduCopter, ArduRover source
RGBD-SOD-datasets
All those partitioned RGB-D Saliency Datasets we collected are shared in ready-to-use manner.
densemapnet
Keras code of my ICRA 2018 paper "Fast Disparity Estimation using Dense Networks"