Max (xtyrrell)

xtyrrell

Geek Repo

Location:Cape Town, South Africa

Home Page:xtyrrell.com

Twitter:@xtyrrell_com

Github PK Tool:Github PK Tool


Organizations
dadoagency
Flipper0x

Max's starred repositories

MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Language:PythonStargazers:340Issues:0Issues:0

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Language:PythonLicense:Apache-2.0Stargazers:4143Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:3374Issues:0Issues:0

react-native-document-scanner-android

Document scanner android, feature live detection, auto-capture, perspective correction :vibration_mode: :camera: -- :trophy:

Language:JavaLicense:MITStargazers:81Issues:0Issues:0

atopile

Design circuit boards with code! ✨ Get software-like design reuse 🚀, validation, version control and collaboration in hardware; starting with electronics ⚡️

Language:PythonLicense:Apache-2.0Stargazers:1793Issues:0Issues:0

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonLicense:MITStargazers:5443Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:6776Issues:0Issues:0

tesseract.js-video

An example app to recognize video clip with tesseract.js

Language:HTMLStargazers:116Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17347Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:11786Issues:0Issues:0

BlurDetection2

Blur Detection with OpenCV in Python

Language:PythonLicense:MITStargazers:328Issues:0Issues:0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:8253Issues:0Issues:0

rembg-trainer

Code to train U2Net model for use with rembg tool

Language:PythonLicense:MITStargazers:44Issues:0Issues:0

rembg

Rembg is a tool to remove images background

Language:PythonLicense:MITStargazers:15071Issues:0Issues:0

background-removal-js

Remove backgrounds from images directly in the browser environment with ease and no additional costs or privacy concerns. Explore an interactive demo.

Language:TypeScriptLicense:AGPL-3.0Stargazers:5377Issues:0Issues:0

DIS

This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2044Issues:0Issues:0

U-2-Net

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Language:PythonLicense:Apache-2.0Stargazers:8237Issues:0Issues:0

backgroundremover

Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.

Language:PythonLicense:MITStargazers:6384Issues:0Issues:0

outlines

Structured Text Generation

Language:PythonLicense:Apache-2.0Stargazers:6525Issues:0Issues:0

page-dewarp

Document image dewarping library using a cubic sheet model

Language:PythonLicense:MITStargazers:91Issues:0Issues:0

DewarpNet

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Language:PythonLicense:MITStargazers:466Issues:0Issues:0

tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

Language:JavaScriptLicense:Apache-2.0Stargazers:33927Issues:0Issues:0

tesseract.js-typescript

An example to use tesseract.js in typescript

Language:TypeScriptStargazers:7Issues:0Issues:0

react-native-opencv3

react-native-opencv3 wraps functionality from OpenCV Java SDK 3.4.4 + contrib modules and iOS OpenCV 3.4.1 + contrib modules for use in React-Native apps. Please enjoy!

Language:Objective-C++License:NOASSERTIONStargazers:180Issues:0Issues:0

opencv-js

OpenCV JavaScript version for node.js or browser

Language:TypeScriptLicense:Apache-2.0Stargazers:343Issues:0Issues:0

nextjs-github-pages

🚀 Deploy a Next.js app to GitHub Pages via GitHub Actions.

Language:TypeScriptLicense:MITStargazers:374Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:39630Issues:0Issues:0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:8538Issues:0Issues:0
Language:PythonLicense:MITStargazers:10Issues:0Issues:0