So Uchida (S-aiueo32)

Company: Sansan Inc.

Location: Tokyo, Japan

Twitter: @s_aiueo32

So Uchida's starred repositories

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49010 | Issues: 0

GiT

Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Language: Python | License: Apache-2.0 | Stargazers: 220 | Issues: 0

desigen

Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]

Language: Python | Stargazers: 40 | Issues: 0

deepdoctection

A Repo For Document AI

Language: Python | License: Apache-2.0 | Stargazers: 2303 | Issues: 0

SPTSv2

The official implementation of SPTS v2: Single-Point Text Spotting

Language: Python | License: Apache-2.0 | Stargazers: 119 | Issues: 0

calibration-framework

The net:cal calibration framework is a Python 3 library for measuring and mitigating miscalibration of uncertainty estimates, e.g., by a neural network.

Language: Python | License: Apache-2.0 | Stargazers: 323 | Issues: 0

manim

Animation engine for explanatory math videos

Language: Python | License: MIT | Stargazers: 59483 | Issues: 0

vscode-extension-samples

Sample code illustrating the VS Code extension API.

Language: TypeScript | License: MIT | Stargazers: 8291 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 29 | Issues: 0

Language: Python | Stargazers: 32 | Issues: 0

ElasticDiffusion-official

The official PyTorch implementation of ElasticDiffusion: Training-free Arbitrary Size Image Generation (CVPR 2024)

Language: Python | Stargazers: 119 | Issues: 0

CAMixerSR

CAMixerSR: Only Details Need More “Attention” (CVPR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 149 | Issues: 0

Stargazers: 4 | Issues: 0

Handwriting-Transformers

Handwriting-Transformers (ICCV21)

Language: Python | License: MIT | Stargazers: 159 | Issues: 0

Language: Python | License: MIT | Stargazers: 66 | Issues: 0

convolutional-handwriting-gan

ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)

Language: Python | License: MIT | Stargazers: 258 | Issues: 0

ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023 Spotlight)

Language: Python | License: NOASSERTION | Stargazers: 604 | Issues: 0

DocTr-Plus

The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.

Language: Python | License: MIT | Stargazers: 365 | Issues: 0

EDiffSR

[IEEE TGRS 2024] EDiffSR: An Efficient Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution

Language: Python | Stargazers: 97 | Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 8604 | Issues: 0

small-object-detection-benchmark

ICIP 2022 paper: SAHI benchmark on the VisDrone and xView datasets using FCOS, VFNet, and TOOD detectors

Language: Python | License: MIT | Stargazers: 147 | Issues: 0

sahi_batched

SAHI batched inference (YOLOv8 only)

Language: Python | License: MIT | Stargazers: 6 | Issues: 0

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 715 | Issues: 0

vrdu

We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datasets that represent several challenges: rich schema including diverse data types, complex templates, and diversity of layouts within a single document type.

Stargazers: 67 | Issues: 0

CCD

[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition

Language: Python | Stargazers: 135 | Issues: 0

groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Language: Python | Stargazers: 611 | Issues: 0

GPT-4V_OCR

Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

Language: Python | Stargazers: 106 | Issues: 0

DocProj

Document Rectification and Illumination Correction using a Patch-based CNN

Language: Python | License: MIT | Stargazers: 324 | Issues: 0

SinSR

[CVPR 2024] SinSR: Diffusion-Based Image Super-Resolution in a Single Step

Language: Python | License: NOASSERTION | Stargazers: 162 | Issues: 0

SeeSR

[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Language: Python | License: Apache-2.0 | Stargazers: 311 | Issues: 0