So Uchida (S-aiueo32)

S-aiueo32

Geek Repo

Company:Sansan Inc.

Location:Tokyo, Japan

Twitter:@s_aiueo32

Github PK Tool:Github PK Tool

So Uchida's starred repositories

Language:PythonLicense:MITStargazers:55Issues:0Issues:0

VizWiz2024-VQA-AnswerTherapy

[2024VizWiz] Vision-Language Model-based PolyFormer for Recognizing Visual Questions with Multiple Answer Groundings

Language:PythonStargazers:2Issues:0Issues:0

content-debiased-fvd

[CVPR 2024] On the Content Bias in Fréchet Video Distance

Language:PythonLicense:MITStargazers:34Issues:0Issues:0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:339Issues:0Issues:0

segment-caption-anything

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gradio demo that show how to use the model.

Language:PythonLicense:Apache-2.0Stargazers:164Issues:0Issues:0

Hi-SAM

[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

Language:PythonLicense:Apache-2.0Stargazers:142Issues:0Issues:0

YOLO

An MIT rewrite of YOLOv9

Language:PythonLicense:MITStargazers:231Issues:0Issues:0

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonLicense:AGPL-3.0Stargazers:7264Issues:0Issues:0

CRVQA2024

Visual question answering with rationales

License:MITStargazers:3Issues:0Issues:0

DiffTSR

[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)

Language:PythonStargazers:22Issues:0Issues:0

DeqIR

PyTorch implementation of "Deep Equilibrium Diffusion Restoration with Parallel Sampling (CVPR 2024)"

Language:PythonStargazers:56Issues:0Issues:0

ESTextSpotter

(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

Language:PythonStargazers:70Issues:0Issues:0

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonLicense:MITStargazers:3285Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13135Issues:0Issues:0

extension-cpp

C++ extensions in PyTorch

Language:PythonStargazers:952Issues:0Issues:0

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1477Issues:0Issues:0

awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

Stargazers:1163Issues:0Issues:0

CCSR

Official codes of CCSR: Improving the Stability of Diffusion Models for Content Consistent Super-Resolution

Language:PythonStargazers:361Issues:0Issues:0

SwinTextSpotter

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

Language:PythonStargazers:258Issues:0Issues:0

Bridging-Text-Spotting

(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.

Language:PythonLicense:NOASSERTIONStargazers:34Issues:0Issues:0

torchdynamo

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Language:PythonLicense:BSD-3-ClauseStargazers:977Issues:0Issues:0

manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/

Language:PythonLicense:GPL-3.0Stargazers:4517Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:21926Issues:0Issues:0
Language:PythonStargazers:63Issues:0Issues:0

DPText-DETR

[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer

Language:PythonLicense:NOASSERTIONStargazers:161Issues:0Issues:0

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1483Issues:0Issues:0
Language:PythonStargazers:125Issues:0Issues:0

tao_pytorch_backend

TAO Toolkit deep learning networks with PyTorch backend

Language:PythonLicense:Apache-2.0Stargazers:71Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:17389Issues:0Issues:0

doc3D-dataset

A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)

Language:ShellLicense:MITStargazers:154Issues:0Issues:0