Swall0w

Swall0w's repositories

torchstat

Model analyzer in PyTorch

Language: Python | License: MIT | 1495 11 46

dtrocr

License: MIT | 62 310

alpaca_ja

A Japanese translation of the Alpaca dataset.

License: Apache-2.0 | 100

cougar

PyTorch deep learning Vision library for fast prototyping

Language: Python | License: MIT | 1 20

SparseR-CNN

End-to-End Object Detection with Learnable Proposal, CVPR2021

Language: Python | License: MIT | 1 10

Swall0w

License: MIT | 1 20

tide

A General Toolbox for Identifying Object Detection Errors

Language: Python | License: MIT | 010

yabumi

Language: Python | License: MIT | 010

CMaskTrack-RCNN

Code for CMaskTrack R-CNN (proposed in Occluded Video Instance Segmentation)

Language: Python | License: Apache-2.0 | 010

ConformerViT

ViT + Conformer = ( ͡❛ ͜ʖ ͡❛)👌

Language: Python | 000

cycle-diffusion

Code for "Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance"

Language: Python | License: NOASSERTION | 000

deep-text-recognition-benchmark

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

Language: Jupyter Notebook | License: Apache-2.0 | 000

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language: Python | License: Apache-2.0 | 000

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python | License: MIT | 000

DPText-DETR

[AAAI 2023 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer

Language: Python | License: NOASSERTION | 000

FastChat

The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"

Language: Python | License: Apache-2.0 | 000

IFC

Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)

Language: Python | License: Apache-2.0 | 000

norfair

Lightweight Python library for adding real-time object tracking to any detector.

Language: Python | License: BSD-3-Clause | 000

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language: Python | License: NOASSERTION | 000

SEED

Language: Python | 000

SeqFormer

SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation

Language: Python | License: NOASSERTION | 010

SLTnet

Spatio-temporal object detector

Language: Python | 010

SRFormer-Text-Det

Language: Python | 000

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | 000

Swall0w.github.io

Language: HTML | License: GPL-3.0 | 010

TeViT

Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022 Oral

Language: Python | License: MIT | 000

TextZoom

A super-resolution dataset of paired LR-HR scene text images

Language: Python | 000

TransVTSpotter

A new video text spotting framework with Transformer

Language: Python | 000

unilm

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

Language: Python | License: MIT | 010

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language: Python | License: MIT | 010