Fragrance307

Fragrance307

Geek Repo

Github PK Tool:Github PK Tool

Fragrance307's starred repositories

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language:PythonLicense:MITStargazers:289Issues:0Issues:0

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language:PythonLicense:MITStargazers:303Issues:0Issues:0
Language:PythonStargazers:812Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29355Issues:0Issues:0

CIF-PyTorch

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Language:PythonLicense:Apache-2.0Stargazers:65Issues:0Issues:0

image-feature-learning-pytorch

PyTorch implementation of Center Loss & Contrastive-Center Loss.

Language:PythonLicense:MITStargazers:63Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13354Issues:0Issues:0

End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Language:PythonLicense:MITStargazers:1174Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8136Issues:0Issues:0

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:13143Issues:0Issues:0

Total-Text-Dataset

Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.

Language:MATLABLicense:BSD-3-ClauseStargazers:736Issues:0Issues:0

AdelaiDet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Language:PythonLicense:NOASSERTIONStargazers:3344Issues:0Issues:0

ucasthesis

LaTeX Thesis Template for the University of Chinese Academy of Sciences

Language:TeXStargazers:3393Issues:0Issues:0

ObjectDetectionImbalance

Lists the papers related to imbalance problems in object detection [TPAMI]

Stargazers:1114Issues:0Issues:0

overhaul-distillation

Official PyTorch implementation of "A Comprehensive Overhaul of Feature Distillation" (ICCV 2019)

Language:PythonLicense:MITStargazers:409Issues:0Issues:0

CHINESE-OCR

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Language:PythonStargazers:2902Issues:0Issues:0

chinese-ocr

基于CTPN(tensorflow)+CRNN(pytorch)+CTC的不定长文本检测和识别

Language:PythonStargazers:297Issues:0Issues:0

Non-local_pytorch

Implementation of Non-local Block.

Language:PythonLicense:Apache-2.0Stargazers:1566Issues:0Issues:0

imutils

A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.

Language:PythonLicense:MITStargazers:4510Issues:0Issues:0

CTPN

Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1282Issues:0Issues:0

awesome-deep-text-detection-recognition

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

License:Apache-2.0Stargazers:2497Issues:0Issues:0

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonLicense:MITStargazers:8630Issues:0Issues:0
Language:PythonStargazers:284Issues:0Issues:0

SceneTextPapers

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized

Stargazers:781Issues:0Issues:0

lstm_ctc_ocr

Use CTC + tensorflow to OCR

Language:PythonStargazers:354Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:29847Issues:0Issues:0

ctcdecode

PyTorch CTC Decoder bindings

Language:C++License:MITStargazers:42Issues:0Issues:0

cocoapi

COCO API - Dataset @ http://cocodataset.org/

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:6018Issues:0Issues:0

E2E-MLT

E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text

Language:C++License:MITStargazers:292Issues:0Issues:0

kornia

Geometric Computer Vision Library for Spatial AI

Language:PythonLicense:Apache-2.0Stargazers:9638Issues:0Issues:0