Wini1680's starred repositories
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
asr_nlp_paper_code
Papers of ASR, Tools of ASR
early-stopping-pytorch
Early stopping for PyTorch
RawGAT-ST-antispoofing
This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.org/abs/2107.12710) published in the ASVspoof 2021 workshop.
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
bash-tutorial
Bash 教程
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
world-vocoder
A high-quality speech analysis, manipulation and synthesis system
RIR-Generator
Generating room impulse responses
PytorchOCR
基于Pytorch的OCR工具库,支持常用的文字检测和识别算法
awesome-deep-text-detection-recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
DUP-ocropy
Python-based tools for document analysis and OCR
english-words
:memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
research-charnet
CharNet: Convolutional Character Networks
customs_cvat_anno
cvat annotation of customs data
machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接
tensorflow_PSENet
This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog: