Wini1680

XiaMen

Wini1680's starred repositories

e9x-fun

Super fun

Language: JavaScript | License: MIT | 6500

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

asr_nlp_paper_code

Papers of ASR, Tools of ASR

License: MIT | 3600

early-stopping-pytorch

Early stopping for PyTorch

Language: Jupyter Notebook | License: MIT | 120400

RawGAT-ST-antispoofing

This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.org/abs/2107.12710) published in the ASVspoof 2021 workshop.

Language: Python | License: MIT | 6200

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language: Python | License: MIT | 89100

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | License: Apache-2.0 | 379800

bash-tutorial

Bash tutorial

Language: Shell | 418000

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | 1386200

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language: Python | License: CC-BY-4.0 | 99800

SEED

Language: Python | 16400

world-vocoder

A high-quality speech analysis, manipulation and synthesis system

Language: C++ | License: NOASSERTION | 100

RIR-Generator

Generating room impulse responses

Language: C++ | License: MIT | 40600

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language: Python | License: Apache-2.0 | 3964500

PytorchOCR

A PyTorch-based OCR toolkit that supports commonly used text detection and recognition algorithms

Language: Python | 130200

C-OCR

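C-OCR is Ctrip's in-house OCR project. It mainly covers recognition of travel-related documents and materials such as ID cards, passports, train tickets, and visas. The project consists of four parts: rejection, detection, recognition, and post-processing.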

Language: Java | 236400

awesome-deep-text-detection-recognition

A curated list of resources for text detection/recognition (optical character recognition) with deep learning methods.

License: Apache-2.0 | 249600

kiss

Code for the paper "KISS: Keeping it Simple for Scene Text Recognition"

Language: Python | License: GPL-3.0 | 11000

DUP-ocropy

Python-based tools for document analysis and OCR

Language: Jupyter Notebook | License: Apache-2.0 | 339700

english-words

A text file containing 479k English words for all your dictionary/word-based projects, e.g. auto-completion / autosuggestion

Language: Python | License: Unlicense | 1018100

albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Language: Python | License: MIT | 1361600

research-charnet

CharNet: Convolutional Character Networks

Language: Python | License: NOASSERTION | 61100

configure

Language: Vim script | 100

customs_cvat_anno

CVAT annotation of customs data

100

PubLayNet

Language: Jupyter Notebook | License: NOASSERTION | 85600

machine-learning-notes

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links

Language: Jupyter Notebook | 827700

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language: TypeScript | License: MIT | 1164300

masr

Mandarin Automatic Speech Recognition

Language: Python | 182800

tensorflow_PSENet

This is a TensorFlow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network. My blog:

Language: C++ | License: MIT | 49200

PSENet

Official PyTorch implementations of PSENet.

Language: Python | License: Apache-2.0 | 116600