Wini1680

Geek Repo

Location: Xiamen

Wini1680's starred repositories

e9x-fun

Super fun

Language: JavaScript | License: MIT | Stargazers: 65 | Issues: 0

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Stargazers: 419 | Issues: 0

asr_nlp_paper_code

Papers of ASR, Tools of ASR

License: MIT | Stargazers: 36 | Issues: 0

early-stopping-pytorch

Early stopping for PyTorch

Language: Jupyter Notebook | License: MIT | Stargazers: 1204 | Issues: 0

RawGAT-ST-antispoofing

This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection" (https://arxiv.org/abs/2107.12710) published in the ASVspoof 2021 workshop.

Language: Python | License: MIT | Stargazers: 62 | Issues: 0

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language: Python | License: MIT | Stargazers: 891 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 3798 | Issues: 0

bash-tutorial

Bash tutorial

Language: Shell | Stargazers: 4180 | Issues: 0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | Stargazers: 13862 | Issues: 0

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language: Python | License: CC-BY-4.0 | Stargazers: 998 | Issues: 0

Language: Python | Stargazers: 164 | Issues: 0

world-vocoder

A high-quality speech analysis, manipulation and synthesis system

Language: C++ | License: NOASSERTION | Stargazers: 1 | Issues: 0

RIR-Generator

Generating room impulse responses

Language: C++ | License: MIT | Stargazers: 406 | Issues: 0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 39645 | Issues: 0

PytorchOCR

A PyTorch-based OCR toolkit that supports common text detection and recognition algorithms

Language: Python | Stargazers: 1302 | Issues: 0

C-OCR

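C-OCR is Ctrip's in-house OCR project. It mainly covers recognition of travel-related documents and materials such as ID cards, passports, train tickets, and visas. The project has four parts: rejection, detection, recognition, and post-processing.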

Language: Java | Stargazers: 2364 | Issues: 0

awesome-deep-text-detection-recognition

A curated list of resources for text detection/recognition (optical character recognition) with deep learning methods.

License: Apache-2.0 | Stargazers: 2496 | Issues: 0

kiss

Code for the paper "KISS: Keeping it Simple for Scene Text Recognition"

Language: Python | License: GPL-3.0 | Stargazers: 110 | Issues: 0

DUP-ocropy

Python-based tools for document analysis and OCR

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3397 | Issues: 0

english-words

:memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

Language: Python | License: Unlicense | Stargazers: 10181 | Issues: 0

albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Language: Python | License: MIT | Stargazers: 13616 | Issues: 0

research-charnet

CharNet: Convolutional Character Networks

Language: Python | License: NOASSERTION | Stargazers: 611 | Issues: 0

Language: Vim script | Stargazers: 1 | Issues: 0

customs_cvat_anno

CVAT annotation of customs data

Stargazers: 1 | Issues: 0

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 856 | Issues: 0

machine-learning-notes

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links

Language: Jupyter Notebook | Stargazers: 8277 | Issues: 0

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language: TypeScript | License: MIT | Stargazers: 11643 | Issues: 0

masr

Mandarin Automatic Speech Recognition

Language: Python | Stargazers: 1828 | Issues: 0

tensorflow_PSENet

This is a TensorFlow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network. My blog:

Language: C++ | License: MIT | Stargazers: 492 | Issues: 0

PSENet

Official Pytorch implementations of PSENet.

Language: Python | License: Apache-2.0 | Stargazers: 1166 | Issues: 0