hubeibei007

followers

following

stars

hubeibei007's repositories

3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

Language:PythonApache-2.0020

ACA-Slides

Slides and Code for "An Introduction to Audio Content Analysis," also taught at Georgia Tech as MUSI-6201 - Computational Music Analysis. This introductory course on Music Information Retrieval is based on the text book "An Introduction to Audio Content Analysis", Wiley 2012

Language:TeXNOASSERTION020

acoustid-index

Minimalistic search engine used by AcoustID for searching in audio fingerprints

Language:C++NOASSERTION020

AdvancedEAST

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.

Language:PythonMIT020

AttentionBasedProsodyPrediction

Encoder and Decoder and Attention Based Prosody Prediction

Language:Python000

awesome-deep-learning-music

List of articles related to deep learning applied to music

Language:TeXMIT000

awesome-ocr

A curated list of promising OCR resources

MIT000

caffe

Caffe: a fast open framework for deep learning.

Language:C++NOASSERTION000

caffe_ocr

主流ocr算法研究实验性的项目，目前实现了CNN+BLSTM+CTC架构

Language:C++000

Chinese-Names-Corpus

中文人名语料库。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。

Apache-2.0000

chinese-ocr

运用tensorflow实现自然场景文字检测,keras/pytorch实现crnn+ctc实现不定长中文OCR识别

Language:Python000

chinese_ocr

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

Language:PythonApache-2.0000

crnn-study

crnn study with attention

Language:Python000

ctw-baseline

Baseline methods for [CTW dataset](https://ctwdataset.github.io/)

Language:Python020

das2018-tutorial

A tutorial on the PyTorch-based ocropus components.

Language:Jupyter Notebook000

deep-learning-benchmark

Deep Learning Benchmark for comparing the performance of DL frameworks, GPUs, and single vs half precision

Language:Python020

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

Language:Python000

DeepAudioClassification

Finding the genre of a song with Deep Learning

Language:Python000

DeepHashingBaselines

Deep Hashing Baselines

000

DSS

code for "Deeply supervised salient object detection with short connections" published in CVPR 2017

Language:Python000

fma

FMA: A Dataset For Music Analysis

Language:Jupyter NotebookMIT000

HashNet

Code release for "HashNet: Deep Learning to Hash by Continuation" (ICCV 2017)

Language:C++MIT000

huxpro.github.io

My Blog / Jekyll Themes / PWA

Language:CSSApache-2.0000

MDLSTM_CV

020

mtcnn

mtcnn in python

Language:PythonMIT020

notes-linear-algebra

线性代数笔记

Language:Jupyter Notebook020

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonApache-2.0000

pydata-notebook

利用Python进行数据分析第二版 (2017) 中文翻译笔记

Language:Jupyter Notebook020

speaker_adapted_tts

Making a TTS model with 1 minute of speech samples within 10 minutes

010

warp-ctc

Pytorch Bindings for warp-ctc

Language:CudaApache-2.0000