hubeibei007's repositories
3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
ACA-Slides
Slides and Code for "An Introduction to Audio Content Analysis," also taught at Georgia Tech as MUSI-6201 - Computational Music Analysis. This introductory course on Music Information Retrieval is based on the text book "An Introduction to Audio Content Analysis", Wiley 2012
acoustid-index
Minimalistic search engine used by AcoustID for searching in audio fingerprints
AdvancedEAST
AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.
AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
awesome-deep-learning-music
List of articles related to deep learning applied to music
awesome-ocr
A curated list of promising OCR resources
caffe
Caffe: a fast open framework for deep learning.
caffe_ocr
主流ocr算法研究实验性的项目,目前实现了CNN+BLSTM+CTC架构
Chinese-Names-Corpus
中文人名语料库。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。
chinese-ocr
运用tensorflow实现自然场景文字检测,keras/pytorch实现crnn+ctc实现不定长中文OCR识别
chinese_ocr
CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras
crnn-study
crnn study with attention
ctw-baseline
Baseline methods for [CTW dataset](https://ctwdataset.github.io/)
das2018-tutorial
A tutorial on the PyTorch-based ocropus components.
deep-learning-benchmark
Deep Learning Benchmark for comparing the performance of DL frameworks, GPUs, and single vs half precision
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
DeepAudioClassification
Finding the genre of a song with Deep Learning
DeepHashingBaselines
Deep Hashing Baselines
DSS
code for "Deeply supervised salient object detection with short connections" published in CVPR 2017
fma
FMA: A Dataset For Music Analysis
HashNet
Code release for "HashNet: Deep Learning to Hash by Continuation" (ICCV 2017)
huxpro.github.io
My Blog / Jekyll Themes / PWA
notes-linear-algebra
线性代数笔记
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
pydata-notebook
利用Python进行数据分析 第二版 (2017) 中文翻译笔记
speaker_adapted_tts
Making a TTS model with 1 minute of speech samples within 10 minutes
warp-ctc
Pytorch Bindings for warp-ctc