symao's starred repositories
machine-learning-systems-design
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
hypertunity
A toolset for black-box hyperparameter optimisation.
Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
DWT-DCT-Digital-Image-Watermarking
A digital image watermarking algorithm based on combining two transforms; DWT and DCT.
TensorFlow-Tutorials
TensorFlow Tutorials with YouTube Videos
Data-analysis-and-visuliastion
Analyze and Visualize data insights of an audio file in the format .wav (Speech signal ). And communicating findings and Extracting features.
Emotion-Detection-in-Speech
Predicting emotions based on speech audio samples of American English, German and British English languages using Support Vector Machine, K-Nearest Neighbor, Random Forest and Recurrent Neural Network. Analyzing the performance of each model based on the dataset.
neat-vision
Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks. (framework-agnostic)
deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
efficientdensenet_crnn
memory efficient densenet+lstm+ctc实现中文识别
ctc_tensorflow_example
CTC + Tensorflow Example for ASR
CRNN_Tensorflow
Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition
ml-tutorial
machine learning algorithms and implementations
transformer-tensorflow
Implementation of Transformer Model in Tensorflow
emotion_recognition
CTC for emotion recognition
keras-sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
multimodal-speech-emotion
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
lihang_book_algorithm
致力于将李航博士《统计学习方法》一书中所有算法实现一遍
generative-models
Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.