symao (Maoshuiyang)

Maoshuiyang

Geek Repo

Company:The Chinese University of Hong Kong

Location:Hong Kong

Home Page:https://maoshuiyang.github.io/

Github PK Tool:Github PK Tool

symao's starred repositories

visdom

A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.

Language:PythonLicense:Apache-2.0Stargazers:10008Issues:0Issues:0

machine-learning-systems-design

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"

Language:HTMLStargazers:9001Issues:0Issues:0

hypertunity

A toolset for black-box hyperparameter optimisation.

Language:PythonLicense:Apache-2.0Stargazers:136Issues:0Issues:0

Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Language:PythonLicense:MITStargazers:2843Issues:0Issues:0

DWT-DCT-Digital-Image-Watermarking

A digital image watermarking algorithm based on combining two transforms; DWT and DCT.

Language:PythonStargazers:80Issues:0Issues:0

TensorFlow-Tutorials

TensorFlow Tutorials with YouTube Videos

Language:Jupyter NotebookLicense:MITStargazers:9271Issues:0Issues:0

Data-analysis-and-visuliastion

Analyze and Visualize data insights of an audio file in the format .wav (Speech signal ). And communicating findings and Extracting features.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

Emotion-Detection-in-Speech

Predicting emotions based on speech audio samples of American English, German and British English languages using Support Vector Machine, K-Nearest Neighbor, Random Forest and Recurrent Neural Network. Analyzing the performance of each model based on the dataset.

Language:Jupyter NotebookStargazers:18Issues:0Issues:0

librosa

Python library for audio and music analysis

Language:PythonLicense:ISCStargazers:7104Issues:0Issues:0

neat-vision

Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks. (framework-agnostic)

Language:VueLicense:MITStargazers:251Issues:0Issues:0

deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

Language:HTMLStargazers:12213Issues:0Issues:0

efficientdensenet_crnn

memory efficient densenet+lstm+ctc实现中文识别

Language:PythonLicense:MITStargazers:31Issues:0Issues:0

SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Language:PythonLicense:MITStargazers:1979Issues:0Issues:0

espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Language:PythonLicense:NOASSERTIONStargazers:942Issues:0Issues:0

ctc_tensorflow_example

CTC + Tensorflow Example for ASR

Language:PythonLicense:MITStargazers:313Issues:0Issues:0

CRNN_Tensorflow

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

Language:PythonLicense:MITStargazers:1032Issues:0Issues:0

ml-tutorial

machine learning algorithms and implementations

Language:Jupyter NotebookLicense:MITStargazers:114Issues:0Issues:0

transformer-tensorflow

Implementation of Transformer Model in Tensorflow

Language:PythonStargazers:445Issues:0Issues:0

emotion_recognition

CTC for emotion recognition

Language:PythonStargazers:60Issues:0Issues:0

pase

Problem Agnostic Speech Encoder

Language:PythonLicense:MITStargazers:439Issues:0Issues:0

keras-sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Language:PythonStargazers:72Issues:0Issues:0

SMHA

My master thesis: Siamese multi-hop attention for cross-modal retrieval.

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

Multimodal-Transformer

[ACL'19] [PyTorch] Multimodal Transformer

Language:PythonLicense:MITStargazers:811Issues:0Issues:0

multimodal-speech-emotion

TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18

Language:Jupyter NotebookLicense:MITStargazers:258Issues:0Issues:0

lihang_book_algorithm

致力于将李航博士《统计学习方法》一书中所有算法实现一遍

Language:PythonStargazers:5700Issues:0Issues:0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:14220Issues:0Issues:0

generative-models

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

Language:PythonLicense:UnlicenseStargazers:7325Issues:0Issues:0

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonLicense:Apache-2.0Stargazers:5853Issues:0Issues:0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language:C++License:MPL-2.0Stargazers:25243Issues:0Issues:0