weimeng23 / speech-recognition-learning-resources

:white_check_mark: A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.

Home Page:https://github.com/weimeng23/speech-recognition-learning-resources

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Speech Recognition Learning Resources

This repo contains several learning resources for speech recognition, including courses, books, tutorials, papers and toolkits.(continuously updating)

Table of contens

Courses

  • (Recommended) Automatic Speech Recognition (ASR) 2018-2019 Lectures, School of Informatics, University of Edingburgh [Website]
  • Speech recognition, EECS E6870 - Spring 2016, Columbia University [Website]
  • CS224N: Natural Language Processing with Deep Learning, Stanford [Website] [Video(Winter 2021)] [Video(Winter 2017)]
  • CS224S: Spoken Language Processing (Winter 2021), Stanford [Website]
  • DLHLP: DEEP LEARNING FOR HUMAN LANGUAGE PROCESSING, 2020 SPRING, Hung-yi Lee [Website] [Video(Spring 2020)]
  • Microsoft DEV287x: Speech Recognition Systems, 2019 [Website]
  • 语音识别从入门到精通,2019,谢磊 (NOT FREE) [Website]
  • 數位語音處理概論,国立**大学,李琳山 [Website]

Books

  • Fundamentals of speech recognition, Lawrence Rabiner, Being-Hwang Juang, 1993 [Book]
  • Spoken language processing: A guide to theory, algorithm, and system levelopment, xuedong Huang, Alex acero, hsiao-wuen Hon, 2001 [Book]
  • Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Daniel Jurafsky & James H. Martin [Website] [Book 3rd Ed]
  • Automatic speech recognition: A Deep Learning Approach, Dong Yu and Li Deng, Springer, 2014 [Book]
  • Foundations of Statistical Natural Language Processing, Chris Manning and Hinrich Schütze, 1999 [Website] [Book]
  • 《解析深度学习:语音识别实践》,俞栋,邓力,电子工业出版社
  • 《Kaldi 语音识别实战》,陈果果,电子工业出版社
  • 《语音识别:原理与应用》,洪青阳,电子工业出版社
  • 《语音识别基本法》,汤志远,电子工业出版社
  • 《统计学习方法》李航,清华大学出版社
  • 《语音信号处理》韩继庆,清华大学出版社
  • 《语音信号处理》赵力,机械工业出版社

Papers

  • HMM: Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989, 77(2): 257-286. [Paper]
  • EM: Bilmes J A. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models[J]. International Computer Science Institute, 1998, 4(510): 126. [Paper]
  • CTC: Graves A, Fernández S, Gomez F, et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks[C]//Proceedings of the 23rd international conference on Machine learning. 2006: 369-376. [Paper]

Tutorials

  • WFST
    • An Introduction to Weighted Automata in Machine Learning, Awni Hannun, 2021. [PDF]
  • k2
    • Speech Recognition with Next-Generation Kaldi (K2, Lhotse, Icefall), Interspeech 2021. [Video]
    • Progress in ASR with Next-Gen Kaldi, BAAI 2022. [Video] [Slides]
    • Speech Recognition with Icefall + Lhotse, Interspeech 2023. [Slides]

Toolkits

listed in no particular order

About

:white_check_mark: A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.

https://github.com/weimeng23/speech-recognition-learning-resources

License:Creative Commons Attribution Share Alike 4.0 International