UA LING/CS438/538 (Fall 2016) Statistical Natural Language Processing.
Department of Linguistics, University of Arizona (imported from https://sites.google.com/site/ling439539fall2016/)
Schedule
Date | Description | Course Materials |
---|---|---|
Aug 23 | introduction to NLP | |
25 | ch. 1, linguistic essentials | [slides] |
30 | ch. 3, introduction | [slides] |
Sep 1 | (continued) | [assignment#1] |
| | compound splitting | [slides] [Koehn-Knight2003] |
6 | ch. 6, statistical inference: n-gram models | [slides] |
8 | (continued) | [slides] |
13 | (continued) | [slides] [assignment#2] |
| | ch. 10, part-of-speech tagging | [slides] |
15 | (continued) | |
20 | HMM POS tagging | [slides] [Rabiner1989] |
22 | (continued) | |
27 | (continued) | |
| | introduction to EM & unsupervised HMM | [slides] [Dempster-EtAl1977] [Blime1998] |
29 | ch. 11, probabilistic CFG | [slides] |
Oct 4 | (continued) CKY | |
6 | (continued) Earley | |
11 | (continued) NLTK chart parsing + partial parsing | [assignment#3] |
13 | probabilistic CFG | [slides] |
18 | final project proposal | [slides] |
20 | (continued) | |
25 | ch. 12, probabilistic parsing | [slides] |
27 | inside-outside algorithm | [slides] [Lari-Young1990] |
Nov 1-3 | ch. 13, SMT: word alignment, IBM Model 1 | [slides] |
8-10 | (continued) | |
15,17,22 | classification vs. clustering, WSD using EM | [slides] |
29 & Dec. 1,6 | final project presentation | [slides] |
Wed. Dec 7, 2016 | public poster session | [slides] |
[Koehn-Knight2003] Philipp Koehn and Kevin Knight (2003). Empirical Methods for Compound Splitting. In Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics. Pages 187-193. PA, USA.
[Rabiner1989] Lawrence R. Rabiner (1989). A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE. 77(2):257-286.
[Dempster-EtAl1977] Arthur P. Dempster, Nan M. Laird, and Donald B. Rubin (1977). Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B (Methodological). 39(1):1-38.
[Blime1998] Jeff A. Bilmes (1998). A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models. TR-97-021, International Computer Science Institute.
[Lari-Young1990] Karim Lari and Steve Young (1990). The Estimation of Stochastic Context-Free Grammars Using the Inside-Outside Algorithm. Computer Speech & Language. 4(1):35-56.
Jungyeul Park (2016), Lecture Notes on Statistical Natural Language Processing. Department of Linguistics, University of Arizona (Fall 2016)