Rshcaroline / UCSD-CSE256-Statistical-NLP

This course will explore statistical techniques for the automatic analysis of natural language data. Specific topics covered include: probabilistic language models, which define probability distributions over text sequences; text classification; sequence models; parsing sentences into syntactic representations; machine translation, and machine reading.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

UCSD-CSE256-Statistical-NLP

Natural language processing (NLP) is a field of AI which aims to equip computers with the ability to intelligently process natural (human) language. This course will explore statistical techniques for the automatic analysis of natural language data. Specific topics covered include: probabilistic language models, which define probability distributions over text sequences; text classification; sequence models; parsing sentences into syntactic representations; machine translation, and machine reading.

Assignments Topics Requirement Project Report Score
1. Comparing Language Models Language Model + Trigram + Smoothing Requirement Report 100/100; 15%
2. Semi-supervised Text Classification Feature Engineering + Semi-supervised learning Requirement Report 100/100; 20%
3. Sequence Tagging Trigram HMM Requirement Report 100/100; 10%
4. Machine Translation IBM Model 1&2 Requirement Report 104/104; 15%

About

This course will explore statistical techniques for the automatic analysis of natural language data. Specific topics covered include: probabilistic language models, which define probability distributions over text sequences; text classification; sequence models; parsing sentences into syntactic representations; machine translation, and machine reading.


Languages

Language:JavaScript 36.5%Language:TeX 34.0%Language:Python 22.7%Language:Jupyter Notebook 6.5%Language:C++ 0.3%Language:Shell 0.1%Language:Makefile 0.0%