greyatomtestuser3 / nlp_day_01_project

Natural Language Processing Project

Natural Language Processing (NLP) is a branch of artificial intelligence concerned with the ability of a computer program to understand human language, whether spoken or written.

Now, let's dig deeper and learn how to build a full-fledged model that understands textual data.

What have we learnt so far?

In today's lecture we learned some of the core concepts involved in natural language processing, such as:

  • Sentence Splitter
  • Tokenization
  • POS (part-of-speech) tagging
  • Stop-word Removal
  • Stemming
  • Lemmatization
  • Chunking
  • Deep Parsing
  • Word Cloud
  • TF-IDF
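
A few of the steps above (sentence splitting, tokenization, stop-word removal and TF-IDF) can be sketched in plain Python. This is only a minimal illustration, not the code used in the lecture: the function names, the tiny `STOP_WORDS` set and the sample text are all assumptions made for this example, and each sentence is treated as a "document" for the TF-IDF step. In practice you would more likely reach for a library such as NLTK or scikit-learn, which also cover stemming, lemmatization and POS tagging.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "the", "is", "are", "on", "in", "of", "and", "to"}


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenize(sentence):
    """Lowercase the sentence and keep runs of letters, digits and apostrophes."""
    return re.findall(r"[a-z0-9']+", sentence.lower())


def remove_stop_words(tokens):
    """Drop every token that appears in STOP_WORDS."""
    return [t for t in tokens if t not in STOP_WORDS]


def tf_idf(docs):
    """Compute TF-IDF for a list of token lists.

    Uses tf = count / len(doc) and idf = log(N / df), one common variant
    among several. Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents does each term occur?
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return weights


# Hypothetical sample text, with each sentence treated as one document.
text = "The cat sat on the mat. The dog chased the cat! Dogs are loyal."
sentences = split_sentences(text)
docs = [remove_stop_words(tokenize(s)) for s in sentences]
scores = tf_idf(docs)
```

Here "cat" occurs in two of the three sentences, so it gets a lower weight than "sat" or "mat", which occur in only one. That is the intuition behind TF-IDF: terms that appear everywhere say less about any single document.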

Why solve this assignment?

Let's review some of the concepts taught in class. By the end of the assignment you'll be able to:

  • Implement the preprocessing steps required when working with natural language.
  • Show the contextual understanding learned by the machine.

Dataset

For this exercise, we will use the Twenty Newsgroups Data Set, which comprises roughly 20,000 messages taken from 20 newsgroups.

Languages

Python 90.7%, Jupyter Notebook 9.3%