azf99 / CDRI-test-assign

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CDRI-test-assign

This notebook focuses on mainly extracting articles from the the Times of India website and performing preliminary analysies on the text and also visualizing our results.

Some feaures could not be completed, like some nltk and textblob processes of lemmatization, TF-IDF etc. due to the lack of time

I'll be updating this repository as I have completed all the left out stages.

Created by:- Azfar Lari using Google Colaboratory http://linkedin.com/in/azfar-lari Submitted to:- Dr. Sukant Khurana https://scholar.google.co.in/citations?user=LiTpdBYAAAAJ&hl=en&oi=ao

References: https://www.datacamp.com/community/tutorials/web-scraping-using-python https://www.analyticsvidhya.com/blog/2015/10/beginner-guide-web-scraping-beautiful-soup-python/ https://www.analyticsvidhya.com/blog/2018/02/the-different-methods-deal-text-data-predictive-python/ https://www.datacamp.com/community/tutorials/text-analytics-beginners-nltk

About


Languages

Language:Jupyter Notebook 100.0%