Introduction
This repository collects the code written for the iThome competition essay series: Data and Machine Learning in NLP aspect (玩轉資料與機器學習-以自然語言處理為例)
Table of Contents
Although all of the essays are written in Chinese, I have translated their titles into English.
- Roadmap
- Web Crawling
- Web Crawling Day1 - Introduction
- Web Crawling Day2 - HTML File Obtaining and General Problems
- Web Crawling Day3 - HTML File Obtaining and General Problems (Cont.)
- Web Crawling Day4 - HTML File Analysis
- Web Crawling Day5 - Advanced: Async Crawling
- Web Crawling Day5 - Advanced: Async Crawling and Multithreading
- Database
- Pandas
- Natural Language Processing
- Information Retrieval
- Preprocessing
- Machine Learning
- Machine Learning Introduction
- Clustering Algorithm Theory
- Clustering Algorithm Implementation - Distinguishing Between Different Algorithms
- Product Label Clustering - Word2Vec
- Classification Theory - Traditional Algorithms
- Classification Theory - SVM and XGB+CV
- Classification Implementation - LineBot Project
- Conclusion
- Weekend Special
Feedback I got from LinkedIn
Google Translation:
Hi, I see your article on Iron Man
Feel very good writing, although I am now a Web Developer
But the data chain that I studied at the institute previously also uses Python to handle ML, data mining, etc.
Your article made me impulsive and wanted to return to Python, and looked at your article to return to practice
Hope to establish a relationship with you:)