zyliutw / NEW_YORK_NEWS_CRAWLER

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

New York News Crawler

Easily getting the news from New York Times.

INFO

ONLY FOR EDUCATION 方便從New York Times抓取新聞資料,供學術使用

Requirements

  1. Beautiful Soup
  2. selenium
  3. python3

INPUT FILE FORMAT

It will fetch all the search topics during start time and end time.
Search Topic Start Time End Time
Wet Seal Inc 2010/7/1 2010/12/31
Local Corp 2010/7/1 2010/12/31

OUTPUT FILE FORMAT

One topic output to one folder.

NEWS_TOPIC
NEWS_TIME
NEWS_CONTENT

RUN

git clone https://github.com/yad50968/NEW_YORK_NEWS_CRAWLER.git
cd NEW_YORK_NEWS_CRAWLER
python3 ./main.py

About

License:MIT License


Languages

Language:Python 100.0%