Sunghee2 / Lottery-crawler

๐Ÿค‘ ์ž๋™์œผ๋กœ ๊ตฌ๋งคํ•œ 1๋“ฑ์˜ ๊ตฌ๋งค์ฒ˜ ํฌ๋กค๋ง using Python + AWS Lambda + AWS RDS(MySQL)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Lottery-crawler

๋กœ๋˜ 6/45์—์„œ "์ž๋™"์œผ๋กœ ๊ตฌ๋งคํ•œ 1๋“ฑ์˜ ๊ตฌ๋งค์ฒ˜๋ฅผ ํฌ๋กค๋งํ•œ๋‹ค.

๋กœ๋˜ ๋ฒˆํ˜ธ ์„ ํƒ ๋ฐฉ๋ฒ•์—๋Š” ์ž๋™, ๋ฐ˜์ž๋™, ์ˆ˜๋™ 3๊ฐ€์ง€๊ฐ€ ์žˆ๋‹ค. ์†Œ์œ„ ๋งํ•˜๋Š” '๋กœ๋˜ ๋ช…๋‹น' ์ˆœ์œ„๋Š” ์ด 3๊ฐ€์ง€ ์„ ํƒ ๋ฐฉ๋ฒ•์„ ๋ชจ๋‘ ํฌํ•จํ•œ 1๋“ฑ ๋‹น์ฒจ์ž์˜ ์ˆ˜๋กœ ์ง€์ •๋˜๊ธฐ ๋•Œ๋ฌธ์— '์ž๋™'์œผ๋กœ ๊ตฌ๋งคํ•œ 1๋“ฑ์˜ ๊ตฌ๋งค์ฒ˜๋ฅผ ํฌ๋กค๋ง ํ•ด๋ณด์•˜๋‹ค.


Architecture


Data Acquisition

ํŒ๋งค์  ์ •๋ณด ์—…๋ฐ์ดํŠธ ์‹œ๊ฐ„์„ ๊ณ ๋ คํ•˜์—ฌ ๋งค์ฃผ ์ผ์š”์ผ์— ์ƒํ˜ธ๋ช…๊ณผ ์†Œ์žฌ์ง€๋ฅผ ์ˆ˜์ง‘ํ•œ๋‹ค.

์ƒํ˜ธ๋ช…์ด ๊ฐ™๊ฑฐ๋‚˜ ์—†๋Š” ํŒ๋งค์ ์ด ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ๊ตฌ๋ถ„์„ ์œ„ํ•ด ์†Œ์žฌ์ง€๋„ ํ•จ๊ป˜ ์ˆ˜์ง‘ํ•œ๋‹ค.

crawler.py ๋Š” amazon lambda๋ฅผ ์‚ฌ์šฉํ•˜์ง€ ์•Š๊ณ  262ํšŒ์ฐจ ~ 853ํšŒ์ฐจ ํŒ๋งค์ ์„ ํฌ๋กค๋งํ•œ๋‹ค.

lambda_crawler.py ๋Š” amazon lambda๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋งค์ฃผ ์ผ์š”์ผ๋งˆ๋‹ค ์ตœ์‹  ํšŒ์ฐจ์˜ ํŒ๋งค์ ์„ ํฌ๋กค๋งํ•œ๋‹ค.

lambda.zip ์€ ์‚ฌ์šฉํ•œ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์™€ ์œ„์˜ lambda_crawler.py ๋ฅผ ์••์ถ•ํ•œ ํŒŒ์ผ๋กœ amazon lambda์— ์—…๋ฐ์ดํŠธํ•œ ํŒŒ์ผ์ด๋‹ค.

Prerequisites

mysql -h [Endpoint] -P 3306 -u [user_name] -p ์œผ๋กœ ์ ‘์†ํ•œ ํ›„ ์•„๋ž˜์™€ ๊ฐ™์ด ํ…Œ์ด๋ธ”์„ ์ƒ์„ฑํ•œ๋‹ค.

CREATE TABLE lottery ( 
lottery_id INT NOT NULL AUTO_INCREMENT, 
shop VARCHAR(50) NOT NULL, 
location VARCHAR(200) NOT NULL, 
PRIMARY KEY(lottery_id) 
) DEFAULT CHARSET=utf8 COLLATE=utf8_bin;

Result

About

๐Ÿค‘ ์ž๋™์œผ๋กœ ๊ตฌ๋งคํ•œ 1๋“ฑ์˜ ๊ตฌ๋งค์ฒ˜ ํฌ๋กค๋ง using Python + AWS Lambda + AWS RDS(MySQL)


Languages

Language:Python 100.0%