- Idea is to somehow scrape LinkedIn job listings on a timed interval
-
Login first with
aws ecr get-login-password --region ap-south-1 | docker login --username AWS --password-stdin 832214191436.dkr.ecr.ap-south-1.amazonaws.com
-
One time
aws ecr create-repository --repository-name linkedinjobscraper
-
Run on every update
- Build docker image with
docker build -t linkedinjobscraper .
- Add latest tag with
docker tag linkedinjobscraper:latest 832214191436.dkr.ecr.ap-south-1.amazonaws.com/linkedinjobscraper:latest
- Push to AWS ECR with
docker push 832214191436.dkr.ecr.ap-south-1.amazonaws.com/linkedinjobscraper:latest
- Build docker image with
- Create a
.env
file containing the relevant variables from thescraper.py
script - Run the
scraper.py
script - Refer to CSV data saved