ehddnr301 / Investment-Data-Pipeline

Pykrx-Data-Pipeline

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pykrx-Data-Pipeline

Project Environment

  • GCP Compute Engine (e2-medium vCPU2, 4GB memory, Ubuntu 20.04.5 LTS)
  • Docker version 20.10.22
  • docker-compose version 1.25.0
  • pre-commit: black

Appendix

  • google-colud-function을 이용한 추가
    • gcloud functions deploy stock_data_etl2 --runtime python38 --trigger-http --memory=1024MB
    gcloud scheduler jobs create http my_job \
    --location us-central1 \
    --schedule "30 18 * * *" \
    --time-zone "Asia/Seoul" \
    --uri "{YOUR CLOUD FUNCTION API URI}" \
    --http-method POST \
    --oidc-service-account-email {YOUR SERVICE ACCOUNT}

To-Do List (2023-01-19)

  • Postgres를 대체할 DB형태 고려

To-Do List (2023-01-18)

  • Notification 추가

To-Do List (2023-01-17)

  • Prefect 로 스케줄 관리

To-Do List (2023-01-15)

  • 중복데이터 처리 로직 추가
  • Insert Performance 향상

To-Do List (2023-01-13)

  • pykrx 테스트
    • ohlcv
    • marketcap
    • 외국인(foreigner)
    • 금융투자(brokerage)
    • 투신(investment)
    • 개인(individual)
  • db 저장 (postgres)
  • Dockerfile형태 제작
  • docker-compose형태 제작
  • crontab 설정

About

Pykrx-Data-Pipeline


Languages

Language:Python 94.3%Language:Shell 4.0%Language:Dockerfile 1.7%