Tooru Ogata's repositories
mef-opendata-analysis
This project automates downloading and processing public finance datasets from the Peruvian Ministry of Economy and Finance (MEF) through the datastorefiles service. It downloads both .csv and .zip files, extracts the archives, and organizes the results into a clean folder structure for further processing.
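The extract-and-organize step described above could look roughly like the following sketch. The function name `organize_download` and the `csv/` and `extracted/` folder names are assumptions made for illustration, not the repository's actual layout.

```python
import shutil
import zipfile
from pathlib import Path


def organize_download(downloaded: Path, dest_root: Path) -> Path:
    """Place one downloaded file under dest_root and return where it landed.

    Hypothetical layout: dest_root/csv/<name>.csv for plain CSV files and
    dest_root/extracted/<archive stem>/ for the contents of a .zip file.
    """
    suffix = downloaded.suffix.lower()
    if suffix == ".zip":
        # Each archive gets its own folder so extracted files cannot collide.
        target = dest_root / "extracted" / downloaded.stem
        target.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(downloaded) as archive:
            archive.extractall(target)
        return target
    if suffix == ".csv":
        target_dir = dest_root / "csv"
        target_dir.mkdir(parents=True, exist_ok=True)
        return Path(shutil.copy2(downloaded, target_dir / downloaded.name))
    raise ValueError(f"unsupported file type: {downloaded.name}")
```

Calling `organize_download(Path("some_file.zip"), Path("data"))` would unpack the archive into `data/extracted/some_file/`. The download itself is left out here because the service's URLs are not given in this description.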
DotaMatchPrediction
This project predicts Dota 2 match outcomes using machine learning models, leveraging data from OpenDota. The workflow includes data extraction, cleaning, feature engineering, and model training (including XGBoost and linear/logistic regression).
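As a rough illustration of the feature engineering and logistic regression steps, the sketch below encodes a draft as one signed feature per hero and turns a weighted sum into a win probability. The encoding scheme and the names `encode_draft` and `win_probability` are assumptions; the repository may build its features differently.

```python
import math


def encode_draft(radiant_picks, dire_picks, hero_ids):
    """Encode a draft as one feature per hero: +1 Radiant, -1 Dire, 0 unpicked."""
    index = {hero_id: pos for pos, hero_id in enumerate(hero_ids)}
    features = [0] * len(hero_ids)
    for hero_id in radiant_picks:
        features[index[hero_id]] = 1
    for hero_id in dire_picks:
        features[index[hero_id]] = -1
    return features


def win_probability(features, weights, bias=0.0):
    """Logistic model: probability that Radiant wins for the given features."""
    score = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-score))
```

In practice the weights would come from a trained model; with all weights at zero, the function returns 0.5 for any draft.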
pte-personal-scrap
A data pipeline project for scraping, transforming, and analyzing personnel data from various ministries.
bcrp-data-hub
Downloads and consolidates time series from the Banco Central de Reserva del Perú (BCRP) API. The tool scrapes series metadata, downloads more than 16,000 series in asynchronous batches, and stores them as Parquet files for fast analytics.
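Downloading in async batches might be sketched as follows. The names `chunked`, `download_in_batches`, and `fake_fetch` are hypothetical, and `fake_fetch` stands in for a real HTTP request to the API; writing the results to Parquet is omitted because it needs a third-party library.

```python
import asyncio


def chunked(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


async def download_in_batches(codes, fetch_series, batch_size=100):
    """Fetch every series code, running one batch of requests concurrently at a time."""
    results = {}
    for batch in chunked(codes, batch_size):
        # gather() preserves argument order, so payloads line up with the batch.
        payloads = await asyncio.gather(*(fetch_series(code) for code in batch))
        results.update(zip(batch, payloads))
    return results


async def fake_fetch(code):
    """Stand-in for a real HTTP call; returns a dummy payload."""
    await asyncio.sleep(0)
    return {"code": code, "values": []}


if __name__ == "__main__":
    codes = ["SERIES_A", "SERIES_B", "SERIES_C"]  # placeholder codes
    series = asyncio.run(download_in_batches(codes, fake_fetch, batch_size=2))
    print(sorted(series))
```

Processing one batch at a time caps the number of simultaneous requests at `batch_size`, which is a simple way to avoid overloading a public API.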
seace-contract-ocr-bow
This project automates the process of downloading PDF contracts from a public source, converting them to images, extracting text using OCR (Optical Character Recognition), and transforming the extracted text into a structured format for further analysis. It is designed for large-scale document processing and text mining tasks.
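The final step, turning OCR output into a structured form, can be illustrated with a bag-of-words count, which the "bow" in the repository name suggests. This is a minimal sketch using only the standard library; the function name `bag_of_words` and the `min_length` filter are assumptions, and the OCR step itself is not shown.

```python
import re
from collections import Counter


def bag_of_words(text, min_length=3):
    """Count lowercase alphabetic tokens of at least `min_length` characters."""
    # [^\W\d_]+ matches runs of letters, including accented ones, but no digits.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    return Counter(token for token in tokens if len(token) >= min_length)
```

For example, `bag_of_words("El contrato N° 12: contrato de obra")` counts `contrato` twice and `obra` once, and drops the short tokens and the number.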
Tooruogata.github.io
Personal website.
CryptoData
Downloads cryptocurrency data from Binance using the Python API. The guide this code follows is referenced in the source. The main parameters you need are the cryptocurrency, the time interval, and the initial date. Exported data is also included in this repository.
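To show how the three parameters fit together, the sketch below builds the query parameters for Binance's public klines (candlestick) REST endpoint. This is an assumption made for illustration: the repository may use a Python client library instead, and the function name `build_klines_params` is hypothetical. No request is sent here.

```python
from datetime import datetime, timezone

# Public REST endpoint for candlestick data.
KLINES_URL = "https://api.binance.com/api/v3/klines"


def build_klines_params(symbol, interval, start_date, limit=1000):
    """Build klines query parameters from a symbol, an interval, and a YYYY-MM-DD date."""
    start = datetime.strptime(start_date, "%Y-%m-%d").replace(tzinfo=timezone.utc)
    return {
        "symbol": symbol.upper(),                   # e.g. "BTCUSDT"
        "interval": interval,                       # e.g. "1d", "1h"
        "startTime": int(start.timestamp() * 1000),  # milliseconds since epoch, UTC
        "limit": limit,
    }
```

The resulting dictionary can be passed as the query string of a GET request to `KLINES_URL` with any HTTP client.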