izouazou / AirQuality-ETL

Air Quality ETL is a Python repository facilitating the extraction, transformation, and loading of air quality data from RapidAPI to a Pandas DataFrame for easy analysis and customization.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Air Quality ETL Process

This repository contains a simple Python script AirQuality_ETL.py for extracting, transforming, and loading (ETL) air quality data from a specified location using the RapidAPI service. The extracted data is then transformed into a Pandas DataFrame and saved as a CSV file for further analysis.

Getting Started

To use this script, follow the steps below:

Prerequisites

RapidAPI Key: Obtain your RapidAPI key and update the .env file with your key. If you don't have a key, sign up on the RapidAPI website and create a new project to obtain your API key.

Environment Setup: Set up a Python virtual environment:

    c:/python311/python.exe -m venv env
    env\Scripts\activate

Dependencies: Install the required Python libraries by running the following command:

    pip install -r requirements.txt  

Configuration

Ensure that you have updated the .env file with your RapidAPI key:

    RAPIDAPI_KEY="your_rapidapi_key_here"  

Usage

Run the AirQuality_ETL.py script:

    python AirQuality_ETL.py  

Check the generated air_quality_data.csv file for the extracted air quality data in CSV format.

Project Structure

AirQuality_ETL.py: Python script for ETL process.
config.py: Configuration file containing the RapidAPI key.
.env: Environment file storing sensitive information (e.g., API keys).
requirements.txt: List of Python dependencies.

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

Air Quality ETL is a Python repository facilitating the extraction, transformation, and loading of air quality data from RapidAPI to a Pandas DataFrame for easy analysis and customization.

License:MIT License


Languages

Language:Python 100.0%