Clone the repository
$ git clone https://github.com/open-data-kazakhstan/top-10-car-models-by-year.git
Requires Python 3.11.3
Create a virtual environment and activate it
pip install venv
python -m venv /path/to/localrepo
Swicth to venv directory by using cd comand
cd /path/to/localrepo
Scripts/activate
Install dependecies in venv by using pip
pip install -r requirements.txt
Run the project:
python scripts/main.py
Car data collected by hand from https://auto.vercity.ru/statistics/sales/marks/
We downoladed data from these sources and placed it in the acrhive folder as car_models.csv.
We have processed the source data to make it normalized and derived several aggregated datasets from it:
archive/car_models.csv
- sourсe datadata/car.csv
- wranged and transposed datadata/csv_expandeds.csv
- expanded main datasetdatapackage.json
- conatins all of the key information about our dataset
wrang.py
- cleaning and wranging the source data scriptexpand.py
- uses main dataset and expands it to 25 steps to make animation smootheranimate.py
- uses matplotlib to create an infographic about car sales for all modelsdatapack.py
- creating datapckage.json file that conatinsall meatadatamain.py
- launches all scripts step by step
Final result is visualized data that displays average salary and inflation data
This dataset is licensed under the Open Data Commons Public Domain and Dedication License.