chtompki / rva-data-workshop

Repository with exercises for the RVA data summit 2020 intro to data engineering workshop

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RVA Data Workshop 2020 Intro to Data Engineering

This work was created as part of the RVA Data Summit Workshop 2020 session, "Introduction to Data Engineering."

It provides a tested, deployable REST microservice that returns a calculated value.

For information on how to work with the microservice, see specification.yaml.

Getting Started

Prerequisites

Please install Docker, Pip and Python3

On Mac:

brew install docker
brew install python3

Installing

To run the service locally:

Install the requirements

pip install -r requirements.txt

Install an editable, local version of the package

pip install -e .

Run the service locally

python3 app.py

Build and run the service in a container

docker build -t rvadatasummit/intro .
docker run -d -p 5000:5000 rvadatasummit/intro 

Running the tests

You can run the tests using the Python pytest module.

pytest

Deployment

To deploy in the cloud, follow instructions for your cloud provider:

Built With

  • Python - Used to write the service
  • Flask - Used as the web framework
  • Docker - Used to run the service in a container

Contributing

If you'd like to contribute please create a feature branch and a pull request. For more information on contributing best practices, check out the excellent contributing guidelines provided by Pandas.

Authors

See also the list of contributors who participated in this project.

License

This project is licensed under the MIT License, as can be seen in the LICENSE file.

Acknowledgments

About

Repository with exercises for the RVA data summit 2020 intro to data engineering workshop

License:MIT License


Languages

Language:Python 87.0%Language:Dockerfile 13.0%