This is a small project I built to solve a problem I had:

Users can submit requests with multiple configurations and different arguments, and we want to cache the resulting predictions whenever possible.
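The caching idea boils down to deriving a deterministic key from the configuration plus the request arguments. A minimal sketch of such a helper (illustrative only, not the project's actual code):

```python
import hashlib
import json


def cache_key(config: dict, request_args: dict) -> str:
    """Build a deterministic cache key from a model config and request arguments.

    Sorting the keys makes the JSON serialization order-independent, so two
    logically identical requests map to the same key.
    """
    payload = json.dumps({"config": config, "args": request_args}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()
```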
This project will:
- Use FastAPI to create a REST API
- Use SQLModel to store predictions, keyed by the configuration and the request.
- Use HuggingFace Transformers to run the prediction.
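The store-or-reuse flow can be sketched as below, using the stdlib's sqlite3 in place of SQLModel to keep the example self-contained (the table and column names are hypothetical):

```python
import sqlite3


def get_or_compute(conn: sqlite3.Connection, key: str, compute) -> str:
    """Return a cached prediction for `key`, computing and storing it on a miss."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS predictions (key TEXT PRIMARY KEY, result TEXT)"
    )
    row = conn.execute(
        "SELECT result FROM predictions WHERE key = ?", (key,)
    ).fetchone()
    if row is not None:
        return row[0]  # cache hit: skip the model entirely
    result = compute()  # cache miss: run the (expensive) prediction
    conn.execute("INSERT INTO predictions (key, result) VALUES (?, ?)", (key, result))
    conn.commit()
    return result
```

In the real project the `compute` step would be a HuggingFace Transformers call, and the lookup would go through SQLModel models rather than raw SQL.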
Hopefully, this can help others.
DISCLAIMER

I am aware that `model_runner.py` should be split into multiple "Task" objects. This is out of scope for this project. If you want to use this project, let me know, and we can refactor that part together; it won't be too hard.
Future work
- Expand set of capabilities
- Work on entire datasets
- Uncertainty estimation
- Progress bar
- Async routes
- Run the app: `make DEVICE=cpu compose` (optionally `DEVICE=gpu` to use a GPU)
- Go to 0.0.0.0:8080/docs
- Execute the `/predictions` API with different requests.
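As an illustration, a request body for `/predictions` might look like the following (the field names are hypothetical; check the generated `/docs` page for the actual schema):

```json
{
  "model_name": "distilbert-base-uncased-finetuned-sst-2-english",
  "inputs": "I really enjoyed this movie!"
}
```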
- Run the tests: `make test`
- DEV autoformatting: `make black`