docker-flexgen

FlexGen in Docker.

Requirements

- cuda-drivers
- nvidia-container-toolkit

Start chatbot

    $ docker compose build
    $ docker compose run chatbot

Apps

TODO
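
Before running the build, it can help to confirm that the host tools behind the requirements are actually installed. The following is a minimal sketch, not part of this repository: the helper names `have_cmd` and `preflight` are hypothetical, and it assumes that `nvidia-smi` is provided by the driver package and `nvidia-ctk` by nvidia-container-toolkit, which may differ on your distribution.

```shell
#!/bin/sh
# Preflight sketch: report whether the host tools this README relies on are on PATH.
# The binary names checked at the bottom are assumptions, not taken from this repository.

# have_cmd NAME: succeed if NAME resolves to a command in the current shell.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# preflight NAME...: print one status line per command and
# return the number of commands that were not found.
preflight() {
    missing=0
    for cmd in "$@"; do
        if have_cmd "$cmd"; then
            echo "ok:      $cmd"
        else
            echo "missing: $cmd"
            missing=$((missing + 1))
        fi
    done
    return "$missing"
}

# Assumed mapping: nvidia-smi comes with the NVIDIA driver,
# nvidia-ctk comes with nvidia-container-toolkit.
preflight docker nvidia-smi nvidia-ctk ||
    echo "install the missing tools before running: docker compose build"
```

This only checks that the commands exist. It does not verify that containers can see the GPU, which still depends on the Docker runtime configuration.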