cc-hpc-itwm / tarantella

An Open-Source Distributed Deep Learning Framework

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Tarantella



Tarantella is an open-source, distributed Deep Learning framework built on top of TensorFlow, providing scalable Deep Neural Network training on CPU and GPU compute clusters.

Tarantella offers an easy-to-use data parallel solution for speeding up the training of Tensorflow models. It provides full support for the TensorFlow Keras and Dataset APIs, allowing users to efficiently harness large numbers of compute resources without requiring any knowledge of parallel computing.

Goals

Tarantella is designed to meet the following goals:

  • ease of use
  • synchronous training scheme
  • seamless integration with existing Keras models
  • support for GPU and CPU systems
  • strong scalability

Quick Start

To get started, you only need to add two lines of code to enable data parallel training for your Keras model.

Take a look at the highlighted lines in the following code snippet:

That's it!

All the necessary steps to distribute training and datasets will now be automatically handled by Tarantella. A full version of the above example can be found here.

Now simply train the model distributedly by executing one of the following commands:

Detailed instructions and configuration options are provided in the technical docs.

Install

To build Tarantella from source, check out the installation guide.

Tarantella relies on the following dependencies:

Resources

License

Tarantella is licensed under the GPL-3.0 License. See LICENSE for details.

About

An Open-Source Distributed Deep Learning Framework

License:GNU General Public License v3.0


Languages

Language:Python 79.7%Language:C++ 10.7%Language:CMake 9.5%Language:Shell 0.1%