WhiskersReneeWe / orchest

Orchest is a tool for creating data science pipelines.

Home Page:https://www.orchest.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool


WebsiteDocsQuickstart


Join us on Slack

Orchest is a web based tool for creating data science pipelines. Under the hood Orchest runs a collection of containers to provide a scalable platform that can run on your laptop as well as on a large scale cloud cluster.

Orchest lets you:

  • Visually construct pipelines.
  • Write code using JupyterLab.
  • Write code using any other editor of choice.
  • Run any subset of a pipeline.
  • Skip certain cells when executing a notebook top-to-bottom.
  • Parametrize your data science pipelines to experiment with different modeling ideas.
  • Integrate commonly used data sources.
  • Easily define your custom runtime environment.
  • Version your pipelines through git.

Preview

In our docs you can find a comprehensive quickstart tutorial!

clip-3-cropped A preview of running pipelines in the pipeline editor of Orchest.

Installation

NOTE: Orchest is in alpha.

For GPU support and other installation methods, such as building from source, refer to our installation docs.

Requirements

  • Docker

If you do not yet have Docker installed, please visit https://docs.docker.com/get-docker/.

Linux, macOS and Windows

Simply follow the steps below to install Orchest. For Windows, please read the note at the bottom first.

git clone https://github.com/orchest/orchest.git
cd orchest

# The update command is used both for installation and updating to
# the newest release.
./orchest update

# Verify the installation. This should print the help message.
./orchest

NOTE: On Windows, Docker has to be configured to use WSL 2. Make sure to clone Orchest inside the Linux environment. For more info and installation steps for Docker with WSL 2 backend, please visit https://docs.docker.com/docker-for-windows/wsl/.

Starting Orchest

# Make sure to be in the cloned "orchest" directory.
./orchest start

License

The software in this repository is licensed as follows:

  • All content residing under the "orchest-sdk/" directory of this repository is licensed under the "Apache-2.0" license as defined in "orchest-sdk/LICENSE".
  • Content outside of the above mentioned directory is available under the "AGPL-3.0" license.

Contributing

Contributions are more than welcome! Please see our contributer guides for more details.

We love your feedback

We would love to hear what you think and potentially add features based on your ideas. Come chat with us on our Slack.

About

Orchest is a tool for creating data science pipelines.

https://www.orchest.io

License:GNU Affero General Public License v3.0


Languages

Language:Python 55.8%Language:JavaScript 36.6%Language:Shell 2.8%Language:SCSS 2.7%Language:Dockerfile 1.0%Language:HTML 0.8%Language:CSS 0.2%Language:Jupyter Notebook 0.1%