puqiu / orchest

A new kind of IDE for Data Science.

Home Page: https://www.orchest.io


Website | Docs | Quickstart


Join us on Slack

Orchest is a browser-based IDE for Data Science. It integrates your favorite Data Science tools out of the box, so you don’t have to. The application is easy to use and can run on your laptop as well as on a large-scale cloud cluster.

[Demo: orchest-0.3.0-demo] A preview of creating pipelines in Orchest. Watch the full video to learn more.

Features

For a complete list of Orchest's features, check out the overview in our docs!

  • Visually construct pipelines.
  • Run any subset of a pipeline directly or on a cron-like schedule.
  • Parametrize your data science pipelines to try out different modeling ideas.
  • Easily define your custom runtime environment that runs on any machine.
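As a rough illustration of what a "cron-like schedule" means, the sketch below splits a standard five-field cron expression into its named fields. The helper `describe_cron` is hypothetical and is not part of Orchest. Whether Orchest accepts exactly this five-field syntax is an assumption, so check the docs before relying on it.

```shell
# Hypothetical helper (not part of Orchest): label the five fields of a
# standard cron expression. Assumes the classic crontab field order.
describe_cron() {
  # Read from a here-document so that "*" is not expanded as a glob.
  read -r minute hour dom month dow <<EOF
$1
EOF
  printf 'minute=%s hour=%s day-of-month=%s month=%s day-of-week=%s\n' \
    "$minute" "$hour" "$dom" "$month" "$dow"
}

# Example: 06:00 on weekdays (Monday to Friday).
describe_cron "0 6 * * 1-5"
```

In classic crontab syntax, this example expression would trigger a run at 06:00 every Monday through Friday.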

Who should use Orchest?

  • Data Scientists who want to rapidly prototype.
  • Data Scientists who like to work in Notebooks.
  • Data Scientists who are looking to create pipelines through a visual interface instead of YAML.

Installation

NOTE: Orchest is in alpha.

For GPU support, language dependencies other than Python, and other installation methods, such as building from source, please refer to our installation docs.

Requirements

  • Docker

If you do not yet have Docker installed, please visit https://docs.docker.com/get-docker/.
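Before installing, it can help to confirm that the required tools are on your PATH. The following is a minimal sketch, not part of the Orchest CLI: `have_cmd` and `check_prereqs` are hypothetical helper names, and `git` is included only because the install steps clone the repository.

```shell
# Hypothetical helper: succeed if the given command is found on PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Report the status of each tool the install steps rely on.
check_prereqs() {
  for cmd in git docker; do
    if have_cmd "$cmd"; then
      echo "ok: $cmd"
    else
      echo "missing: $cmd"
    fi
  done
}

check_prereqs
```

Finding `docker` on PATH does not guarantee that the Docker daemon is running. Running `docker info` afterwards is one way to check that.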

NOTE: On Windows, Docker has to be configured to use WSL 2. Make sure to clone Orchest inside the Linux environment. For more info and installation steps for Docker with WSL 2 backend, please visit https://docs.docker.com/docker-for-windows/wsl/.
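To check that you are about to clone inside the Linux environment rather than onto a Windows drive, a heuristic like the one below may help. It is only a sketch: `on_windows_mount` is a hypothetical helper, and it assumes WSL's default layout where Windows drives are mounted under /mnt/<drive letter> (this can be changed in wsl.conf).

```shell
# Hypothetical helper: succeed when a path lies under a Windows drive mount.
# Assumes the default WSL mount layout (/mnt/c, /mnt/d, ...).
on_windows_mount() {
  case "$1" in
    /mnt/[a-z]|/mnt/[a-z]/*) return 0 ;;
    *) return 1 ;;
  esac
}

if on_windows_mount "$PWD"; then
  echo "warning: $PWD is on a Windows drive; clone inside the Linux filesystem instead (for example under \$HOME)"
else
  echo "ok: $PWD is not under /mnt/<drive>"
fi
```

On a non-WSL Linux machine, a directory that happens to live under /mnt/<letter> would also trigger the warning, so treat the result as a hint only.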

Linux, macOS and Windows

git clone https://github.com/orchest/orchest.git && cd orchest
./orchest install

# Verify the installation.
./orchest --help

# Start Orchest.
./orchest start

Now that you have installed Orchest, get started with our quickstart tutorial, check out pipelines made by your fellow users, or have a look at our knowledge base videos explaining and showing some of Orchest's core concepts.

License

The software in this repository is licensed as follows:

  • All content residing under the "orchest-sdk/" directory of this repository is licensed under the "Apache-2.0" license as defined in "orchest-sdk/LICENSE".
  • Content outside of the above-mentioned directory is available under the "AGPL-3.0" license.

We love your feedback

We would love to hear what you think and add features based on your ideas. Come chat with us on our Slack channel or open an issue on GitHub.

Contributing

Contributions are more than welcome! Please see our contributor guides for more details.

Not sure where to start? Book a free, no-pressure pairing session with one of our core contributors.

License: GNU Affero General Public License v3.0


Languages

  • Python 61.2%
  • JavaScript 32.0%
  • SCSS 2.8%
  • Shell 2.2%
  • Dockerfile 0.9%
  • TypeScript 0.6%
  • CSS 0.1%
  • Mako 0.1%
  • HTML 0.1%
  • Jupyter Notebook 0.1%