dtrapezoid / cdm

Cassandra Dataset Manager

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Cassandra Data Manager

Tool for installing cassandra datasets. This is not a bulk loader. It is intended to be used as a tool for learning and demonstration purposes.

This repository contains the cdm tool only. Other repositories in this repo contain the actual datasets.

Installation

pip install cassandra-dataset-manager

Clone the repo. Using a visualenv:

python setup.py develop

The project is still under heavy development, a lot is changing very quickly.

Quickstart

Let's install the movielens-small dataset. It's a quick download at just a few MB and gives you a database you can play with.

cdm update
cdm install movielens-small

Options are all available at cdm help

I encourage you to read through the documentation.

Requesting a dataset

Please submit a GitHub issue requesting the dataset, with as much detail as possible. You should include the following:

  • Dataset URL
  • What you'd like to use the data for (so we can structure tables accordingly)

About

Cassandra Dataset Manager

License:Apache License 2.0


Languages

Language:Python 100.0%