ven2day / datahub-docker-compose

Getting started with LinkedIn's DataHub project on compose

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DataHub docker-compose

Getting started with LinkedIn's DataHub project on a all-in-one compose configuration

Homepage you'll be getting

Running DataHub

Raise your host's ulimits for ElasticSearch to handle high I/O :

# Persist this setting in `/etc/sysctl.conf` and execute `sysctl -p`
sudo sysctl -w vm.max_map_count=512000

Run the whole DataHub cluster :

# Edit env variables with desired credentials
cp .env.example .env

docker-compose up -d

Now, wait a bit for magic to happen !

Access DataHub on port 9002 with default username AND password datahub

Ingesting a dataset with metadata

Start injecting bootstrap_mce.json by running :

docker-compose -f ingest.docker-compose.yml run ingestion

About

Getting started with LinkedIn's DataHub project on compose


Languages

Language:Shell 100.0%