zillow / intake-hive

Intake plugin to read and write to Hive

Welcome to the Intake Hive plugin

This Intake plugin reads from and writes to Hive tables.

Example where the Hive table is user_events_hive partitioned by userid:

sources:
  user_events_hive:
    driver: hive
    args:
      urlpath: 'user_events_yaml_catalog?userid={{userid}}'
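
import pandas as pd
import intake

# Path to the YAML catalog shown above (file name is an assumption)
catalog_path = "catalog.yml"
catalog = intake.open_catalog(catalog_path)

# Reads partition userid=42 of the user_events_hive source
pandas_df: pd.DataFrame = catalog.user_events_hive(userid="42").read()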
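
The urlpath in the catalog entry above looks like a table name followed by a query-string style partition filter. As a rough illustration of how such a string could be taken apart, here is a minimal sketch using only the Python standard library. The helper name split_urlpath is hypothetical and this is not the plugin's actual implementation; the template substitution is also simplified to a plain string replace.

```python
from typing import Dict, Tuple
from urllib.parse import parse_qsl, urlsplit


def split_urlpath(urlpath: str) -> Tuple[str, Dict[str, str]]:
    """Split 'table?key=value' into a table name and partition filters.

    Hypothetical helper for illustration only; not part of intake-hive.
    """
    parts = urlsplit(urlpath)
    return parts.path, dict(parse_qsl(parts.query))


# Simplified stand-in for rendering the {{userid}} template with userid=42
rendered = "user_events_yaml_catalog?userid={{userid}}".replace("{{userid}}", "42")
table, partitions = split_urlpath(rendered)
print(table, partitions)
```

Under these assumptions, the table name and the partition key/value pair come out as separate values, which is the information a reader would need to select a single partition.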

License: Apache License 2.0