MeltanoLabs / target-jsonl-blob

JSONL Singer target for local storage, S3, GCS and Azure Blob Storage

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

target-jsonl-blob

JSONL Singer target for local storage, S3, GCS and Azure Blob Storage.

Installation

To install this Singer tap, you can download a prebuilt binary, or you can build it from source.

Configuration

Setting Required Default Description
bucket Yes - Blob storage bucket URL
key_template No {{.StreamName}}.jsonl Template string for file keys

Bucket URLs

Storage Example URL Supported URL parameters
local file:///path/to/directory See supported parameters
S3 s3://my-bucket See supported parameters
Azure azblob://my-container See supported parameters
GCS gs://my-bucket See supported parameters

Available fields for key_template

  • StreamName
  • Date (YYYY-MM-DD)
  • TimestampSeconds
  • Minute
  • Hour
  • Day
  • Month
  • Year

Example: {{.StreamName}}/{{.Year}}/{{.Month}}/{{.Day}}/{{.Hour}}/{{.Minute}}/{{.StreamName}}.jsonl

Build from source

go build -o target-jsonl-blob

Usage with Meltano

  1. Download the appropriate asset
  2. Add a custom Meltano plugin to your project
  3. Run a pipeline

1. Download the appropriate asset

You can see the full list of assets in the release page: https://github.com/MeltanoLabs/target-jsonl-blob/releases/latest.

The gh tool makes downloading an asset easy:

gh release download v0.0.6 \
  -R MeltanoLabs/target-jsonl-blob \
  -p '*darwin_amd64*' \
  --clobber

tar -xvf target-jsonl-blob_0.0.6_darwin_amd64.tar.gz target-jsonl-blob

You can also install with Homebrew:

brew tap MeltanoLabs/target-jsonl-blob https://github.com/MeltanoLabs/target-jsonl-blob
brew install target-jsonl-blob

2. Add a custom Meltano plugin to your project

# meltano.yml
plugins:
  loaders:
  - name: target-jsonl-blob
    namespace: target_jsonl_blob
    executable: ./target-jsonl-blob
    settings:
    - name: bucket
      label: Bucket
      description: Target directory (local, S3, Azure Blob)
    - name: key_template
      label: Key Template
      description: Template string for file keys
    config:
      bucket: file://./output/my-bucket
      key_template: $MELTANO_EXTRACTOR_NAMESPACE/{{.StreamName}}.jsonl

You also need to ensure the local "bucket" exists:

mkdir output/my-bucket

3. Run a pipeline

meltano run tap-github target-jsonl-blob

About

JSONL Singer target for local storage, S3, GCS and Azure Blob Storage

License:Apache License 2.0


Languages

Language:Go 88.6%Language:Ruby 11.4%