whacked / sdflow

simple dataflow runner

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

sdflow -- schematized/simple data flow runner

tiny in--{command}-->out runner

you probably should use something else:

sdflow fits in a project that does small-scale, standardized, repeated data load and transformation.

I like make, but want an easier way of doing input/output hash validation. just also doesn't have built-in validation. task syntax is too verbose for these low-complexity targets. dvc is too cumbersome for repeat runs.

sdflow just puts these features together:

  • suitable for small number of targets, tiny syntax, simple dependency chains
  • built-in sha256 validation or generation for inputs/outputs, so the Sdflow.yaml file serves as the reference for data integrity.
  • built-in support for HTTP(S)/S3 sources as inputs

Anything else, you should use a more mature tool.

About

simple dataflow runner


Languages

Language:Go 87.1%Language:Nix 7.9%Language:Jsonnet 3.2%Language:Shell 1.8%