marcosschroh / dc-avro

Dataclasses Avro Schema CLI

Home Page:https://marcosschroh.github.io/dc-avro/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Dataclasses Avro Schema CLI

Command line interface from dataclasses-avroschema to work with avsc resources

Tests GitHub license codecov python version

Requirements

python 3.8+

Documentation

https://marcosschroh.github.io/dc-avro/

Usage

You can validate one avro schema either from a local file or url:

Assuming that we have a local file schema.avsc that contains an avro schema, we can check whether it is valid

dc-avro validate-schema --path schema.avsc

Valid schema!! πŸ‘ 

{
    'type': 'record',
    'name': 'UserAdvance',
    'fields': [
        {'name': 'name', 'type': 'string'},
        {'name': 'age', 'type': 'long'},
        {'name': 'pets', 'type': {'type': 'array', 'items': 'string', 'name': 'pet'}},
        {'name': 'accounts', 'type': {'type': 'map', 'values': 'long', 'name': 'account'}},
        {'name': 'favorite_colors', 'type': {'type': 'enum', 'name': 'FavoriteColor', 'symbols': ['BLUE', 'YELLOW', 'GREEN']}},
        {'name': 'has_car', 'type': 'boolean', 'default': False},
        {'name': 'country', 'type': 'string', 'default': 'Argentina'},
        {'name': 'address', 'type': ['null', 'string'], 'default': None},
        {'name': 'md5', 'type': {'type': 'fixed', 'name': 'md5', 'size': 16}}
    ]
}

You can validate several .avsc files with lint command

dc-avro lint tests/schemas/example.avsc tests/schemas/example_v2.avsc

πŸ‘ Total valid schemas: 2
tests/schemas/example.avsc
tests/schemas/example_v2.avsc

To see all the commands execute dc-avro --help

Usage in pre-commit

Add the following lines to your .pre-commit-config.yaml file to enable avro schemas linting

  - repo: https://github.com/marcosschroh/dc-avro.git
    rev: 0.7.0
    hooks:
      - id: lint-avsc
        additional_dependencies: [typing_extensions]

Features

  • Validate schema
  • Lint schemas
  • Generate models from schemas
  • Data deserialization with schema
  • Data serialization with schema
  • View diff between schemas
  • Generate fake data from schema

Development

  1. Install requirements: poetry install
  2. Code linting: ./scripts/format
  3. Run tests: ./scripts/test

About

Dataclasses Avro Schema CLI

https://marcosschroh.github.io/dc-avro/

License:MIT License


Languages

Language:Python 96.4%Language:Shell 3.6%