There are 2 repositories under avro topic.
Apache Avro is a data serialization system.
Record Query - A tool for doing record analysis and transformation
Confluent Schema Registry for Kafka
Apache Kafka and Confluent Platform examples and demos
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
pmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
What's in your data? Extract schema, statistics and entities from datasets
[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Avro schema generation and serialization / deserialization for Scala
Benchmark comparing various data serialization libraries (thrift, protobuf etc.) for C++
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Iceberg is a table format for large, slow-moving tabular data
Command Line Tool for managing Apache Kafka
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
A tool for data sampling, data generation, and data diffing
Flexible, Fast & Compact Serialization with RPC
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Mu (μ) is a purely functional framework for building micro services.
Uber-project for standard Jackson binary format backends: avro, cbor, ion, protobuf, smile
MongoDB Kafka Connector
Lightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
A Gradle plugin to allow easily performing Java code generation for Apache Avro. It supports JSON schema declaration files, JSON protocol declaration files, and Avro IDL files.
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
⛈️ RumbleDB 1.18.0 "Scarlet Ixora" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Golang Client for Schema Registry
**Unofficial / Community** Kafka Connect MongoDB Sink Connector - Find the official MongoDB Kafka Connector here: https://www.mongodb.com/kafka-connector
:sunny: A tool for validating data using JSON Schema and converting JSON Schema documents into different data-interchange formats
Scala library to eliminate boilerplate