Albert Franzi (afranzi)

afranzi

Geek Repo

Company:AilyLabs

Location:Bilbao

Home Page:https://medium.com/albert-franzi

Github PK Tool:Github PK Tool

Albert Franzi's repositories

mlflow-workshop

First steps to interact with MLflow (mlflow.org)

Language:DockerfileLicense:MITStargazers:10Issues:1Issues:0

mini-data-platform

Mini Data Platform

Language:HCLLicense:Apache-2.0Stargazers:6Issues:0Issues:0

pytest-dbt-postgres

Unittest DBT Postgres projects

Language:PythonStargazers:3Issues:0Issues:0

airflow-aws-shared-secrets

SecretsManagerBackend with cross-account access

Language:PythonStargazers:1Issues:0Issues:0

awesome-spark

A curated list of awesome Apache Spark packages and resources.

License:CC0-1.0Stargazers:1Issues:0Issues:0

rabbitmq-poc

Notification System PoC with Delayed/Expired queues.

Language:ScalaStargazers:1Issues:0Issues:0

airbyte

Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

Language:JavaLicense:MITStargazers:0Issues:0Issues:0

aws-glue-data-catalog-client-for-apache-hive-metastore

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

data-access-layer

Library to facilitate accessing Data from Databricks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

datahub

A Generalized Metadata Search & Discovery Tool

Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

datahub-helm

Repository of helm charts for deploying DataHub on a Kubernetes cluster

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

License:Apache-2.0Stargazers:0Issues:0Issues:0

elementary

Open-source data observability for analytics engineers.

License:Apache-2.0Stargazers:0Issues:0Issues:0

helm-charts

Lightdash Community helm charts

Language:ShellStargazers:0Issues:0Issues:0

json-schema

JSON Schema validator for java, based on the org.json API

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

kafdrop

Kafka Web UI

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kafka-connect-field-and-time-partitioner

Kafka Connect Store Partitioner by a custom field and time

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

prefect

The easiest way to automate your data

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

prefect-poc

Prefect Flow Evaluation

Language:PythonStargazers:0Issues:0Issues:0

presto

Distributed SQL query engine for big data

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

quinn

pyspark methods to enhance developer productivity 📣 👯 🎉

Language:PythonStargazers:0Issues:0Issues:0

redis-poc

Notification System PoC with ZSETs using the time to send as Scores

Language:ScalaStargazers:0Issues:0Issues:0

rudderstack-helm

Open-source, warehouse-first Customer Data Pipeline and Segment-alternative. Collects and routes clickstream data and builds your customer data lake on your data warehouse.

License:MITStargazers:0Issues:0Issues:0

rust-efimer

PoC to try & learn Rust

Stargazers:0Issues:0Issues:0

scala-skeleton

Scala Skeleton

Language:ScalaStargazers:0Issues:0Issues:0

spark-daria

Essential Spark extensions and helper methods ✨😲

Language:ScalaLicense:MITStargazers:0Issues:0Issues:0

spark-json-schemas

Create Spark schemas using JSON-schemas

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

telegraf

The plugin-driven server agent for collecting & reporting metrics.

License:MITStargazers:0Issues:0Issues:0

thunderstruck

CDP based on ray.io

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0