Dataminded (datamindedbe)

Dataminded

datamindedbe

Geek Repo

Location:Belgium

Home Page:https://dataminded.com

Github PK Tool:Github PK Tool

Dataminded's repositories

lighthouse

Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.

Language:ScalaLicense:Apache-2.0Stargazers:60Issues:27Issues:3

blog-tpcds-dbt-duckdb

This repository contains the tpcds queries together with the code required to run this benchmark for dbt and duckdb

Language:HCLStargazers:17Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:5Issues:0Issues:0

conveyor-samples

Samples on how to use Conveyor.

Language:Jupyter NotebookStargazers:3Issues:4Issues:0

iceberg-ingestion

Public repository containing sample code for how to improve ETL ingestion processes with Apache Iceberg

Language:PythonStargazers:3Issues:15Issues:0

blog-platform-quack-quack-ka-ching

The duck escapes with the credits.

Stargazers:2Issues:0Issues:0

homebrew-conveyor-formulas

Brew tap repository for Conveyor

Language:PythonStargazers:1Issues:2Issues:0
Language:ShellStargazers:1Issues:2Issues:0

conveyor-templates

Cookiecutter templates used by Conveyor.

Language:PythonLicense:MITStargazers:1Issues:4Issues:2
Stargazers:0Issues:5Issues:0

aws-glue-data-catalog-client-for-apache-hive-metastore

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

conveyor-dbt-devenv

Repository to show the use of dev environments in the context of dbt

Language:DockerfileStargazers:0Issues:0Issues:0

dbt-conveyor-snowflake

The Conveyor Snowflake adapter is a thin shell around the Snowflake adapter to allow authenticating users in Conveyor IDE's with Snowflake to run DBT projects

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dbt-playground

Try out dbt in a Gitpod environment in one click, with a Postgres database pre-configured

License:Apache-2.0Stargazers:0Issues:0Issues:0

ecr-mirror

Mirror public repositories to internal ECR repos

Language:PythonStargazers:0Issues:2Issues:0

eks-spark-benchmark

Performance optimization for Spark running on Kubernetes

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

git-credential-oauth

A Git credential helper that securely authenticates to GitHub, GitLab and BitBucket using OAuth.

Language:GoLicense:Apache-2.0Stargazers:0Issues:1Issues:0

iris

Artifacts related to a training on running stream processing pipelines

Language:KotlinLicense:MITStargazers:0Issues:2Issues:0
Language:DockerfileStargazers:0Issues:4Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

terraform-aws-eks

Terraform module to create an Elastic Kubernetes (EKS) cluster and associated resources 🇺🇦

Language:HCLLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:0Issues:0