Tiago Didoné (Didone)

Didone

Geek Repo

Company:compass.uol

Location:São Paulo

Home Page:https://www.linkedin.com/in/didoné

Github PK Tool:Github PK Tool


Organizations
doraproject

Tiago Didoné's repositories

0722-bootcamp-sql

Bootcamp SQL - Data engineer

Language:PythonLicense:MITStargazers:14Issues:9Issues:23

spark-glue

Spark env to Glue development

Language:Jupyter NotebookLicense:MITStargazers:9Issues:1Issues:0

csvms

Python module to manage CSV files like a DBMS application, with educational purposes

Language:PythonLicense:MITStargazers:2Issues:3Issues:0

FirehoseFunction

Firehouse lambda function

Language:PythonLicense:MITStargazers:2Issues:1Issues:0

spark-adl

Apache Spark 3.0 with Azure Data Lake Storage (ADL) support

Language:DockerfileStargazers:2Issues:3Issues:0

churn

Customer Churn Prediction with Amazon SageMaker Autopilot

Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

spark-carbon

Spark with carbon data for Huawei cloud

Language:DockerfileLicense:MITStargazers:1Issues:1Issues:3

aws-glue-data-catalog-client-for-apache-hive-metastore

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

aws-glue-libs

AWS Glue Libraries are additions and enhancements to Spark for ETL operations.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

sam-pyspark

Serverless PySpark

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

aws-lambda-container-image-converter

The AWS Lambda container image converter tool (img2lambda) repackages container images (such as Docker images) into AWS Lambda layers, and publishes them as new layer versions.

Language:GoLicense:MIT-0Stargazers:0Issues:1Issues:0

carbondata

Mirror of Apache CarbonData

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cdktf-remote-template-python-poetry

A terraform-cdk CLI template for Python projects using Poetry for dependency management

Language:JavaScriptLicense:MPL-2.0Stargazers:0Issues:0Issues:0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fastavro

Fast Avro for Python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hive

Apache Hive

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mo-sql-parsing

Let's make a SQL parser so we can provide a familiar interface to non-sql datastores!

Language:PythonLicense:MPL-2.0Stargazers:0Issues:1Issues:0

ply

Python Lex-Yacc

Language:PythonStargazers:0Issues:0Issues:0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0