Ismail Simsek's repositories
iceberg-examples
Apache Iceberg Spark S3 examples
aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore-compatible metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-the-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms, such as other Hadoop and Apache Spark distributions.
bigquery-utils
Useful scripts, UDFs, views, and other utilities for migration and data warehouse operations in BigQuery.
coursera-machine-learning-engineering-for-prod-mlops-specialization
Programming assignments and quizzes from all courses within the Machine Learning Engineering for Production (MLOps) specialization offered by deeplearning.ai
debezium-connector-db2
An incubating Debezium connector for Db2
debezium-server
Debezium Server runtime for standalone execution of Debezium connectors
debezium-server-batch
Debezium Server batch consumers
debezium-server-bigquery
Debezium BigQuery consumer
debezium-server-iceberg
Replicates database CDC events to Iceberg tables
debezium.github.io
Source for the Debezium website. Please log issues in our tracker at https://issues.redhat.com/projects/DBZ/.
Diffusion-GAN
Official PyTorch implementation of the paper "Diffusion-GAN: Training GANs with Diffusion"
docker-images
Docker images for Debezium. Please log issues in our JIRA at https://issues.redhat.com/projects/DBZ/summary
iceberg
Apache Iceberg
jdbi
jdbi is designed to provide convenient tabular data access in Java, including templated SQL, parameterized and strongly typed queries, and Streams integration.
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
pyliquibase
Python wrapper for Liquibase.
python-odata
A simple library for read/write access to OData services
querydsl
Unified Queries for Java