Jerome Banks (jeromebanks)

jeromebanks

Geek Repo

Company:Tatari.tv

Github PK Tool:Github PK Tool

Jerome Banks's repositories

brickhouse

Hive UDF's for the data warehouse

Language:JavaLicense:NOASSERTIONStargazers:10Issues:3Issues:0

experimental_bigdata-interop

Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:2Issues:0

satisfaction

The Next Generation Hadoop Scheduler

Language:ScalaLicense:Apache-2.0Stargazers:1Issues:2Issues:15

reair

ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

artemis-corpus-test-framework

A test framework for working with test corpora for unit tests.

Language:JavaStargazers:0Issues:1Issues:0

aws-glue-data-catalog-client-for-apache-hive-metastore

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

boilerpipe

Work in progress transmit from Google Code

Language:JavaLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Chat-with-Github-Repo

This repository contains two Python scripts that demonstrate how to create a chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloop's Deep Lake.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

classutil

Scala-friendly, fast class-finder library (using ASM under the covers)

Language:ScalaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

docker-spark-k8s-aws

Docker image for running Spark 3 on Kubernetes on AWS

Stargazers:0Issues:1Issues:0

document-api-python

Create and modify Tableau workbook and datasource files

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

experimental_spark-bigquery

Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

experimental_spark-bigquery-1

Google BigQuery support for Spark, SQL, and DataFrames

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

generalized-kmeans-clustering

This project generalizes the Spark MLLIB Batch and Streaming K-Means clusterers in every practical way.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

incubator-hivemall

Mirror of Apache Hivemall (incubating)

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

influxdb-java

Java client for InfluxDB

Language:JavaLicense:MITStargazers:0Issues:2Issues:0

js-murmur3-128

A JavaScript implementation of the 128bit variant of Murmur3 (that is compatible with Guava)

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

nutch

Apache Nutch

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

okhttp

An HTTP+HTTP/2 client for Android and Java applications.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

reactive-kafka

Reactive Streams API for Apache Kafka

Language:ScalaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

redshift-auto-schema

Redshift Auto Schema is a Python library that takes a delimited flat file or parquet file as input, parses it, and provides a variety of functions that allow for the creation and validation of tables within Amazon Redshift.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

sbt-google-cloud-storage

A SBT resolver and publisher for Google Cloud Storage

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

scala.rx

An experimental library for Functional Reactive Programming in Scala

Language:ScalaStargazers:0Issues:2Issues:0
Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

spark-glue

Spark releases with AWS Glue support

Language:DockerfileStargazers:0Issues:1Issues:0

spark-on-k8s-operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Language:GoLicense:Apache-2.0Stargazers:0Issues:1Issues:0

spark-on-kubernetes-docker

Spark on Kubernetes infrastructure Docker images repo

Language:ShellLicense:Apache-2.0Stargazers:0Issues:1Issues:0

spark-on-kubernetes-helm

Spark on Kubernetes infrastructure Helm charts repo

Language:HTMLStargazers:0Issues:1Issues:0

terrapin

Serving system for batch generated data sets

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0