Takeshi Yamamuro (maropu)

maropu

Geek Repo

Location:Tokyo/Japan

Home Page:https://twitter.com/maropu

Twitter:@maropu

Github PK Tool:Github PK Tool


Organizations
apache

Takeshi Yamamuro's repositories

spark-tpcds-datagen

All the things about TPC-DS in Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:98Issues:5Issues:8

spark-sql-flow-plugin

Visualize column-level data lineage in Spark SQL

Language:ScalaLicense:Apache-2.0Stargazers:79Issues:6Issues:4

spark-sql-server

Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol

Language:ScalaLicense:Apache-2.0Stargazers:34Issues:9Issues:23

datasketches-spark

Data Sketches for Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:20Issues:6Issues:0

spark-data-repair-plugin

Provide functionality to build statistical models to repair dirty tabular data in Spark

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:11Issues:4Issues:2

spark-query-log-plugin

A simple toolkit to analyze Spark query logs

Language:ScalaLicense:Apache-2.0Stargazers:9Issues:4Issues:1

fuzz-testing-for-spark

[WIP] Run SQL-aware fuzz tests for the Catalyst optimizer in Apache Spark

Language:C++License:Apache-2.0Stargazers:6Issues:3Issues:1

spark-graphx-pregel-personalized-pagerank

Personalized PageRank on Pregel/GraphX

Language:ScalaLicense:Apache-2.0Stargazers:4Issues:3Issues:0

mlflow-example

An example code for MLflow

Language:PythonStargazers:3Issues:3Issues:0

spark-executor-dict-plugin

Fast Read-only Data Dictionary Attached to Each Spark Executor

Language:ScalaLicense:Apache-2.0Stargazers:3Issues:4Issues:0

jupyterlab-dockerfile

A docker file for JupyterLab including pyspark

Language:PythonStargazers:2Issues:3Issues:0

jvmci-test

A toy box to test JVMCI in JDK11

Language:C++Stargazers:1Issues:3Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

equipartitioning-example

Equipartitioning in Spark

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

janino

Janino is a super-small, super-fast Java™ compiler.

Language:JavaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

LAMA

LAnguage Model Analysis

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0

lstm-crf-pytorch

LSTM-CRF in PyTorch

Language:PythonStargazers:0Issues:0Issues:0

lz4-java

LZ4 compression for Java

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0

neon

Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, branching, and bottomless storage.

Language:RustLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pg_stats_exporter

A PostgreSQL metrics exporter for Prometheus.

Language:RustLicense:Apache-2.0Stargazers:0Issues:2Issues:0

polars

Fast multi-threaded, hybrid-out-of-core DataFrame library in Rust | Python | Node.js

Language:RustLicense:MITStargazers:0Issues:1Issues:0

pydeps-neo4j

Exports Python package dependencies into Neo4j

Language:ShellLicense:Apache-2.0Stargazers:0Issues:2Issues:0

rag-postgres

A trial place for RAG with PostgreSQL resources

Language:HTMLStargazers:0Issues:0Issues:0

sedona

A cluster computing framework for processing large-scale geospatial data

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

spark

Mirror of Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

spark-tpcds-sf-1

TPC-DS queries with 1GB scale factor

Stargazers:0Issues:3Issues:0

spark-website

Mirror of Apache Spark Website

License:Apache-2.0Stargazers:0Issues:3Issues:0