Luca Canali (LucaCanali)

LucaCanali

Geek Repo

Location:Geneva, Switzerland

Home Page:https://cern.ch/canali

Twitter:@LucaCanaliDB

Github PK Tool:Github PK Tool


Organizations
cerndb

Luca Canali's repositories

sparkMeasure

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.

Language:ScalaLicense:Apache-2.0Stargazers:678Issues:34Issues:40

Miscellaneous

Includes notes on using Apache Spark in general, notes on using Spark for Physics, how to run TPCDS on PySpark, how to create histograms with Spark, tools for performance testing CPUs, Jupyter notebooks examples for Spark, examples for Oracle and other DB systems.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:410Issues:25Issues:6

Linux_tracing_scripts

Scripts and tools for troubleshooting and performance analysis in Linux. This includes dynamic tracing scripts with SystemTap both for system calls and for userspace function tracing.

Language:PythonLicense:Apache-2.0Stargazers:129Issues:22Issues:0

Oracle_DBA_scripts

A collection of old-school CLI scripts for Oracle RDBMS monitoring and performance troubleshooting.

Language:PLSQLLicense:GPL-2.0Stargazers:56Issues:9Issues:0

PyLatencyMap

PyLatencyMap is a tool for heat map visualization on the CLI. It is integrated with scrips to collect and visualize I/O latency heat maps from various sources, including SystemTap, DTrace, Oracle wait events, NetApp filers, trace files.

Language:PythonLicense:GPL-3.0Stargazers:33Issues:10Issues:0

PerfSheet4

PerfSheet4 is a tool for performance troubleshooting of Oracle databases. Query and visualize Oracle AWR data using pivot charts.

License:GPL-3.0Stargazers:16Issues:9Issues:0

Stack_Profiling

Tools and scripts for stack profiling: Userspace, Kernel, OS state and optionally Oracle wait

Language:CLicense:GPL-3.0Stargazers:14Issues:8Issues:0

PerfSheet.js

PerfSheet.js is a tool for Oracle RDBMS performance troubleshooting. Use it to extract and visualize Oracle AWR time series data in the browser using JavaScript and dynamic pivot charts.

Language:HTMLLicense:GPL-3.0Stargazers:11Issues:12Issues:0

ipython-sql

%%sql magic for IPython, hopefully evolving into full SQL client

Language:PythonStargazers:8Issues:6Issues:0

OraLatencyMap

OraLatencyMap is a performance widget running on SQL*plus (Oracle's CLI) to collect and visualize latency histograms for Oracle wait events using heat maps.

Language:PLSQLLicense:GPL-2.0Stargazers:3Issues:6Issues:0
Language:ScalaLicense:Apache-2.0Stargazers:2Issues:3Issues:0

hadoop

Fork of Apache Hadoop, used to work on S3A and HDFS instrumentation

Language:JavaLicense:Apache-2.0Stargazers:1Issues:2Issues:0

bcc

BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more

Language:PythonLicense:Apache-2.0Stargazers:0Issues:4Issues:0

dist-keras

Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:3Issues:0

gallery

A set of examples for CERN SWAN a Service for Web based ANalysis

Language:ShellLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

hbase-connectors

Apache HBase Connectors

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

jupyter-extensions

Jupyter extensions for SWAN

Language:JavaScriptLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

jupyterhub-extensions

Customized components of the Jupyterhub server in SWAN (handlers, spawners, templates).

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

oci-hdfs-connector

HDFS Connector for Oracle Cloud Infrastructure

Language:JavaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

SLOB_2.5.4

Official SLOB distribution for version 2.5.4.0

License:NOASSERTIONStargazers:0Issues:1Issues:0

SLOB_distribution

A Git repository used only for distributing the official SLOB release.

License:NOASSERTIONStargazers:0Issues:0Issues:0

spark

Mirror of Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:3Issues:0

spark-root

Apache Spark Data Source for ROOT File Format

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

SparkDLTrigger

Notebooks with code and sample data for the blog article: "Machine Learning Pipelines for High Energy Physics Using Apache Spark with BigDL and Analytics Zoo"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

sparkmonitor

Monitor Apache Spark from Jupyter Notebook

Language:JavaScriptLicense:LGPL-2.1Stargazers:0Issues:3Issues:0

tf-spawner

spawn workers for tensorflow MultiWorkerMirroredStrategy

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0