CERN Database and Analytics Group's repositories
spark-dashboard
Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.
SparkPlugins
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics systems with user-provided monitoring probes.
SparkDLTrigger
Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
grafana-mimir-cardinality-dashboards
Grafana Mimir dashboards used for cardinality exploration
sparkMeasure
This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics.
cern-sso-python
Python Re-implementation of the cern-get-sso-cookie functionality
SparkTraining
Material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/
tf-spawner
TF-Spawner is an experimental tool for running TensorFlow distributed training on Kubernetes clusters.
storage-api
Unified RESTful interface for managing CERNs data storage back-ends
hadoop-xrootd
Mirror of CERN db/hadoop-xrootd. Hadoop-XRootD Filesystem Connector
SparkExecutorPlugins2.4
Spark Executor Plugins Examples for Spark 2.4
netapp-api-python
A re-implementation of (parts of) NetApp's ZAPI in idiomatic Python using Requests
hbase-packet-inspector
Analyzes network traffic of HBase RegionServers
NotebooksExamples
This repository contains Jupyter notebook examples, intended to be linked with the SWAN Gallery
tomcat-sso-integration-components
Set of valves classes that helps CERN applications with the integration in the CERN Authentication
argo-helm
ArgoProj Helm Charts
binderhub
Run your code in the cloud, with technology so advanced, it feels like magic!
jdbc-connector-for-apache-kafka
Aiven's JDBC Sink and Source Connectors for Apache Kafka®
oci-hdfs-connector
HDFS Connector for Oracle Cloud Infrastructure
opentelemetry-collector-contrib
Contrib repository for the OpenTelemetry Collector
ords-config-image
This image generates configuration and war files for Oracle Rest DataServices based on data provided by dadEdit3 database.
rundeck-nomad-plugin
Rundeck plugin running jobs on Nomad cluster.