Yacine's repositories
ESGI-spark-streaming
Ressource pour le cours spark Streaming S2 ESGI
cloudera-deploy
A general purpose framework for automating Cloudera Products
cloudera-playbook
Cloudera deployment automation with Ansible
cloudera-scripts-for-log4j
Scripts for addressing log4j zero day security issue
cloudera_upgrade_utils
Various tools to help plan HDP and CDH upgrades to CDP
ESGI-spark-core
Official Repository for IABD2 - Spark Core
gpt-3
GPT-3: Language Models are Few-Shot Learners
hash-prospector
Automated integer hash function discovery
hdp
hdp admin cluster scripts
hdp3_upgrade_utils
Assist with HDP 3 Upgrade Planning
hms-mirror
Copy Hive tables definitions to Compute Cluster, while still using Storage on original cluster
homebridge-own
Homebridge Plugin for OpenWebNet standard
javakeystore
A simple role for setting up Keystore and Truststore used in HDP
mastering-apache-spark-book
Mastering Apache Spark 2
people-timeusage-Apache-Spark
Develop nationally representative estimates of how people spend their time using Apache Spark. Using data from The American Time Use Survey
prereq-checks
Prerequisites checker for Cloudera Manager, CDH and CDP-DC installations
ranking-prog-language-wikipedia-spark
Ranking pupolar programming languages according to wikipedia articale using Apache Spark
ScalaExercises
Repository to help beginners to learn Scala
spark
Apache Spark
spark-elasticsearch
Write Hive Data To ElasticSearch in a kerberized Cluster
spark-solr
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
universalhashudf
hive UDF to hash integers and long
xsltjson
XSLTJSON - Convert XML to JSON using XSLT
ycsb
YCSB perf test phoenix & hbase