Svetlana's starred repositories
sql-server-samples
Azure Data SQL Samples - Official Microsoft GitHub Repository containing code samples for SQL Server, Azure SQL, Azure Synapse, and Azure SQL Edge
parquet-compatibility
compatibility tests to make sur C and Java implementations can read each other
spark-and-python-for-big-data-with-pyspark
Course on Udemy by Jose Portilla
ElasticSearch
Misc scripts for ElasticSearch
HiveToPhoenix
An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase
zeppelin-solr
Apache Solr interpreter for Apache Zeppelin
tldextract
Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).
spark-solr
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
mlcourse.ai
Open Machine Learning Course
flume-elasticsearch-sink
Flume sink plugin for Elasticsearch
FlumeElasticsearchRestSink
Flume elasticsearch REST sink
Python-programming-exercises
100+ Python challenging programming exercises
Machine-Learning-Based-Botnet-Detection
Machine Learning Based Botnet Detection is a tool to classify network traffic as being botnet affected or not based on the network traffic flows. It involves various classifiers including Neural Networks, Decision Tree, SVM, Naive Bayes, Logistic Regression, k-Nearest Neighbours.
sparklingpandas
Sparkling Pandas
TensorFlow-Time-Series-Examples
Time Series Prediction with tf.contrib.timeseries
cloud-dataproc
Cloud Dataproc: Samples and Utils
tensorflow_scala
TensorFlow API for the Scala Programming Language