Sampath Sree's repositories
Page_Rank_WikiPedia
Running PageRank Algorithm on Wikipedia Data Set
TF_IDF_HADOOP
Term Frequency - Inverse Document Frequency is used Information Retrieval. This is implemented in distributed computing environment using Apache HADOOP.
data-science-my-way
Few Data Science projects & competitions attempted
data-science-your-way
Ways of doing Data Science Engineering and Machine Learning in R and Python
kafka-spark-avro-example
Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry
Multiple_Linear_Regression_SPARK
Implementation of multiple linear regression using Spark. Used closed form expression for the ordinary least squares estimate of the linear regression coefficients computed using summation
dockerjenkins_tutorial
A repository for items learned in my Getting Started with Jenkins and Docker tutorial series
elastic4s
Elasticsearch Scala Client - Non Blocking, Type Safe, Reactive HTTP Client
example-spark
Spark, Spark Streaming and Spark SQL unit testing strategies
example-spark-kafka
Apache Spark and Apache Kafka integration example
FutureNiners
Project for Database Systems course
github-slideshow
A robot powered training repository :robot:
reactjs.org
The React documentation website
sampathsree.github.io
Personal Portfolio
shmack-utils
Spark, Hadoop, Mesos, Akka, Cassandra and Kafka Utils
Sortings_Comparison
Implemented Comparison-based sorting algorithms and observed their performance for different data sets.
spark-bench
Benchmark Suite for Apache Spark
spark2.0-examples
Examples of Spark 2.0
SparkBuildExamples
Example projects for using Spark and Cassandra With DSE Analytics
SparkInternals
Notes talking about the design and implementation of Apache Spark