Ayush Arora's repositories
java-netty-protobuf-chat-application
P2P Netty Chat application | A simple chat client and server with Netty with Protocol Buffers as chat wire format.
airflow-spark-stackoverflow-logs-analysis
Ask Ubuntu Logs analysis with PySpark on GCP | Pipeline with Airflow (Cloud Composer)
Distributed-File-System
POSIX compatible Distributed File System | Block Storage, Probabilistic Routing with Bloom Filter, Fault tolerance with Replication using Priority Queue
machine-learning-interview-enlightener
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
adtech-logs-processing-pipeline-dataflow
Adtech Logs processing Pipeline with Apache Beam, Cloud Dataflow, Java, Protocol Buffer. | Data Analysis with BigQuery
hadoop-stackoverflow-logs-analysis
Ask Ubuntu Logs analysis with Hadoop, MapReduce 2(Yarn) | https://ayusharora.me/stack-overflow-logs-hadoop-analysis/
pyspark-spatiotemporal-data-analysis
Report Link : https://ayusharora.me/projects/Project03%20Report.pdf
boto3-ec2-user-setup-skeleton
This project reads a YAML configuration file and set ups ec2 instances with EBS volumes and users to ssh into the machines
c-concepts
C Concepts - hiredis (C Redis Client)
go-data-structures-concepts
Go Data Structures and related concepts
java-concepts-design-pattern
Java Core Concepts & Design Pattern Implementation
k-means-clustering-implementation-sklearn
Custom implementation for k-means in python, comparison with the one in sklearn.
python-concepts
Concepts like Multithreading, Regex, Bit Manipulation, Functional Programming, asyncio, Text Justification, Serializaion
python-log-generation-simulator
A python based log logs generator that generates per seconds logs for 2000 servers with Python Multiprocessing module. (~172,800,000 logs / day) | Trie Based Logs Parser (WIP)
r-data-structure-concepts
Data Structures | Concepts - regex
scala-data-structure-concepts
Data Structure and concepts in Scala
system-design
High-level system design and object-oriented design
TravelHelper
Social hotel discovery platform for travelers to explore hotels based on crowd sourced reviews
twitter-hadoop-analysis
Twitter Data analysis with Hadoop | Multiline Record Reader : Read more than 1 lines at a time for processing.