Arvind Venkatasubramanian's repositories
Lucene-Document-Search
A simple Java project that performs a word search across a directory of documents. It handles multiple document types, including PDF, TXT, and XML.
Big-Data-Technologies-Implementations
This repo works with the GroupLens data to solve simple problems using different applications in the Hadoop ecosystem.
Credit-card-fraud-Detection
Credit card fraud detection for any transaction. Used a dataset to create a 'credit score' for each customer. Tried out deep learning libraries for effective prediction.
CtCI-6th-Edition
Cracking the Coding Interview 6th Ed. Solutions
Elastic-Computing-Simulation
A simple Java data structures project that simulates the request-and-response workings of an elastic computing application, using Java Swing.
FlyHigh
A flight fare prediction system using Big Data engineering. Based on the application 'Hopper', this model helps identify the cheapest date to book a ticket, known as the date of Pricefall. It also helps predict the cheapest date to travel, within a window of roughly plus or minus 5 days.
Python-AWS-application-using-Flask
This application connects to a PostgreSQL instance on AWS. Once the data had been stored in the database, the application was tested and deployed on an AWS server.
Python-Projects
This repository contains some simple applications built using Python. The intention of this repo is to build the applications with clean code by following good coding practices and a good delivery structure.
Search-Engine-Optimizer
This project aims to create a simple web crawler in Python. From the raw HTML obtained by crawling a webpage, the relevant text is extracted and returned as a list.
Data-Structures-and-Problems
A simple list of assignment problems solved as part of the Data Structures and Algorithms curriculum. We used different approaches to solve real-time problems.
hadoopecosystemtable.github.io
This page is a summary to keep track of Hadoop-related projects and relevant projects around the Big Data scene, focused on the open source, free software environment.
java-interviews
A collection of Java interview questions and answers to them
PrudentialChallenge
A Kaggle competition entry that tries to understand the Prudential insurance data. The model helps perform risk analysis for customers and predict each customer's eligibility.
Python-For-Beginners
Coursera course on Python for beginners - Specialization
Semantic-Analysis-on-Amazon-products-using-NLP
This project uses Natural Language Processing (NLP) to score and rate Amazon product reviews. After scraping the data from the Amazon website, I used NLP to try to create an algorithm to score the reviews.
system-design-interview
System design interview for IT companies
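
The text-extraction step described under Search-Engine-Optimizer (raw HTML in, a list of relevant text out) might look roughly like the sketch below. It is a minimal illustration using only Python's standard html.parser module, not the repository's actual code. The names TextExtractor and extract_text are hypothetical, and treating everything outside script and style tags as "relevant" text is an assumption.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text fragments from raw HTML (hypothetical helper)."""

    # Assumption: content inside these tags is never "relevant" text.
    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.fragments = []
        self._skip_depth = 0  # > 0 while inside a skipped tag

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text that is not inside a skipped tag.
        text = data.strip()
        if text and self._skip_depth == 0:
            self.fragments.append(text)


def extract_text(raw_html):
    """Return the text fragments found in raw_html as a list of strings."""
    parser = TextExtractor()
    parser.feed(raw_html)
    parser.close()  # flush any buffered trailing text
    return parser.fragments


if __name__ == "__main__":
    sample = "<html><body><h1>Title</h1><p>Hello <b>world</b></p></body></html>"
    print(extract_text(sample))
```

Fetching the page itself (for example with urllib.request) is left out here so the sketch stays focused on the extraction step.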