Karan Murthy's repositories
amazon-kinesis-analytics-taxi-consumer
Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, processes and aggregates them, and ingests the result to an Amazon Elasticsearch Service cluster for visualization with Kibana.
perception
Perceptual hashing tools for detecting child sexual abuse material
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
machine-learning-flask-example
This project demonstrates how to train and deploy a simple model. Using the Pima Indians Diabetes dataset, I create a model that predicts whether a patient has diabetes based on diagnostic measurements included in the dataset. Other applications can use this model through a single-route server. The project consists of two Python files: a model training script and the web server.
celery-practice
Small project to get started on Celery and RabbitMQ.
gatsby
⚛️📄🚀 Blazing fast static site generator for React
nlp-to-sql
SQLNet to convert natural language to SQL
language-translation-using-rnn
This repository contains code that will teach a neural network to translate from French to English.
computerscience
Free technical resources for faculty, students, and Microsoft developer advocates for use in computer science learning forums.
algorithms-practice
This repository consists of my accepted LeetCode, HackerRank, and CTCI solutions.
youtube-tag-recommender
On average, 30 million users visit YouTube every day, and approximately 5 billion videos are viewed daily. With about 300 hours of video content uploaded every minute, it’s not surprising that a video can get lost in limbo. With an ever-increasing number of YouTube uploaders, it’s of paramount importance to stay ahead of the game and adopt smart strategies to keep one’s videos relevant. That’s where tagging comes into the picture. Tags are one of the most important ways for an uploader to maximize viewership. A set of relevant tags can help YouTube’s search engine pick out such videos and place them much higher in the search results.
e-commerce-website
This is an e-commerce website built using Angular 4
fake-news-classification
The latest hot topic in the news is fake news and many are wondering what data scientists can do to detect it and stymie its viral spread.
tic-tac-toe-using-ai
This is a simple tic-tac-toe application with an AI that uses the minimax algorithm with alpha-beta pruning.
chatbot-using-artificial-intelligence
A chatbot is a computer program which conducts a conversation via auditory or textual methods. Such programs are often designed to convincingly simulate how a human would behave as a conversational partner, thereby passing the Turing test. Chatbots are typically used in dialog systems for various practical purposes including customer service or information acquisition. Some chatterbots use sophisticated natural language processing systems, but many simpler systems scan for keywords within the input, then pull a reply with the most matching keywords, or the most similar wording pattern, from a database.
web-crawler-python
A Web crawler, sometimes called a spider, is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing (web spidering).
Progresa_Data_Analysis
This is an in-depth analysis of the Progresa program, a government social assistance program in Mexico. This program, as well as the details of its impact, are described in the paper "School subsidies for the poor: evaluating the Mexican Progresa poverty program", by Paul Schultz (available on Canvas). The goal of this problem set is to implement some of the basic econometric techniques to measure the impact of Progresa on secondary school enrollment rates. The timeline of the program was: baseline survey conducted in 1997; intervention begins in 1998, with "Wave 1" of data collected in 1998; "Wave 2" of data collected in 1999; evaluation ends in 2000, at which point the control villages were treated. The data is the actual data collected to evaluate the impact of the Progresa program. In this file, each row corresponds to an observation taken for a given child for a given year. There are two years of data (1997 and 1998), and just under 40,000 children who are surveyed in each year.
nycflights13_analysis
This repository contains my analysis of the nycflights13 data set using Python libraries. It also aims to identify the associations between weather and flight delays in New York in the year 2013.
easy-application
400 software engineering companies that are easy to apply to
coding-interview-university
A complete computer science study plan to become a software engineer.
awesome-courses
📚 List of awesome university courses for learning Computer Science!
karanmurthy7.github.io
Personal Website
Hospital_Ranking_Using_R
The data used for this project is from the Hospital Compare website (http://hospitalcompare.hhs.gov) run by the U.S. Department of Health and Human Services. This project will rank the hospitals from each state in the U.S. according to their 30-day mortality rates for heart attack, heart failure and pneumonia.
CachedMeanInR
This project will compute the mean value of a list of numbers using caching in R