tanaymukherjee

Tanay Mukherjee's starred repositories

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

Language:PythonMIT29964 624 179

data-science-ipython-notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Language:PythonNOASSERTION27266 1614 41

machine-learning-interview

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

9392 215 4

scipy2018-geospatial-data

Language:Jupyter NotebookBSD-3-Clause333 34 15

my-awesome-AI-bookmarks

Curated list of my reads, implementations and core concepts of Artificial Intelligence, Deep Learning, Machine Learning by best folk in the world.

270 170

Complex-SQL-Exercise

SQL queries of all kind being put together as a single repository

500

Case-Study-Predicting-Bankruptcy

Based on available data from bank and parameters to identify the variables that influence the most, predict the bankruptcy of the given financial model

Language:R4 20

findjakes.com

This app is developed to help you locate the nearby public toilets and the direction to reach. This will also help Govt. to keep a track of all the public toilets as regular feedback will be generated from random citizens who in turn will guide the local Municipality to improve any infrastructure problem identified and keep the toilets clean. Also, the motive of building public toilets will be fulfilled. Additionally, for women this will be an added advantage as they can now know where the public toilets are, and use them whenever needed and may not rush for home

Language:CSS200

Dimensionality-Reduction

In statistics, machine learning, and information theory, dimensionality reduction or dimension reduction is the process of reducing the number of random variables under consideration by obtaining a set of principal variables. Approaches can be divided into feature selection and feature extraction.

Language:Jupyter Notebook200

Dissecting-Yelp-Dataset

This dataset is a subset of Yelp's businesses, reviews, and user data. It was originally put together for the Yelp Dataset Challenge which is a chance for students to conduct research or analysis on Yelp's data and share their discoveries. In the dataset you'll find information about businesses across 11 metropolitan areas in four countries.

Language:Jupyter Notebook2 30

Exploring-SQL-with-R

The idea is to use the SQL skills in R by converting data into relational database from text files and then using it to run queries to filter data by SQL

Language:R200

Google-Analytics-with-R

How to automate reporting suite from GA to R, so that one can pull data at will without even interacting with Google Analytics interface. There are various things one can do and we will cover each one of them.

Language:R200

A-B-Testing-in-R

A/B testing (or split-testing) is a randomized experiment with two variants A and B. It includes application of statistical hypothesis testing (or two-sample hypothesis testing), as used in the field of statistics. A/B testing is a way to compare two versions of a single variable, typically by testing a subject's response to variant A against variant B, and determining which of the two variants is more effective.

Language:R1 20

Analysing-NYC-Felony-Offenses-in-2019

In this exercise we will apply many of the multivariate statistics techniques on NYC felony dataset and see if their any association between the features.

Language:SAS100

Attribution-Modeling-in-R

Application on Markov Chain and Removal Effect (Attribution Modeling)

Language:R1 20

Bayesian-Data-Analysis-in-R

Bayesian A testing for Swedish Fish Incorporated

Language:R100

Building-your-own-chatbox

We will use NLTK(Natural Language Toolkit) to develop our own simple chatbox that will respond based on user queries using a defined corpus.

Language:Jupyter Notebook100

findjakes.com

This android app is developed to help you locate the nearby public toilets and the direction to reach. This will also help Govt. to keep a track of all the public toilets as regular feedback will be generated from random citizens who in turn will guide the local Municipality to improve any infrastructure problem identified and keep the toilets clean. Also, the motive of building public toilets will be fulfilled. Additionally, for women this will be an added advantage as they can now know where the public toilets are, and use them whenever needed and may not rush for home

Language:CSS1 20

Implementing-TrelliscopeJS-in-R

Trelliscopejs is an R package that brings faceted visualizations to life while plugging in to common analytical workflows like ggplot2 or the “tidyverse”.

Language:R100

Interactive-Sales-Dashboard-in-RShiny

Create a sales dashboard in R shiny that can be customized by users with some cool features and graphs

Language:R1 20

Investigating-NYC-Parking-Violations

For this project, we will analyze millions of NYC Parking violations since January 2016

Language:Python1 20

K-Means-Clustering-in-R-and-Python

k-means clustering is a method of vector quantization, originally from signal processing, that is popular for cluster analysis in data mining. k-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster.

Language:Jupyter Notebook100

Knapsack-Problem

Implementation of knapsack problem in Python

Language:Jupyter Notebook100

Machine-Learning-Digit-classifier-in-R

For any given image with a digit written on it (handwritten), we calculate the pixel and analyse the info to predict what digit it could be. This is a classic example of machine learning by using some train data to predict the info for a set of test data.

Language:R100

tanaymukherjee

Tanay Mukherjee's starred repositories

pytorch-tutorial

data-science-ipython-notebooks

machine-learning-interview

scipy2018-geospatial-data

my-awesome-AI-bookmarks

Complex-SQL-Exercise

Case-Study-Predicting-Bankruptcy

findjakes.com

Dimensionality-Reduction

Dissecting-Yelp-Dataset

Exploring-SQL-with-R

Google-Analytics-with-R

A-B-Testing-in-R

Analysing-NYC-Felony-Offenses-in-2019

Attribution-Modeling-in-R

Bayesian-Data-Analysis-in-R

Building-your-own-chatbox

findjakes.com

Implementing-TrelliscopeJS-in-R

Interactive-Sales-Dashboard-in-RShiny

Investigating-NYC-Parking-Violations

K-Means-Clustering-in-R-and-Python

Knapsack-Problem

Machine-Learning-Digit-classifier-in-R

MapReduce-Exercise

Mastering-SQL

Regression-analysis-of-pedestrains-data-from-New-York

Shortest-path-algorithm

Speech-Recognition-in-Python

Streaming-Finance-Data-with-AWS-Lambda