tanaymukherjee

Tanay Mukherjee's repositories

A-B-Testing-in-R

A/B testing (or split-testing) is a randomized experiment with two variants A and B. It includes application of statistical hypothesis testing (or two-sample hypothesis testing), as used in the field of statistics. A/B testing is a way to compare two versions of a single variable, typically by testing a subject's response to variant A against variant B, and determining which of the two variants is more effective.

Language:R1 20

Dissecting-Yelp-Dataset

This dataset is a subset of Yelp's businesses, reviews, and user data. It was originally put together for the Yelp Dataset Challenge which is a chance for students to conduct research or analysis on Yelp's data and share their discoveries. In the dataset you'll find information about businesses across 11 metropolitan areas in four countries.

Language:Jupyter Notebook1 30

Investigating-NYC-Parking-Violations

For this project, we will analyze millions of NYC Parking violations since January 2016

Language:Python1 20

Shapley-Value

Language:Jupyter NotebookMIT1 20

Spoken-Language-Processing-in-Python

Language:Jupyter Notebook1 20

Deep-Learning-with-PyTorch

PyTorch is an open source machine learning library based on the Torch library, used for applications such as computer vision and natural language processing, primarily developed by Facebook's AI Research lab. It is free and open-source software released under the Modified BSD license.

Language:Jupyter Notebook020

Natural-Language-Processing

Natural language processing is a subfield of linguistics, computer science, information engineering, and artificial intelligence concerned with the interactions between computers and human languages, in particular how to program computers to process and analyze large amounts of natural language data.

Language:Jupyter Notebook000

CIS_9440_Project_YouTube-and-Netflix-Viewership-Analysis

This is a repository to put together all the work for the final project from CIS 9440 - Data Warehousing and Analytics

Language:Jupyter NotebookMIT020

Data-Science-Hacks-in-Python-Part-2

Simple hacks to speed up your Data Analysis

Language:Jupyter Notebook010

Debugging-NY-Times-library

This is a web scrapping project and I am trying to gather info from NY Times using APIs

Language:Jupyter NotebookMIT020

Epileptic-Seizure-Recognition

Language:RMIT020

Flow-in-R

Language:R010

HackerRank-Challenges

Language:Jupyter NotebookMIT000

Humana-Mays-Healthcare-Analytics-Case-Competition-2020

Mays Business School in partnership with Humana presents the fourth annual Humana-Mays Healthcare Analytics Case Competition. The competition will be held virtually and offers an opportunity for U.S. masters students to showcase their analytical skills and solve a real-world business problems for Humana utilizing real data.

Language:Jupyter NotebookMIT000

Kaggle-Competition-Santander-Customer-Transaction-Prediction

https://www.kaggle.com/c/santander-customer-transaction-prediction

Language:Jupyter NotebookMIT010

Learning-Kafka

MIT000

Linear-Regression-in-SQL

In this exercise we will try to learn how can we implement linear regression just using SQL.

Language:TSQL010

Machine-Learning-Fall-2020

This repo includes all the work/assignments I did as part of my coursework in Fall 2020 under the subject code STA 9891 with Prof. Rad.

Language:RMIT020

ML-in-Bioinformatics

Bioinformatics is a subdiscipline of biology and computer science concerned with the acquisition, storage, analysis, and dissemination of biological data, most often DNA and amino acid sequences.

Language:Jupyter NotebookMPL-2.0000

Network-Analysis

The promise of network analysis is the placement of significance on the relationships between actors, rather than seeing actors as isolated entities. The emphasis on complexity, along with the creation of a variety of algorithms to measure various aspects of networks, makes network analysis a central tool for digital humanities.

Language:R010

NLP-Class-Fall-2020

Language:Jupyter NotebookMIT000

No-SQL-in-Python

Language:Jupyter NotebookMIT000

OOP-in-Python

Demystifying the world of object oriented programming in Python

Language:PythonMIT020

PB_Challenge_2021

In this exercise we are trying to predict that for given information can we predict whether a device will fail in next 7 days.

Language:Jupyter NotebookMIT020

Real-and-Fake-News-Analysis

Language:Jupyter NotebookMIT010

SQL-Exercise-2

In this exercise we will try to answer a specific data requirement.

Language:SQLPL000

Tableau-Dashboards

This repository is a showcase of all the tableau dashboards I have built so far.

020

tanaymukherjee

010

Useful-Python-libraries-for-Data-Science

In this repository, I am trying to compile some useful Python libraries for data science tasks other than the commonly used ones like pandas, scikit-learn, matplotlib, etc. My idea is to regularly update the kernel to include some awesome Python libraries which can real come in handy for the Data Analysis and Machine learning tasks.

Language:Jupyter Notebook000

Working-With-Python-Functions

Language:Jupyter NotebookMIT020