Pratik Khandelwal's repositories
Business-Intelligence-and-Data-Warehousing
Business Intelligence and Data Warehousing Project
Hotel-Reservation-System-Database-
Designed and implemented database schema for hotel reservation system. Identified key business metrics for the system and constructed complex SQL queries to retrieve desired information. Documented the entire system by developing comprehensive UML diagrams.
Credit-Card-Fraud
Identify fraudulent credit card transactions so that customers are not charged for items that they did not purchase. (Python, Logistic Regression Classifier, Unbalanced dataset).
SF-Crime-Visualization
Analyzed and visualize criminal incidents across 10 police districts of San Francisco. (Python, Tabealu: Story, Dashboard, Actions, Maps)
Zomato-Tableau-Visualization
Zomato is an Indian restaurant search and discovery service, operates in 24 countries. It provides information and reviews of restaurants, including images of menus. In Tableau I have created visualization based on their data.
Airbnb-newuser
Marketing Analytics (Python, Random Forrest): Predicting destination country of Airbnb users with various models such as Random Forrest & logistic regression. Performed exploratory analysis of the data-set.
Algorithms
Algorithms
Buy-online-and-pick-up-in-store-Startegy
Impact of BOPS(Buy Online and Pick up in Store) strategy on online sales & return, consumer purchase & return behavior.
Detect-Target-Customers
Identity customers who have high probability to purchase protection plan (Geek Squad Protection Plan)
Google-App-Store
Google App Store Data - Application Data Analysis
IEEE-Fraud-Detection
Kaggle Competition (https://www.kaggle.com/c/ieee-fraud-detection)
Machine-Learning-Bias
Identify bias in ML algorithm and reduce the same using different techniques.
People-Analytics-Developer-Salaries
Identify important parameters that will help a developer to get salary higher in- their respective country? (Python: Pandas, Scikit-Learn, Seaborn, K-Means Clustering, Decision Tree)
Recommendation-Engine
Work in Progress
T-Test-in-Pyspark
Conduct t-test between two sets of biosets (i.e. A & B) with the help of spark transformations, dataframes and user defined functions.
Topic-Modelling
Topic Modelling
Tweets-Sentiment-Analysis
Sentiment Analysis is done on the word 'Bandersnatch' to gauge latest public sentiment (via tweets) about the movie.
Word_count_pyspark
Using Spark count the number of words used in BIBLE. Print the top 5 words along with their frequencies. Also, print the number of unique words with length of 4,5,6,7 and 8.