Ambar Chatterjee's repositories
ADM-HW2
This repository contains a comprehensive exploratory data analysis on a dataset about books and their authors. The analysis aims to extract insights about genres, authors, publication dates, ratings, and more. It also includes answers to research questions, bonus points, and AWS and Command Line Questions.
ADM_HW3_Group3
Code and analysis for building a search engine to retrieve and rank master's degrees. Implements data collection, preprocessing, inverted indexing, conjunctive queries, custom scoring, and map visualization.
ADM_HW4_Group3
This repository contains code and analysis for a homework assignment on recommendation systems and clustering algorithms in Python. Implements techniques like minhash, LSH, feature engineering, dimensionality reduction, K-means and DBSCAN clustering.
ADM_HW5_Group21
The "ADM_HW5_Group21" repository focuses on analyzing citation networks in academic research using graph analysis. It includes a Jupyter Notebook with homework solutions, Python scripts for backend and frontend functions, and GraphML files for graph structures.
Canoo_Research
Internship project analyzing Canoo's industry, competitors, and market trends using Google's Gemini Pro AI model API.
FDS_Final_Project
"FDS_Final_Project" focuses on predicting which passengers of Spaceship Titanic are transported to an alternate dimension after a spacetime anomaly collision, using data science techniques.
data-engineering-interview-questions
More than 2000+ Data engineer interview questions.