Nicha Ruchirawat's repositories
Collocations
N-gram Extraction Approaches (bigrams, trigrams)
topic_modeling
Understanding key topics discussed in text data via Latent Dirichlet Allocation, with optimization for human interpretability
article-recommender
Flask App to browse news articles and recommend similar articles.
ALS_expected_percent_rank_cv
Alternative cross-validation approach for ALS models with implicit ratings, using an expected percent ranking metric to evaluate model performance.
anomaly-detection-resources
Anomaly detection related books, papers, videos and toolboxes
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
DL-Recommender
Building a Recommender System via MF Embeddings and Deep Learning (PyTorch)
exploratory_data_analysis
Exploratory Data Analysis using R on US Air Traffic Data and Traffic Stops Data in North Carolina.
courseware
Homework will be published on Fridays
Data-Science--Cheat-Sheet
Cheat Sheets
Data-Science-Cheatsheet
A helpful 4-page data science cheatsheet to assist with exam reviews, interview prep, and anything in-between.
GHC19-Interpreting-ML-Models
Coding exercises for workshop on Breaking the Black Box: Interpreting ML Models
handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
lookalike-modelling
Finding customer lookalikes (similar customers) in a Big Data environment
mathematics-statistics-for-data-science
Mathematical & statistical topics for performing statistical analysis and tests: Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality Reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov-Smirnov), Robust Estimators (FastMCD) and more, in Python and R.
yelp-sentiment-prediction
Predicting sentiment of Yelp reviews using SVM and Logistic Regression models on Spark.