Ricky Kim's repositories
efficient_frontier
Markowitz portfolio optimisation (efficient frontier) in Python
setiment_analysis_pyspark
Simple sentiment analysis model with PySpark
twitter_sentiment_analysis_part11
Twitter sentiment analysis part 11: Word2Vec with Convolutional Neural Network
twitter_sentiment_analysis_part2
Redefining data cleaning, preparation for visualisation
twitter_sentiment_analysis_part6
Twitter sentiment analysis part 6: Doc2Vec
twitter_sentiment_analysis_part4
Data split, feature extraction with count vectorizer
twitter_sentiment_analysis_part8
Twitter sentiment analysis part 8: Dimensionality reduction (chi-squared, PCA)
twitter_sentiment_analysis_part10
Twitter sentiment analysis part 10: Neural Networks with Doc2Vec, Word2Vec, GloVe
twitter_sentiment_analysis_part9
Twitter sentiment analysis part 9: Neural Networks with Tfidf vectors using Keras
pyspark_sa_gcp
PySpark Sentiment Analysis on Google Dataproc
twitter_sentiment_analysis_part3
Zipf's Law, text data visualisation
twitter_sentiment_analysis_part5
Twitter sentiment analysis part 5: Tfidf vectorizer, model comparison, lexical approach
twitter_sentiment_analysis_part7
Twitter sentiment analysis part 7: Phrase modeling + Doc2Vec
luigi_spotify
Get Spotify Discover Weekly emailed
flask_sparkml
Deploying PySpark ML Model on Google Compute Engine as a REST API
Secret_Santa
A small script for organising Secret Santa
bert_serving
Export BERT model for serving
data-engineering-practice
Data Engineering Practice Problems
getting-started
Getting started with Docker
workalendar
Worldwide holidays and workdays computational toolkit.