Eveline Surbakti's repositories
Automated-Keywords-Extraction-of-Data-Analyst-Job-Descriptions-from-Indeed-using-NLP
Scraped job description and leveraged the concepts of Natural Language Processing (NLP) and GloVe Algorithm to extract the keywords through data and performed analysis. Presenting the vital keywords from data analyst job summary from the Indeed website..
Optimizing-Retail-Strategy-based-on-the-Pattern-of-Products-in-Groceries-Transactions
What kind of relationship of product that we have in the UK and Ireland wholesalers? How can that information be used to help business decisions making? You can find the answers in this repo.
evelinesurbakti
A brief summary about me
Bayesian-Analysis
How to fit Bayesian model using JAGS
Buffon-s-Needle-vs-Monte-Carlo
Calculate Pi with Buffon's Needle and Monte Carlo
Climr-Package
The package can be used to analyze and plot the latest global hemispheric climate data
data-engineering-book
Accumulated knowledge and experience in the field of Data Engineering
Machine-Learning-and-AI
3 Capstone Projects: Predicting Daily Activities and Sports Project, Predicting the Amount of Active Ingredient of Pharmaceutical Tablets Data, and Predicting handwritten Digits from the USPS.
Satellite-Images-Prediction-with-Random-Forests-and-Multinomial-Logistic-Regression
Developed a random forest model to predict satellite images with 91.4% accuracy and 0.02% standard deviation of accuracies for all replications.
House-Price-Prediction-with-Regression-Model
Developed a regression model to predict house price. Conducted EDA, ANOVA and feature engineering to derive feature relationships and treat data anomalies such as outliers, influences and leverage.
Lehmer-Random-Number-Generator
When randomness is not random
Multivariate-Analysis
Clustering, Binary Data Clustering and Metric Scaling
powerbi-clustering
How to implement clustering in Power BI using PyCaret
R-exercises
because basic is important
shiny_scraper
Shiny object that scrapes coinmarketcap.com
Stochastics-Models
A stochastic model is a tool for estimating probability distributions of potential outcomes by allowing for random variation in one or more inputs over time.
Time-Series
A time series is a series of data points indexed in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data.