Silvia Onofrei, PhD's repositories
Blogs_Content
Contains Google Colab or Jupyter notebooks, as well as other associated files for my Medium blog posts.
Cypher_Generator
Generate a dataset to fine-tune an LLM to generate Cypher code from questions given in natural language (English).
Knowledge_Graphs_Assortment
Eclectic collection of knowledge graphs and useful datasets.
Disaster_Response_Texts
Text classification of messages collected during and after a natural disaster. Deploy a Flask app on Heroku.
Healthcare_SparkNLP_Study
Analysis of scientific literature on equine colic using SparkNLP and Healthcare SparkNLP.
Recommendation_Systems
Project 3 in Data Scientist Nanodegree with Udacity. Build a recommender engine for IBM Watson.
text2cypher
Collection of text2cypher datasets, evaluations, and fine-tuning instructions.
Customer_Churn_Prediction
Binary classification project in PySpark on an AWS-EMR cluster to predict customer churn.
Developers_Survey_Analysis
Use data visualization, hypothesis testing, and machine learning methods to analyze Stack Overflow developer survey data.
Elements_of_Data_Science
Projects completed for the Data Scientist Nanodegree with Udacity.
Elements_of_Machine_Learning
Projects for the Machine Learning Nanodegree with Udacity.
Elements_of_NLP
Projects completed for the NLP Nanodegree with Udacity.
Exploratory_Data_Analysis
Projects for the Data Analyst Nanodegree with Udacity.
TDS_Blogs_Knowledge_Graph
Create a knowledge graph based on Towards Data Science blogs.
World_Bank_Data_App
Dashboard app using World Bank Data API.