Caterina Bonan's repositories
aicore-recommendation-ranking-system
My Machine Learning specialisation project at AiCore.
aicore-web-scraping-pipeline
Web scraping pipeline I worked on as part of my 'AI and data engineering' training at AiCore.
hugging-face-course-translation
The Hugging Face course.
parameters-corpus-work
Paper that Giuseppe Samo and I are working on as part of my SNSF-funded 'Focus in diachrony' research project at the University of Cambridge, UK.
NLP-for-spoken-corpora
Personal project I am working on to create a corpus of interactions in spoken French for syntactic and prosodic investigations.
aicore-retail-data-centralisation
Project I'm working on as part of my AI and Data training at AiCore.
clefts-corpus-study
Study on the evolution and distribution of declarative and interrogative clefts, intended for use in a joint paper with Adam Ledgeway.
data-cleaning-in-python
This is a LinkedIn Learning repo for Data Cleaning in Python Essential Training.
full-stack-nlp-pipeline
Full stack NLP pipeline I'm working on as a personal project.
geneva-talk-beamer
A beamer presentation for a talk I'm giving in Geneva on 15 December 2022.
health-communication-paper2
Bonan & Samo. January 2023. Paper on cross-linguistic bias in health-related content in Transformer-based language models.
holiday-chatbot
Virtual assistant in Google Dialogflow for the loveholidays website.
interrogatives-corpus-work
Paper that Lena Baunaz and I are working on as part of my SNSF-funded 'Focus in diachrony' research project at the University of Cambridge, UK.
italoromance-survey-paper
Paper intended for publication in Bonan & Ledgeway (2023). Based on survey material collected in 2021.
NLP-les-incontournables
A foundation course in NLP.
nlp-practice
Just getting my hands dirty!
parameters-lesson-beamer
Slide presentation for Ur Shlonsky's 'Lectures en syntaxe contemporaine' class at the University of Geneva.
sentiment-analysis-pipeline
Sentiment analysis pipeline in TensorFlow.
100-python-programs
100+ programs (from basic to advanced) to practice Python syntax.
CaterinaBi
Config files for my GitHub profile.
curriculum-vitae
My up-to-date CV.
intermediate-coding
My solutions to Python Codewars challenges.
knowledge_graph
Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph-based QnA.
my-academic-work
Papers and books I published during my 7 years of research at the Universities of Geneva and Cambridge.
NLP-libraries-comparison
NLTK, spaCy or scikit-learn? From basic text pre-processing to advanced NLP tasks: which library is better for which task? Let me help you choose wisely!
pii-extract-plg-regex
pii-extract plugin for PII detection via regular expressions.
text-analytics
This repository hosts a series of notebooks for text analytics learners. The main focus will be text pre-processing techniques and exploratory data analysis (EDA).