yennie's repositories
tokenizers-languages
Comparing LLM tokenizers in multiple languages
covid-texting-service
Texting Service providing updated Covid-19 stats and answering Covid-19 questions
JoseonMunkwa
Analyzing Korean Joseon-Dynasty Civil Service Bangmok/roster data (조선 문과 방목)
yenniejun.github.io
Personal Site
banned-books
Analysis of recently banned books in the US
contact-tracing
Topic modeling American and Korean news article that mention "contact tracing"
covid-news-analysis
Analysis of covid-related news across 6 countries (3 languages) and topic modeling for topics related to digital contact tracing
internetcafe
A virtual coffeeshop experience
satirical-headline-chrome-extension
Transform boring news headlines into a satirical form
world-history-ai
World history thru the lens of AI
covid_government_type
Did government type have an effect on restrictive COVID-19 measures?
CameraTraps
Tools for training and running detectors and classifiers for wildlife images collected from motion-triggered cameras.
clinicalnlp-ade
Clinical NLP concept extraction of ADEs in the 2018 n2c2 Adverse Drug Events and Medication Extraction (Track 2). Includes data preprocessing, model training/evaluation scripts.
course-nlp
A Code-First Introduction to NLP course
ehr-relation-extraction
NER and Relation Extraction from Electronic Health Records (EHR).
examples
Examples on the use of covid19db database
love-languages
An Expo app with a quiz to find out your love language
music-factorization
Decompose scales into chunks of other scales!
oii-sds-fundamentals-summative
This is my summative for the Fundamentals of Social Data Science course at Oxford Internet Institute
oldpersonalsite
Personal website
project-euler-gpt-langs
Testing GPT-4 on Project Euler problems in 16 different langauges
thesis-radio
Code used for my Masters thesis