Cyanjiner's repositories
CADS-Capstone
This is a capstone research project for my Certificate in Applied Data Science (CADS) at my undergraduate institution, Wesleyan University, on the topic of "Understanding the Variances in COVID-19 Pandemic Outcome - Excess Mortality - with Social, Cultural, and Environmental Factors", sponsored by Prof. Maryam Gooyabadi.
classroom-analysis
A hybrid of statistical, computational linguistic, and NLP-based approaches to measuring student learning and teaching efficacy through classroom transcripts.
EdTM
This research project evaluates classic Latent Dirichlet Allocation (LDA) and neural topic modeling on a novel corpus of education research, and further investigates the "topic collapsing" problem in Dieng et al. (2020)’s implementation of the embedded topic model (ETM).
rtweetstats
The goal of rtweetstats is to make Twitter analyses more accessible to the average R user through two functions: keystats and userstats. The former analyzes user-specified keywords and hashtags, returning an HTML output with various summary graphics, while the latter analyzes the pages of user-specified Twitter users, returning an HTML output with account summary information as well as a user-specified number of most recent tweets.
songrater-mobile
This is a cross-platform music rating software application.
stat-consulting
This is a statistical analysis research project, "Analyzing Client Behavior in The Connection", sponsored by The Connection Inc. and the Wesleyan Quantitative Analysis Center.
booknlp
BookNLP, a natural language processing pipeline for books
CILS4EU-Project
Stanford EDS seminar research project on high school friendship networks & school satisfaction
conversational-uptake
Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"
covid-19-data
Data on COVID-19 (coronavirus) cases, deaths, hospitalizations, tests • All countries • Updated daily by Our World in Data
covid-policy-tracker
Systematic dataset of Covid-19 policy, from Oxford University
Data-Science-Interview-Resources
A repository listing out the potential sources which will help you in preparing for a Data Science/Machine Learning interview. New resources added frequently.
DataScienceSpCourseNotes
Compiled Notes for all 9 courses in the Coursera Data Science Specialization
ds-job-prep-notes
This repo is currently for my personal use in preparing for data science summer internships (2023). Feel free to check out the deployed website or use this template for your own learning. Topics covered will include: a review of statistics, probability, A/B testing, ML & NLP models and techniques, SQL, and some commonly asked DS job interview questions.
Mortgage-Market-Expansion
This is a data analysis project on Expansion into the Nebraska Mortgage Market for the 2021 MMA Datathon.
skills-github-pages
My clone repository
stanford-nlu
Code for Stanford CS224u
text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
textbook-analysis
Code for the paper "Content Analysis of Textbooks via Natural Language Processing".