delvinso

followers

following

stars

Vancouver, British Columbia

Delvin So's repositories

covid19_unique_tweets

An on-going dataset consisting of hashtags, n-gram counts and other misc NLP things for covid-19 analysis, stemming from over 100 000 000 tweets collected since mid-January 2020.

MIT57 5 2

friends-tv-show-analysis

Analysis of the Friends series by mining transcripts of all 236 episodes.

Language:RMIT400

spotify-predict-playlist-followers

A repository outlining the retrieval of Spotify's featured playlists and track level characteristics, feature engineering, exploratory data analysis, and modelling of a playlist's success based on followers.

3 10

scraping-and-analyzing-aggregate-review-sites

Can we identify fraudulent behaviour using inferential testing?

MIT100

topic-modelling-subreddit-toronto

Using Google Big Query and Topic Modelling to understand /r/Toronto

Language:RMIT100

a-2017

Public Repository for cs109a, 2017 edition

Language:Jupyter Notebook000

abstract-screening

manuscript code

Language:Python000

exome-report-scripts

Scripts for handling exome reports output from CCM's various pipelines. These are supplementary to those scripts found in `report-scripts` in the CCM repo.

Language:Jupyter Notebook000

ipm_ml_app

Language:Jinja000

kidney_label_classifier

convulational neural network for predicting ultrasound views

Language:Python000

abstract_tool

Language:Jupyter Notebook000

authorship

000

canadian-finance-benefit-subreddit-activity

Language:R000

chicago_crime

An explatory analysis of Chicago crime from 2001 to 2018.

000

covid-twitter-es

Language:Jupyter Notebook000

crg2

Research pipeline for exploring clinically relevant genomic variants

Language:PythonApache-2.0000

divvy-chicago

Code for '14 Million Bike Rides in The Windy City (2013 - 2017)' on my site

Language:RMIT000

ed-notes

Language:Python000

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:PythonApache-2.0000

misc_scripts

Language:Jupyter Notebook000

OCCC

Repository for OCCC related tidbits

Language:Python000

perl_scripts

Random assortment and snippets of perl scripts. See the readme for more details.

Language:Perl000

pos_scheduler

Language:R000

preterm_neonates_prediction

Language:Python000

pushshift-most-requent-words-posts

Using Google Big Query and Pushshift to Analyze Occurences of Words in Titles of Reddit Submissions

Language:RMIT000

rna-seq_master_script_primers

Primers for creating master shell scripts commonly associated with RNA-seq tools and their analysis of NGS data.

000

zillow-nyc-housing-scrape-prediction

Scraping NYC property data from Zillow, GIS feature engineering, and predictive modelling

Language:R000