Pat Mendoza's repositories
animelistextract
Extract API information from MyAnimeList
crunchyrolltitles
Scraping Crunchyroll titles and dataset
animelistclean
Clean Data from MyAnimeList API
annotationwrangling
Converting and integrating data from multiple sources is often tricky business. Luckily there are some great tools available that make this a breeze. I use a genetic annotation file (Brachypodium) and incorporate gene ontology definitions. This Uses dplyr and tidyr to do the data wrangling.
annotationwrangling_python
Datawrangling in Python using pandas
clustering
Clustering is a common exercise to determine how closely samples are related to each other. This shows how samples can be clustered using a PCoA and PCA and visualizing using ggplot. Particularly, how to cluster RNA-seq samples.
funimationtitles
Scraping Funimation Titles from MyAnimeList
geneontologyconversion
Oftentimes, we come across data that isn't in the form that we need to make joins, when that happens, we can convert those using simple python scripts I use the gene ontology OBO format and convert it into tabular format for making joins with other tables using this python script.
hidivescrape
Scraping HIDIVE Anime Television titles
mirrorplot
Creating a simple mirrorplot can be good visualization for showing up/down regulated genes in an RNA-seq. This details how to create a mirrorplot using ggplot2.
patmendoza330
Config files for my GitHub profile.
webscraping
Practical Web Scraping with R and rvest: Scraping anime listings from Crunchyroll and HIDIVE