Emre Erhan's repositories
pandas_workshop
Built-in data structures in Python are insufficient for data analysis. This workshop introduces Pandas, a Python library that facilitates efficient and simple data analysis with easy-to-use data structures. I will assume you're at least at a beginner level in Python. Please come with Anaconda for Python 3.6 installed (https://www.anaconda.com/download/).
random_forest_workshop
Machine learning classifiers are a powerful tool for determining to which category novel data belongs given some training data. This workshop explores the basics of using the scikit-learn Python library with some toy cancer datasets.
spaced-seeds
Scripts to empirically show how spaced seed entropy is related to their sensitivity for homology search.
Classroom
Content for running instances of STAT 545/547M at UBC
docker-imgs
A repository to store all the docker images used in flowcraft
kmer-optimization
Notebooks and scripts for building a generative model to describe expected unitigs for a given sequencing read set
modular-assembly-hs18
Mix and match modular genome assembler components
homebrew-bio
:beer::microscope: Bioinformatics formulae for the Homebrew and Linuxbrew package managers
homebrew-science
:snowflake: Scientific formulae for Homebrew and Linuxbrew (legacy)
portal_client
Python-based client for downloading data made available through portals powered by the GDC-based portal system..
scikit-learn
scikit-learn: machine learning in Python
STAT540-UBC.github.io
Public repository of STAT540@University of British Columbia. Statistical Methods for High Dimensional Biology
STAT545_participation
A respository for recording my classroom participation for STAT545 at UBC
stlfr2supernova_pipeline
A pipeline to de novo assemble the stLFR reads using Supernova Assembler
tarpaulin
π GitHub Action for code coverage reporting with tarpaulin