romcia / TeamTeri

Genomics (computational bioinformatic data analysis) running on GCP, AWS or Azure

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Team Teri

How this Repo is Organized

This Repo contains my own 'study notes' as I learn genomic-scale cloud bioinformatics. It includes descriptions of common tools, platforms and summaries of my work with clients. I update this Repo frequently. It is organized via the folder structure shown below.

  • 🗒️ Concepts and Terms (genomics files types, use cases, terminology and also whitepapers)
  • 🔬 Lab Testing (Illumina and more)
  • ⚒️ Genomic Tools (GATK, VariantSpark, HAIL and many more - this section updates OFTEN)
  • 📦 Genomics Platforms (Terra.bio, Galaxy Project, IDSeq and others)
  • ☁️ Public Cloud Genomics (Alibaba Cloud, AWS, Azure or GCP). The general approach is to implement a cloud-native Data Lake pattern for scalable genomic analysis. A conceptual rendering of this pattern is shown below.

Data Lake Pattern


More Cloud/Genomics Reources

In addition to this Repo, I have a number of other Repos with cloud bioinformatics information. Also, I've included two of my favorite link aggregator resources here for additional learning.


Who is Teri?

Teri is the impetus for my movement into the world of genomic research. She was diagnosed with breast cancer in 2016. She survived, but suffered a long course of intense and painful treatment due in part to the lack of availability of personalized treatment options at the time of her diagnosis.

About

Genomics (computational bioinformatic data analysis) running on GCP, AWS or Azure


Languages

Language:Shell 49.6%Language:JavaScript 10.4%Language:Python 8.6%Language:C# 7.8%Language:C++ 5.6%Language:Julia 5.5%Language:Scala 5.5%Language:Dockerfile 4.4%Language:R 2.7%