mattlee821 / MSc_proteins-crc

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MSc_proteins-crc

overview

There is growing evidence, from Mendelian randomisation analyses, that some proteins are "causal" for colorectal cancer (CRC) development. A limitation of Mendelian randomisation analyses is that the data used (plasma proteins) do not necessarily reflect a localised exposure, for instance protein abundance measured in serum may not reflect protein abundance in the colon. Tissue-specific MR uses tissue specific (e.g., colon tissue) gene expression data to identify instruments to be used in MR analyses. This project aims to establish whether plasma proteins causal for CRC are associated with colorectal cancer in colon tissue.

data

We obtained data for (1) plasma proteins, (2) gene expression, and (3) CRC from:

  1. plasma proteins are from Ferkingstad et al., (2021) and were downloaded from deCODE

    • we used the protein GREM1 as an exemplar
  2. gene expression data are from GTEx V8

    • README
    • we used the protein GREM1 as an exemplar
    • we used data from colon tissue (sigmoid and transverse)
  3. CRC data are from Fernandez-Rozadilla et al., 2023 and were downloaded from the GWAS catalog (accession: GCST90129505)

analysis

  1. MR analysis of plasma proteins and CRC
  2. MR analysis of CRC and plasma proteins
  3. colocalisation of plasma proteins and CRC
  4. colocalisation of plasma proteins and gene expression
  5. colocalisation of gene expression and plasma proteins
  6. MR analysis of gene expression and CRC
  7. Gene-Set enrichment analysis of associated proteins
  8. Sensitivity analysis
  9. Interpretation and visualization

code

About


Languages

Language:R 100.0%