TomSmithCGAT / Proteomics_linear_modeling

Short workshop to demonstrate some uses of linear modeling in proteomics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Identifying changes in RNA binding

A short workshop to demonstrate how to use linear modeling to identify changes in RNA binding from parallel "Total" and "RNA-bound" protein quantification.

Here, we assume the reader is aware of the basic principles of protein quantification using isobaric tags (in this case, TMT labeling). We further assume the reader is aware of the OOPS method used to obtain RNA-bound protein quantification (OOPS paper). Finally, the reader will benefit from prior understanding of the MsnSet class.

The workshop takes the form of R markdown notebooks (See here) with worked examples and questions and tasks to prompt the reader to consider important aspects of the analysis. When working through the workshop by oneself, the answers to the questions and tasks can be found here.

The notebooks cover the following:

1_simple_example.Rmd Working up from an example of modeling the abundance of a single protein in a control vs treatment experiment to modeling the change in RNA binding for a single protein in a control vs treatment experiment.

2_identify_changes_in_RNA_binding.Rmd Applying a linear model (using lm) to each protein in a real data set to identify those with a signficant change in RNA binding

3_using_limma.Rmd Applying a moderated linear model using limma to shrink the observed biological variance toward the expected values from proteins with a similar abundance. Comparing the results obtained with lm and limma.

4_get_all_go_terms_h_sapiens.Rmd Expanding the set of annotated GO terms to the complete set of implictly annotated GO terms

5_Identify_over_rep_GO_terms.Rmd Using the above expanded GO term annotations and goseq to identify the GO terms over-represented in the proteins with a significant change in RNA binding. Using the topTreat function from limma to test an adjusted null hypothesis to additionally take into account the mimimum effect size which is deemed biologically relevant.

About

Short workshop to demonstrate some uses of linear modeling in proteomics


Languages

Language:HTML 99.9%Language:R 0.1%