Module activity between branches (and over pseudotime)

Question

Module activity between branches (and over pseudotime)

anderswe opened this issue 2 years ago · comments

Hello,

Super grateful for your work and for sharing Pando with us.

Would you be willing to share code (or provide some pointers/links, in addition to the language included in your paper's Methods) about assessing branch-specific module activity / module activity over pseudotime, as in your paper Figure 2?

Thank you!

jonas · Answer 1 · Sat Nov 12 2022 22:42:53 GMT+0800 (China Standard Time)

Hi @anderswe, to compute the module activity, you can get the TF modules you can run find_modules() after you have inferred the GRN. Then your object should contain a modules object which you can access with NetworkModules(). In there you should find a list containing the peak/gene modules for each TF. With these gene sets you can then compute a module activity, e.g. by using the Seurat function AddModuleScore().
To obtain the branch-specific GRNs, we first performed a differential accessibility analysis and then filtered the GRN graph you get from Rando to only retain edges that were enriched in the branch, but one could also simply filter by some detection cutoff. I might also implement functions for this into Pando in the future. I hope this helps :)
If you tell me which exact analysis you wish to replicate I can also point you to the code of course.

anderswe · Answer 2 · Sun Nov 13 2022 23:17:49 GMT+0800 (China Standard Time)

Hi @joschif, thanks for your quick and helpful reply!

The exact analyses I'd wish to replicate are:

generating the TF activity scores in 2g
colouring nodes by pseudotime in 2c (and sizing by PageRank centrality)
inferring target specificity for branch-specific TFs, as in 2e

I would be very grateful if you could point me to the code for these, thank you!

jonas · Answer 3 · Wed Nov 16 2022 00:01:49 GMT+0800 (China Standard Time)

Most of the figure has been generated by this script. However some aspects of the analyses you are interested in were computed prior to this.

The calculation of TF activity scores can be found tarting at line 1448. Here we simply computed the product of the mean TF expression in the branch and the model coefficient from Pando.
The gene pseudotime values were computed prior to this, but it's essentially expression-weighted pseudotime of the TF computed with this code chunk:

weighted_mean_pt <- apply(expr_mat, 1, function(x){
    non0 <- which(x!=0)
    wex <- weighted.mean(pseudotime[non0], x[non0])
    return(wex)
})

Here we partitioned the GRN into branch-specific networks by only considering regulatory regions enriched in that branch. This analysis was unfortunately not done by me, but you can achieve this by performing differential accessibility between cell types and then filtering the GRN graph from Pando (NetworkGraph()) to only retain edges with celltype-enriched regions. In the resulting cell type-specific GRNs you can then simply count the number of targets for each TF.

I hope this is somewhat helpful to you :)

Cheers,
Jonas

anderswe · Answer 4 · Wed Nov 16 2022 02:00:00 GMT+0800 (China Standard Time)

Super helpful! Thank you!!

Ping-lin14 · Answer 5 · Tue Jan 03 2023 14:25:23 GMT+0800 (China Standard Time)

Hi @joschif

I have a question, how do I annotate GRN? Like Figure 2d.

jonas · Answer 6 · Wed Jan 04 2023 19:05:55 GMT+0800 (China Standard Time)

@Ping-lin14 We looked at module activity in different stages and branches to roughly annotate the TFs