Heatmap of the Count Matrix
1
0
Entering edit mode
@b39b3713
Last seen 2 days ago
United Kingdom

When following your workflow in https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#standard-workflow, why do you use rowMeans to display the Heatmap of count matrix? Can I use data from PCA to select the top 20 genes? In my case, I have 6,770 genes. If I choose only the top 20 genes, I assume it does not accurately represent the genes, so I want to use PCA loadings. Does it make sense?

library("pheatmap")
select <- order(rowMeans(counts(dds,normalized=TRUE)),
                decreasing=TRUE)[1:20]
df <- as.data.frame(colData(dds)[,c("condition","type")])
pheatmap(assay(ntd)[select,], cluster_rows=FALSE, show_rownames=FALSE,
         cluster_cols=FALSE, annotation_col=df)

sessionInfo( )
rnaseqGene • 1.3k views
ADD COMMENT
1
Entering edit mode
ATpoint ★ 4.9k
@atpoint-13662
Last seen 6 days ago
Germany

This section of the vignette is merely a very basic data exploration and by no means set into stone. The rowMeans is just an ordering function here, do whatever you feel is appropriate to explore your data properly.

ADD COMMENT
0
Entering edit mode

Thanks for clarifying that this is just a starting point papa's games for data exploration. I'll definitely dive in and use the best methods to properly explore the data.

ADD REPLY

Login before adding your answer.

Traffic: 726 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6
OSZAR »