We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Methodology

Title: Differential analysis in Transcriptomic: The strength of randomly picking 'reference' genes

Abstract: Transcriptomic analysis are characterized by being not directly quantitative and only providing relative measurements of expression levels up to an unknown individual scaling factor. This difficulty is enhanced for differential expression analysis. Several methods have been proposed to circumvent this lack of knowledge by estimating the unknown individual scaling factors however, even the most used one, are suffering from being built on hardly justifiable biological hypotheses or from having weak statistical background. Only two methods withstand this analysis: one based on largest connected graph component hardly usable for large amount of expressions like in NGS, the second based on $\log$-linear fits which unfortunately require a first step which uses one of the methods described before.
We introduce a new procedure for differential analysis in the context of transcriptomic data. It is the result of pooling together several differential analyses each based on randomly picked genes used as reference genes. It provides a differential analysis free from the estimation of the individual scaling factors or any other knowledge. Theoretical properties are investigated both in term of FWER and power. Moreover in the context of Poisson or negative binomial modelization of the transcriptomic expressions, we derived a test with non asymptotic control of its bounds. We complete our study by some empirical simulations and apply our procedure to a real data set of hepatic miRNA expressions from a mouse model of non-alcoholic steatohepatitis (NASH), the CDAHFD model. This study on real data provides new hits with good biological explanations.
Comments: 30 pages, 2 figures
Subjects: Methodology (stat.ME); Applications (stat.AP)
Cite as: arXiv:2103.09872 [stat.ME]
  (or arXiv:2103.09872v3 [stat.ME] for this version)

Submission history

From: Yves Rozenholc [view email]
[v1] Wed, 17 Mar 2021 19:18:44 GMT (106kb,D)
[v2] Mon, 22 Mar 2021 14:14:19 GMT (106kb,D)
[v3] Tue, 23 Mar 2021 01:57:15 GMT (107kb,D)

Link back to: arXiv, form interface, contact.