We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.AP

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Methodology

Title: A super scalable algorithm for short segment detection

Abstract: In many applications such as copy number variant (CNV) detection, the goal is to identify short segments on which the observations have different means or medians from the background. Those segments are usually short and hidden in a long sequence, and hence are very challenging to find. We study a super scalable short segment (4S) detection algorithm in this paper. This nonparametric method clusters the locations where the observations exceed a threshold for segment detection. It is computationally efficient and does not rely on Gaussian noise assumption. Moreover, we develop a framework to assign significance levels for detected segments. We demonstrate the advantages of our proposed method by theoretical, simulation, and real data studies.
Comments: To be published in Statistics in Biosciences
Subjects: Methodology (stat.ME); Applications (stat.AP); Computation (stat.CO)
Cite as: arXiv:2003.12540 [stat.ME]
  (or arXiv:2003.12540v1 [stat.ME] for this version)

Submission history

From: Ning Hao [view email]
[v1] Fri, 27 Mar 2020 17:08:22 GMT (85kb,D)

Link back to: arXiv, form interface, contact.