We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio.GN

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Quantitative Biology > Genomics

Title: Systematic clustering algorithm for chromatin accessibility data and its application to hematopoietic cells

Abstract: The huge amount of data acquired by high-throughput sequencing requires data reduction for effective analysis. Here we give a clustering algorithm for genome-wide open chromatin data using a new data reduction method. This method regards the genome as a string of $1$s and $0$s based on a set of peaks and calculates the Hamming distances between the strings. This algorithm with the systematically optimized set of peaks enables us to quantitatively evaluate differences between samples of hematopoietic cells and classify cell types, potentially leading to a better understanding of leukemia pathogenesis.
Comments: 24 pages, 17 figures
Subjects: Genomics (q-bio.GN); Statistical Mechanics (cond-mat.stat-mech); Quantitative Methods (q-bio.QM)
Journal reference: PLOS Comput. Biol. 16(11), e1008422 (2020)
DOI: 10.1371/journal.pcbi.1008422
Cite as: arXiv:1912.10641 [q-bio.GN]
  (or arXiv:1912.10641v2 [q-bio.GN] for this version)

Submission history

From: Hiroki Ohta [view email]
[v1] Mon, 23 Dec 2019 06:34:36 GMT (1453kb,D)
[v2] Thu, 26 Nov 2020 19:00:27 GMT (10347kb)

Link back to: arXiv, form interface, contact.