We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio.GN

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Quantitative Biology > Genomics

Title: Systematic clustering algorithm for epigenetic data from high-throughput sequencing and its application to hematopoietic and leukemic cells

Abstract: The huge amount of data acquired by high-throughput sequencing requires data reduction for effective analysis. Here we develop a new data reduction method for genome-wide open chromatin data toward cell type classification. Regarding the genome as a string of 1s and 0s based on a systematically optimized set of peaks and calculating the Hamming distance enables us to quantitatively evaluate differences between samples of hematopoietic cells, classify cell types, and infer the origin of leukemic cells, potentially leading to a better understanding of leukemia pathogenesis.
Comments: 7 pages, 1 figure. (supplementary materials) 20 pages, 13 figures
Subjects: Genomics (q-bio.GN); Statistical Mechanics (cond-mat.stat-mech); Quantitative Methods (q-bio.QM)
Cite as: arXiv:1912.10641 [q-bio.GN]
  (or arXiv:1912.10641v1 [q-bio.GN] for this version)

Submission history

From: Hiroki Ohta [view email]
[v1] Mon, 23 Dec 2019 06:34:36 GMT (1453kb,D)
[v2] Thu, 26 Nov 2020 19:00:27 GMT (10347kb)

Link back to: arXiv, form interface, contact.