Current browse context:
q-bio.GN
Change to browse by:
References & Citations
Quantitative Biology > Genomics
Title: Systematic clustering algorithm for epigenetic data from high-throughput sequencing and its application to hematopoietic and leukemic cells
(Submitted on 23 Dec 2019 (this version), latest version 26 Nov 2020 (v2))
Abstract: The huge amount of data acquired by high-throughput sequencing requires data reduction for effective analysis. Here we develop a new data reduction method for genome-wide open chromatin data toward cell type classification. Regarding the genome as a string of 1s and 0s based on a systematically optimized set of peaks and calculating the Hamming distance enables us to quantitatively evaluate differences between samples of hematopoietic cells, classify cell types, and infer the origin of leukemic cells, potentially leading to a better understanding of leukemia pathogenesis.
Submission history
From: Hiroki Ohta [view email][v1] Mon, 23 Dec 2019 06:34:36 GMT (1453kb,D)
[v2] Thu, 26 Nov 2020 19:00:27 GMT (10347kb)
Link back to: arXiv, form interface, contact.