We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Feature versus Raw Sequence: Deep Learning Comparative Study on Predicting Pre-miRNA

Abstract: Should we input known genome sequence features or input sequence itself in deep learning framework? As deep learning more popular in various applications, researchers often come to question whether to generate features or use raw sequences for deep learning. To answer this question, we study the prediction accuracy of precursor miRNA prediction of feature-based deep belief network and sequence-based convolution neural network. Tested on a variant of six-layer convolution neural net and three-layer deep belief network, we find the raw sequence input based convolution neural network model performs similar or slightly better than feature based deep belief networks with best accuracy values of 0.995 and 0.990, respectively. Both the models outperform existing benchmarks models. The results shows us that if provided large enough data, well devised raw sequence based deep learning models can replace feature based deep learning models. However, construction of well behaved deep learning model can be very challenging. In cased features can be easily extracted, feature-based deep learning models may be a better alternative.
Comments: 12 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:1704.03834
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN)
Cite as: arXiv:1710.06798 [cs.LG]
  (or arXiv:1710.06798v1 [cs.LG] for this version)

Submission history

From: Sael Lee [view email]
[v1] Tue, 17 Oct 2017 14:09:00 GMT (317kb,D)

Link back to: arXiv, form interface, contact.