Current browse context:
q-bio.BM
Change to browse by:
References & Citations
Quantitative Biology > Biomolecules
Title: Macromolecule Classification Based on the Amino-acid Sequence
(Submitted on 6 Jan 2020 (v1), last revised 21 Sep 2022 (this version, v2))
Abstract: Deep learning is playing a vital role in every field which involves data. It has emerged as a strong and efficient framework that can be applied to a broad spectrum of complex learning problems which were difficult to solve using traditional machine learning techniques in the past. In this study we focused on classification of protein sequences with deep learning techniques. The study of amino acid sequence is vital in life sciences. We used different word embedding techniques from Natural Language processing to represent the amino acid sequence as vectors. Our main goal was to classify sequences to four group of classes, that are DNA, RNA, Protein and hybrid. After several tests we have achieved almost 99% of train and test accuracy. We have experimented on CNN, LSTM, Bidirectional LSTM, and GRU.
Submission history
From: Faisal Ghaffar [view email][v1] Mon, 6 Jan 2020 08:33:50 GMT (979kb)
[v2] Wed, 21 Sep 2022 21:23:23 GMT (0kb,I)
Link back to: arXiv, form interface, contact.