Title: BioCode: A Data-Driven Procedure to Learn the Growth of Biological Networks

Authors: Emre Sefer
Abstract: Probabilistic biological network growth models have been used for many tasks, including capturing the mechanisms and dynamics of biological growth, representing null models, and capturing anomalies. Well-known examples of these probabilistic models are the Kronecker model, the preferential attachment model, and the duplication-based model. However, new models must continually be developed to better fit and explain observed network features as new networks are observed, and it is difficult to develop a growth model each time a new network is studied. In this paper, we propose BioCode, a framework that automatically discovers novel biological growth models matching user-specified graph attributes in directed and undirected biological graphs. BioCode defines a basic set of instructions that is general enough to express a number of well-known biological graph growth models. We combine this instruction-based representation with a genetic-algorithm-based optimization procedure to encode models for various biological networks. We mainly evaluate BioCode on discovering models for biological collaboration networks, gene regulatory networks, metabolic networks, and protein interaction networks whose features, such as assortativity, clustering coefficient, and degree distribution, closely match those of the corresponding real biological networks. Tests on simulated graphs show that the variance of the distributions of biological networks generated by BioCode is similar to the variance of the known models for these biological network types.
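As a rough illustration of the idea in the abstract (growth models written as short instruction sequences and searched with a genetic algorithm), the following minimal Python sketch may help. It is not BioCode's implementation: the three instructions, the two graph features (mean degree and average clustering coefficient), the unweighted squared-error loss, and all function names and parameter values are assumptions made for this example.

```python
import random

# Hypothetical instruction set, chosen for illustration (not BioCode's own).
INSTRUCTIONS = ("UNIFORM", "PREFERENTIAL", "TRIANGLE")


def grow(program, n_nodes, rng):
    """Grow an undirected graph by running `program` once for each new node."""
    adj = {0: {1}, 1: {0}}  # seed graph: a single edge
    for new in range(2, n_nodes):
        existing = list(adj)
        adj[new] = set()
        last = None
        for op in program:
            # TRIANGLE links to a neighbour of the previous target; with no
            # previous target it falls back to a uniform choice.
            near = sorted(adj[last] - {new}) if last is not None else []
            if op == "PREFERENTIAL":
                weights = [len(adj[v]) for v in existing]  # degree-proportional
                target = rng.choices(existing, weights=weights)[0]
            elif op == "TRIANGLE" and near:
                target = rng.choice(near)
            else:
                target = rng.choice(existing)
            adj[new].add(target)
            adj[target].add(new)
            last = target
    return adj


def avg_clustering(adj):
    """Average local clustering coefficient (nodes of degree < 2 count as 0)."""
    total = 0.0
    for nbrs in adj.values():
        nb, k = list(nbrs), len(nbrs)
        if k >= 2:
            links = sum(nb[j] in adj[nb[i]] for i in range(k) for j in range(i + 1, k))
            total += 2.0 * links / (k * (k - 1))
    return total / len(adj)


def features(adj):
    """Feature vector compared against the target: (mean degree, clustering)."""
    return (sum(len(s) for s in adj.values()) / len(adj), avg_clustering(adj))


def loss(program, target, n_nodes, seed, samples=3):
    """Mean squared feature error over a few graphs grown from fixed seeds."""
    total = 0.0
    for s in range(samples):
        f = features(grow(program, n_nodes, random.Random(seed + s)))
        total += sum((a - b) ** 2 for a, b in zip(f, target))
    return total / samples


def mutate(program, rng, max_len=4):
    """Insert, delete, or replace one instruction, keeping 1 <= length <= max_len."""
    p = list(program)
    move = rng.choice(("replace", "insert", "delete"))
    if move == "insert" and len(p) < max_len:
        p.insert(rng.randrange(len(p) + 1), rng.choice(INSTRUCTIONS))
    elif move == "delete" and len(p) > 1:
        del p[rng.randrange(len(p))]
    else:
        p[rng.randrange(len(p))] = rng.choice(INSTRUCTIONS)
    return tuple(p)


def crossover(a, b, rng, max_len=4):
    """Join a non-empty prefix of `a` with a non-empty suffix of `b`."""
    child = a[: rng.randrange(1, len(a) + 1)] + b[rng.randrange(len(b)):]
    return child[:max_len]


def evolve(target, n_nodes=60, pop_size=12, generations=8, seed=0):
    """Elitist genetic search for a program whose graphs match `target`."""
    rng = random.Random(seed)
    pop = [tuple(rng.choice(INSTRUCTIONS) for _ in range(rng.randint(1, 3)))
           for _ in range(pop_size)]

    def score(p):
        return loss(p, target, n_nodes, seed)

    for _ in range(generations):
        pop.sort(key=score)
        elite = pop[: pop_size // 2]  # the better half survives unchanged
        children = [mutate(crossover(rng.choice(elite), rng.choice(elite), rng), rng)
                    for _ in range(pop_size - len(elite))]
        pop = elite + children
    return min(pop, key=score)


if __name__ == "__main__":
    # Target features taken from a graph grown by a known program.
    reference = grow(("PREFERENTIAL", "TRIANGLE"), 60, random.Random(42))
    print(evolve(features(reference)))
```

Because the better half of each generation is carried over unchanged, the best loss in the population never increases between generations. The two features are compared on their raw scales, so mean degree dominates the loss; a real use would presumably normalise the features or compare full distributions such as the degree distribution.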
Subjects: Molecular Networks (q-bio.MN); Quantitative Methods (q-bio.QM)
Cite as: arXiv:2108.04776 [q-bio.MN]
  (or arXiv:2108.04776v2 [q-bio.MN] for this version)

Submission history

From: Emre Sefer
[v1] Tue, 10 Aug 2021 16:44:10 GMT (2660kb,D)
[v2] Sun, 5 Sep 2021 07:41:51 GMT (719kb,D)
