Title: Population-Based Black-Box Optimization for Biological Sequence Design

Abstract: The use of black-box optimization for the design of new biological sequences is an emerging research area with potentially revolutionary impact. The cost and latency of wet-lab experiments require methods that find good sequences in few experimental rounds of large batches of sequences, a setting that off-the-shelf black-box optimization methods are ill-equipped to handle. We find that the performance of existing methods varies drastically across optimization tasks, posing a significant obstacle to real-world applications. To improve robustness, we propose Population-Based Black-Box Optimization (P3BO), which generates batches of sequences by sampling from an ensemble of methods. The number of sequences sampled from any method is proportional to the quality of sequences it previously proposed, allowing P3BO to combine the strengths of individual methods while hedging against their innate brittleness. Adapting the hyper-parameters of each of the methods online using evolutionary optimization (Adaptive-P3BO) further improves performance. Through extensive experiments on in-silico optimization tasks, we show that P3BO outperforms any single method in its population, proposing higher-quality sequences as well as more diverse batches. As such, P3BO and Adaptive-P3BO are a crucial step towards deploying ML to real-world sequence design.
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Journal reference: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, PMLR 119, 2020
Cite as: arXiv:2006.03227 [cs.LG]
  (or arXiv:2006.03227v2 [cs.LG] for this version)
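
The abstract states that the number of sequences sampled from each method is proportional to the quality of the sequences it previously proposed. The following is a minimal sketch of that allocation idea only, not the authors' implementation: the function name `allocate_batch`, the method names in the usage note, the non-negative per-method quality scores, and the largest-remainder rounding are all assumptions made for illustration. How the paper actually scores methods is not described in this listing.

```python
from typing import Dict


def allocate_batch(quality: Dict[str, float], batch_size: int) -> Dict[str, int]:
    """Split a batch among methods in proportion to their quality scores.

    `quality` maps a (hypothetical) method name to a non-negative score
    summarizing how good its earlier proposals were. Integer counts are
    obtained with largest-remainder rounding so they sum to `batch_size`.
    """
    if not quality:
        raise ValueError("at least one method is required")
    if batch_size < 0:
        raise ValueError("batch_size must be non-negative")
    if any(score < 0 for score in quality.values()):
        raise ValueError("quality scores must be non-negative")

    names = list(quality)
    total = sum(quality.values())
    if total == 0:
        # No method has earned credit yet: fall back to an even split.
        shares = {name: batch_size / len(names) for name in names}
    else:
        shares = {name: batch_size * quality[name] / total for name in names}

    # Take the integer part of each share, then hand the leftover slots
    # to the methods with the largest fractional remainders.
    counts = {name: int(shares[name]) for name in names}
    leftover = batch_size - sum(counts.values())
    by_remainder = sorted(names, key=lambda n: shares[n] - counts[n], reverse=True)
    for name in by_remainder[:leftover]:
        counts[name] += 1
    return counts
```

For example, with two hypothetical methods scored 3.0 and 1.0, `allocate_batch({"method_a": 3.0, "method_b": 1.0}, 8)` assigns six sequences to the first and two to the second. A method with a low score still receives a share as long as its score is positive, which is one simple way to keep the ensemble hedged against any single method failing on a given task.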

Submission history

From: Christof Angermueller
[v1] Fri, 5 Jun 2020 04:28:55 GMT (5279kb,D)
[v2] Sat, 11 Jul 2020 00:33:03 GMT (3318kb,D)