We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.NE

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Neural and Evolutionary Computing

Title: A Study of Learning Search Approximation in Mixed Integer Branch and Bound: Node Selection in SCIP

Abstract: In line with the growing trend of using machine learning to help solve combinatorial optimisation problems, one promising idea is to improve node selection within a mixed integer programming (MIP) branch-and-bound tree by using a learned policy. Previous work using imitation learning indicates the feasibility of acquiring a node selection policy, by learning an adaptive node searching order. In contrast, our imitation learning policy is focused solely on learning which of a node's children to select. We present an offline method to learn such a policy in two settings: one that comprises a heuristic by committing to pruning of nodes; one that is exact and backtracks from a leaf to guarantee finding the optimal integer solution. The former setting corresponds to a child selector during plunging, while the latter is akin to a diving heuristic. We apply the policy within the popular open-source solver SCIP, in both heuristic and exact settings. Empirical results on five MIP datasets indicate that our node selection policy leads to solutions significantly more quickly than the state-of-the-art precedent in the literature. While we do not beat the highly-optimised SCIP state-of-practice baseline node selector in terms of solving time on exact solutions, our heuristic policies have a consistently better optimality gap than all baselines, if the accuracy of the predictive model is sufficient. Further, the results also indicate that, when a time limit is applied, our heuristic method finds better solutions than all baselines in the majority of problems tested. We explain the results by showing that the learned policies have imitated the SCIP baseline, but without the latter's early plunge abort. Our recommendation is that, despite the clear improvements over the literature, this kind of MIP child selector is better seen in a broader approach using learning in MIP branch-and-bound tree decisions.
Comments: Authors' version, not publisher's final version which is available at DOI
Subjects: Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
MSC classes: 90C11
ACM classes: I.2.6; I.2.8
Journal reference: AI, volume 2, number 2, pages 150-178, 2021
DOI: 10.3390/ai2020010
Cite as: arXiv:2007.03948 [cs.NE]
  (or arXiv:2007.03948v2 [cs.NE] for this version)

Submission history

From: Neil Yorke-Smith [view email]
[v1] Wed, 8 Jul 2020 08:12:44 GMT (241kb,D)
[v2] Mon, 3 Jan 2022 21:04:30 GMT (1128kb,D)

Link back to: arXiv, form interface, contact.