We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.AI

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Artificial Intelligence

Title: An A*-algorithm for the Unordered Tree Edit Distance with Custom Costs

Abstract: The unordered tree edit distance is a natural metric to compute distances between trees without intrinsic child order, such as representations of chemical molecules. While the unordered tree edit distance is MAX SNP-hard in principle, it is feasible for small cases, e.g. via an A* algorithm. Unfortunately, current heuristics for the A* algorithm assume unit costs for deletions, insertions, and replacements, which limits our ability to inject domain knowledge. In this paper, we present three novel heuristics for the A* algorithm that work with custom cost functions. In experiments on two chemical data sets, we show that custom costs make the A* computation faster and improve the error of a 5-nearest neighbor regressor, predicting chemical properties. We also show that, on these data, polynomial edit distances can achieve similar results as the unordered tree edit distance.
Comments: Accepted at the 14th International Conference on Similarity Search and Applications (SISAP 2021)
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
DOI: 10.1007/978-3-030-89657-7_27
Cite as: arXiv:2108.00953 [cs.AI]
  (or arXiv:2108.00953v1 [cs.AI] for this version)

Submission history

From: Benjamin Paassen [view email]
[v1] Mon, 26 Jul 2021 12:57:27 GMT (21kb)

Link back to: arXiv, form interface, contact.