We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.PL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Programming Languages

Title: Hashing Modulo Alpha-Equivalence

Abstract: In many applications one wants to identify identical subtrees of a program syntax tree. This identification should ideally be robust to alpha-renaming of the program, but no existing technique has been shown to achieve this with good efficiency (better than $\mathcal{O}(n^2)$ in expression size). We present a new, asymptotically efficient way to hash modulo alpha-equivalence. A key insight of our method is to use a weak (commutative) hash combiner at exactly one point in the construction, which admits an algorithm with $\mathcal{O}(n (\log n)^2)$ time complexity. We prove that the use of the commutative combiner nevertheless yields a strong hash with low collision probability. Numerical benchmarks attest to the asymptotic behaviour of the method.
Comments: Accepted for publication at the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation (PLDI 2021)
Subjects: Programming Languages (cs.PL); Data Structures and Algorithms (cs.DS)
Cite as: arXiv:2105.02856 [cs.PL]
  (or arXiv:2105.02856v1 [cs.PL] for this version)

Submission history

From: Krzysztof Maziarz [view email]
[v1] Thu, 6 May 2021 17:44:30 GMT (743kb,D)

Link back to: arXiv, form interface, contact.