We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DM

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Discrete Mathematics

Title: Criteria for the numerical constant recognition

Abstract: The need for recognition/approximation of functions in terms of elementary functions/operations emerges in many areas of experimental mathematics, numerical analysis, computer algebra systems, model building, machine learning, approximation and data compression. One of the most underestimated methods is the symbolic regression. In the article, reductionist approach is applied, reducing full problem to constant functions, i.e, pure numbers (decimal, floating-point). However, existing solutions are plagued by lack of solid criteria distinguishing between random formula, matching approximately or literally decimal expansion and probable ''exact'' (the best) expression match in the sense of Occam's razor. In particular, convincing STOP criteria for search were never developed. In the article, such a criteria, working in statistical sense, are provided. Recognition process can be viewed as (1) enumeration of all formulas in order of increasing Kolmogorov complexity K (2) random process with appropriate statistical distribution (3) compression of a decimal string. All three approaches are remarkably consistent, and provide essentially the same limit for practical depth of search. Tested unique formulas count must not exceed 1/sigma, where sigma is relative numerical error of the target constant. Beyond that, further search is pointless, because, in the view of approach (1), number of equivalent expressions within error bounds grows exponentially; in view of (2), probability of random match approaches 1; in view of (3) compression ratio much smaller than 1.
Comments: 20 pages + Supplemental Material
Subjects: Discrete Mathematics (cs.DM); Symbolic Computation (cs.SC); Other Statistics (stat.OT)
ACM classes: G.2.3; G.3; G.4; I.1.1; I.2.m; F.2.3; G.1.m; F.1.m
Cite as: arXiv:2002.12690 [cs.DM]
  (or arXiv:2002.12690v2 [cs.DM] for this version)

Submission history

From: Andrzej Odrzywolek [view email]
[v1] Fri, 28 Feb 2020 13:01:21 GMT (2419kb,D)
[v2] Tue, 8 Jun 2021 09:12:09 GMT (3228kb,D)

Link back to: arXiv, form interface, contact.