Title: The Complexity of Comparative Text Analysis -- "The Gardener is always the Murderer" says the Fourth Machine

Abstract: There is a heated debate about how far computers can map the complexity of text analysis compared with the abilities of a whole team of human researchers. A "deep" analysis of a given text is still beyond the capabilities of modern computers.
At the heart of existing computational text analysis algorithms are operations on real numbers, such as additions and multiplications according to the rules of algebraic fields. However, the process of "comparing" has a very precise mathematical structure, which differs from the structure of an algebraic field. The mathematical structure of "comparing" can be expressed using Boolean rings. We build on this structure and define the corresponding algebraic equations, lifting algorithms of comparative text analysis onto the "correct" algebraic basis. From this point of view, we can investigate the question of the computational complexity of comparative text analysis.
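
As a rough illustration of what a Boolean ring looks like in practice, the sketch below uses the textbook example of subsets of a fixed set, with symmetric difference as addition and intersection as multiplication. It is not taken from the paper: representing a text as a set of words, the helper names `ring_add` and `ring_mul`, and the two sample word sets are all assumptions made for this example.

```python
# Minimal sketch of a Boolean ring on sets (standard textbook construction).
# Assumption: a "text" is modelled as a frozenset of words; the paper's own
# encoding may differ. ring_add / ring_mul are hypothetical helper names.

def ring_add(a: frozenset, b: frozenset) -> frozenset:
    """Ring addition: symmetric difference (elements in exactly one set)."""
    return a ^ b

def ring_mul(a: frozenset, b: frozenset) -> frozenset:
    """Ring multiplication: intersection (elements shared by both sets)."""
    return a & b

ZERO = frozenset()  # additive identity of the ring

# Hypothetical sample data, invented for illustration only.
text_a = frozenset({"gardener", "murderer", "garden"})
text_b = frozenset({"butler", "murderer", "garden"})
text_c = frozenset({"garden", "library"})

differing = ring_add(text_a, text_b)  # words that distinguish the two texts
shared = ring_mul(text_a, text_b)     # words the two texts have in common

# Properties that separate a Boolean ring from a field of real numbers:
idempotent = ring_mul(text_a, text_a) == text_a    # x * x = x
self_inverse = ring_add(text_a, text_a) == ZERO    # x + x = 0
distributive = (
    ring_mul(text_a, ring_add(text_b, text_c))
    == ring_add(ring_mul(text_a, text_b), ring_mul(text_a, text_c))
)
```

Here "addition" isolates where two texts differ and "multiplication" isolates what they share, which is one way to read the abstract's claim that comparing follows different algebraic rules from arithmetic on real numbers.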
Subjects: Computation and Language (cs.CL)
MSC classes: 06Exx, 13-xx, 68Txx
ACM classes: J.5
Cite as: arXiv:2012.07637 [cs.CL]
  (or arXiv:2012.07637v1 [cs.CL] for this version)

Submission history

From: Konstantin Fackeldey
[v1] Fri, 11 Dec 2020 10:32:35 GMT (164kb,D)
