We gratefully acknowledge support from
the Simons Foundation and member institutions.

Databases

New submissions

[ total of 3 entries: 1-3 ]
[ showing up to 1000 entries per page: fewer | more ]

New submissions for Thu, 25 Apr 24

[1]  arXiv:2404.15670 [pdf, other]
Title: HTAP Databases: A Survey
Comments: IEEE Transactions on Knowledge and Data Engineering, 2024
Subjects: Databases (cs.DB)

Since Gartner coined the term, Hybrid Transactional and Analytical Processing (HTAP), numerous HTAP databases have been proposed to combine transactions with analytics in order to enable real-time data analytics for various data-intensive applications. HTAP databases typically process the mixed workloads of transactions and analytical queries in a unified system by leveraging both a row store and a column store. As there are different storage architectures and processing techniques to satisfy various requirements of diverse applications, it is critical to summarize the pros and cons of these key techniques. This paper offers a comprehensive survey of HTAP databases. We mainly classify state-of-the-art HTAP databases according to four storage architectures: (a) Primary Row Store and In-Memory Column Store; (b) Distributed Row Store and Column Store Replica; (c) Primary Row Store and Distributed In-Memory Column Store; and (d) Primary Column Store and Delta Row Store. We then review the key techniques in HTAP databases, including hybrid workload processing, data organization, data synchronization, query optimization, and resource scheduling. We also discuss existing HTAP benchmarks. Finally, we provide the research challenges and opportunities for HTAP techniques.

Cross-lists for Thu, 25 Apr 24

[2]  arXiv:2404.15840 (cross-list from cs.LO) [pdf, ps, other]
Title: Constructive Interpolation and Concept-Based Beth Definability for Description Logics via Sequents
Comments: Accepted to IJCAI 2024
Subjects: Logic in Computer Science (cs.LO); Artificial Intelligence (cs.AI); Databases (cs.DB); Logic (math.LO)

We introduce a constructive method applicable to a large number of description logics (DLs) for establishing the concept-based Beth definability property (CBP) based on sequent systems. Using the highly expressive DL RIQ as a case study, we introduce novel sequent calculi for RIQ-ontologies and show how certain interpolants can be computed from sequent calculus proofs, which permit the extraction of explicit definitions of implicitly definable concepts. To the best of our knowledge, this is the first sequent-based approach to computing interpolants and definitions within the context of DLs, as well as the first proof that RIQ enjoys the CBP. Moreover, due to the modularity of our sequent systems, our results hold for any restriction of RIQ, and are applicable to other DLs by suitable modifications.

Replacements for Thu, 25 Apr 24

[3]  arXiv:2310.00749 (replaced) [pdf, other]
Title: SEED: Domain-Specific Data Curation With Large Language Models
Comments: preprint, 20 pages, 4 figures
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[ total of 3 entries: 1-3 ]
[ showing up to 1000 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, recent, 2404, contact, help  (Access key information)