
Databases

New submissions

[ total of 4 entries: 1-4 ]

New submissions for Fri, 27 Jan 23

[1]  arXiv:2301.10841
Title: Free Join: Unifying Worst-Case Optimal and Traditional Joins
Subjects: Databases (cs.DB)

Over the last decade, worst-case optimal join (WCOJ) algorithms have emerged as a new paradigm for one of the most fundamental challenges in query processing: computing joins efficiently. Such algorithms can be asymptotically faster than traditional binary joins while remaining simple to understand and implement. However, they have been found to be less efficient than the old paradigm, traditional binary join plans, on the typical acyclic queries found in practice. Some database systems that support WCOJ use a hybrid approach: they use WCOJ to process the cyclic subparts of the query (if any), and rely on traditional binary joins otherwise. In this paper we propose a new framework, called Free Join, that unifies the two paradigms. We describe a new type of plan, a new data structure (which unifies the hash tables and tries used by the two paradigms), and a suite of optimization techniques. Our system, implemented in Rust, matches or outperforms both traditional binary joins and Generic Join on standard query benchmarks.
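
For readers unfamiliar with the two paradigms the abstract contrasts, the following is a minimal illustrative sketch, not the paper's Free Join algorithm or its Rust implementation. It evaluates the cyclic triangle query R(a,b), S(b,c), T(a,c) in two ways: a traditional binary hash-join plan, and a variable-at-a-time plan in the style of Generic Join. The function names, the dict-of-sets index, and the sample relations are all assumptions made for this example.

```python
from collections import defaultdict

def index(rel):
    """Two-level index (a simple trie): first column -> set of second-column values."""
    idx = defaultdict(set)
    for x, y in rel:
        idx[x].add(y)
    return idx

def binary_join_triangles(R, S, T):
    """Traditional plan: (R join S on b), then filter against T on (a, c)."""
    s_idx = index(S)            # hash table on S keyed by b
    t_set = set(T)              # hash set for membership tests on (a, c)
    out = []
    for a, b in R:
        # Every (a, b, c) produced here is part of the intermediate result
        # R join S, which may be much larger than the final output.
        for c in s_idx.get(b, ()):
            if (a, c) in t_set:
                out.append((a, b, c))
    return sorted(out)

def generic_join_triangles(R, S, T):
    """Variable-at-a-time plan: bind a, then b, then c by intersecting candidates."""
    r_idx, s_idx, t_idx = index(R), index(S), index(T)
    out = []
    for a in r_idx.keys() & t_idx.keys():        # a must appear in R and T
        for b in s_idx.keys() & r_idx[a]:        # b must follow a in R and appear in S
            for c in s_idx[b] & t_idx[a]:        # c must follow b in S and a in T
                out.append((a, b, c))
    return sorted(out)

# Hypothetical sample relations (duplicate-free lists of pairs).
R = [(1, 2), (1, 3), (2, 3)]
S = [(2, 3), (3, 1), (3, 4)]
T = [(1, 3), (2, 1), (1, 4)]

print(binary_join_triangles(R, S, T))
print(generic_join_triangles(R, S, T))
```

Both functions return the same triangles. The difference is in the work done: the binary plan enumerates every R-S pair before consulting T, while the variable-at-a-time plan prunes candidates at each variable through set intersection.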

Cross-lists for Fri, 27 Jan 23

[2]  arXiv:2301.11049 (cross-list from cs.DC)
Title: Odyssey: A Journey in the Land of Distributed Data Series Similarity Search
Comments: PVLDB 2023
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB)

This paper presents Odyssey, a novel distributed data-series processing framework that efficiently addresses the critical challenges of exhibiting good speedup and ensuring high scalability in data series processing by taking advantage of the full computational capacity of modern clusters composed of multi-core servers. Odyssey addresses a number of challenges in designing an efficient and highly scalable distributed data series index, including efficient scheduling and load balancing without paying the prohibitive cost of moving data around. It also supports a flexible partial replication scheme, which enables Odyssey to navigate a fundamental trade-off between data scalability and good performance during query answering. Through a wide range of configurations and using several real and synthetic datasets, our experimental analysis demonstrates that Odyssey achieves its challenging goals.

Replacements for Fri, 27 Jan 23

[3]  arXiv:2301.08482 (replaced)
Title: A Simple Algorithm for Consistent Query Answering under Primary Keys
Subjects: Databases (cs.DB); Logic in Computer Science (cs.LO)
[4]  arXiv:2301.10673 (replaced)
Title: An Overview on Cloud Distributed Databases for Business Environments
Comments: 9 pages, 10 figures
Subjects: Databases (cs.DB); Networking and Internet Architecture (cs.NI)
