We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DB

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Databases

Title: Learning Multi-dimensional Indexes

Abstract: Scanning and filtering over multi-dimensional tables are key operations in modern analytical database engines. To optimize the performance of these operations, databases often create clustered indexes over a single dimension or multi-dimensional indexes such as R-trees, or use complex sort orders (e.g., Z-ordering). However, these schemes are often hard to tune and their performance is inconsistent across different datasets and queries. In this paper, we introduce Flood, a multi-dimensional in-memory index that automatically adapts itself to a particular dataset and workload by jointly optimizing the index structure and data storage. Flood achieves up to three orders of magnitude faster performance for range scans with predicates than state-of-the-art multi-dimensional indexes or sort orders on real-world datasets and workloads. Our work serves as a building block towards an end-to-end learned database system.
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
DOI: 10.1145/3318464.3380579
Cite as: arXiv:1912.01668 [cs.DB]
  (or arXiv:1912.01668v1 [cs.DB] for this version)

Submission history

From: Jialin Ding [view email]
[v1] Tue, 3 Dec 2019 20:10:31 GMT (1339kb,D)

Link back to: arXiv, form interface, contact.