Databases

Authors and titles for cs.DB in Jan 2021

[ total of 67 entries: 1-67 ]
[1]  arXiv:2101.00170 [pdf, ps, other]
Title: Visualization Techniques with Data Cubes: Utilizing Concurrency for Complex Data
Comments: 11 pages, 4 figures Update: Revised format to align closer to IEEE standards
Subjects: Databases (cs.DB)
[2]  arXiv:2101.00171 [pdf, ps, other]
Title: Optimizing Data Cube Visualization for Web Applications: Performance and User-Friendly Data Aggregation
Comments: 12 pages, 2 figures, 3 tables Update: Revised format to align closer to IEEE standards
Subjects: Databases (cs.DB)
[3]  arXiv:2101.00361 [pdf, other]
Title: To Share, or not to Share Online Event Trend Aggregation Over Bursty Event Streams
Comments: Technical report for the paper in SIGMOD 2021
Subjects: Databases (cs.DB); Performance (cs.PF)
[4]  arXiv:2101.00808 [pdf, other]
Title: A Pluggable Learned Index Method via Sampling and Gap Insertion
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[5]  arXiv:2101.00810 [pdf, other]
Title: Searching Personalized $k$-wing in Large and Dynamic Bipartite Graphs
Comments: 13 pages, 10 figures and 4 tables
Subjects: Databases (cs.DB); Information Retrieval (cs.IR)
[6]  arXiv:2101.01363 [pdf, ps, other]
Title: Exploring Data and Knowledge combined Anomaly Explanation of Multivariate Industrial Data
Subjects: Databases (cs.DB)
[7]  arXiv:2101.01507 [pdf, other]
Title: A Survey on Advancing the DBMS Query Optimizer: Cardinality Estimation, Cost Model, and Plan Enumeration
Comments: This paper was accepted by Data Science and Engineering (DSEJ) in Dec, 2020
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[8]  arXiv:2101.01852 [pdf, other]
Title: Bridging BAD Islands: Declarative Data Sharing at Scale
Comments: 10 pages, 34 figures, to appear on IEEE Big Data - Workshop on Scalable Cloud Data Management
Subjects: Databases (cs.DB)
[9]  arXiv:2101.02174 [pdf, other]
Title: Efficient Discovery of Approximate Order Dependencies
Subjects: Databases (cs.DB)
[10]  arXiv:2101.02466 [pdf, ps, other]
Title: On the Interaction of Functional and Inclusion Dependencies with Independence Atoms
Subjects: Databases (cs.DB); Logic in Computer Science (cs.LO)
[11]  arXiv:2101.02472 [pdf, other]
Title: Controlling Entity Integrity with Key Sets
Subjects: Databases (cs.DB)
[12]  arXiv:2101.02502 [pdf, other]
Title: An Algorithm for the Discovery of Independence from Data
Subjects: Databases (cs.DB)
[13]  arXiv:2101.02591 [pdf, other]
Title: Efficient Data Management in Neutron Scattering Data Reduction Workflows at ORNL
Comments: 7 pages, 4 figures, International Workshop on Big Data Reduction held with 2020 IEEE International Conference on Big Data
Subjects: Databases (cs.DB)
[14]  arXiv:2101.02914 [pdf, ps, other]
Title: Approximate Query Processing for Group-By Queries based on Conditional Generative Models
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[15]  arXiv:2101.03020 [pdf, ps, other]
Title: Dataset Definition Standard (DDS)
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[16]  arXiv:2101.03058 [pdf, other]
Title: Answer Counting under Guarded TGDs
Journal-ref: Logical Methods in Computer Science, Volume 19, Issue 3 (September 14, 2023) lmcs:8768
Subjects: Databases (cs.DB)
[17]  arXiv:2101.03298 [pdf, ps, other]
Title: FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[18]  arXiv:2101.03712 [pdf, other]
Title: Enumeration Algorithms for Conjunctive Queries with Projection
Comments: journal version for LMCS
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[19]  arXiv:2101.04226 [pdf, other]
Title: DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using Bi-Directional Recurrent Neural Networks
Comments: To appear in VLDB 2021
Subjects: Databases (cs.DB); Computation and Language (cs.CL)
[20]  arXiv:2101.04432 [pdf, other]
Title: Privacy Aspects of Provenance Queries
Comments: Accepted at ProvenanceWeek 2020 ( this https URL )
Subjects: Databases (cs.DB)
[21]  arXiv:2101.04964 [pdf, other]
Title: Flow-Loss: Learning Cardinality Estimates That Matter
Subjects: Databases (cs.DB)
[22]  arXiv:2101.05037 [pdf, other]
Title: Immutable and Democratic Data in permissionless Peer-to-Peer Systems
Subjects: Databases (cs.DB); Cryptography and Security (cs.CR)
[23]  arXiv:2101.05308 [pdf, other]
Title: Toward Data Cleaning with a Target Accuracy: A Case Study for Value Normalization
Subjects: Databases (cs.DB)
[24]  arXiv:2101.06240 [pdf, ps, other]
Title: Towards Approximate Query Enumeration with Sublinear Preprocessing Time
Subjects: Databases (cs.DB); Logic in Computer Science (cs.LO)
[25]  arXiv:2101.06637 [pdf, other]
Title: AMALGAM: A Matching Approach to fairfy tabuLar data with knowledGe grAph Model
Comments: 10 pages
Subjects: Databases (cs.DB); Information Retrieval (cs.IR)
[26]  arXiv:2101.06801 [pdf, other]
Title: Real-Time LSM-Trees for HTAP Workloads
Subjects: Databases (cs.DB)
[27]  arXiv:2101.07136 [pdf, other]
Title: Trav-SHACL: Efficiently Validating Networks of SHACL Constraints
Subjects: Databases (cs.DB)
[28]  arXiv:2101.07690 [pdf, other]
Title: Efficient Mining of Frequent Subgraphs with Two-Vertex Exploration
Subjects: Databases (cs.DB); Performance (cs.PF)
[29]  arXiv:2101.08784 [pdf, other]
Title: Time series compression: a survey
Comments: 33 pages, author version
Journal-ref: ACM Comput. Surv. 2022
Subjects: Databases (cs.DB)
[30]  arXiv:2101.08819 [pdf, other]
Title: Saguaro: An Edge Computing-Enabled Hierarchical Permissioned Blockchain
Subjects: Databases (cs.DB); Networking and Internet Architecture (cs.NI)
[31]  arXiv:2101.08929 [pdf, other]
Title: REPOSE: Distributed Top-k Trajectory Similarity Search with Local Reference Point Tries
Subjects: Databases (cs.DB)
[32]  arXiv:2101.09094 [pdf, other]
Title: Towards Expectation-Maximization by SQL in RDBMS
Comments: 12 pages
Journal-ref: Long version of DASFAA 2021
Subjects: Databases (cs.DB)
[33]  arXiv:2101.09441 [pdf, ps, other]
Title: DBL: Efficient Reachability Queries on Dynamic Graphs (Complete Version)
Subjects: Databases (cs.DB)
[34]  arXiv:2101.09668 [pdf, other]
Title: Multi-attributed Community Search in Road-social Networks
Subjects: Databases (cs.DB); Social and Information Networks (cs.SI)
[35]  arXiv:2101.10457 [pdf, other]
Title: Shift-Table: A Low-latency Learned Index for Range Queries using Model Correction
Subjects: Databases (cs.DB)
[36]  arXiv:2101.11259 [pdf, other]
Title: Alaska: A Flexible Benchmark for Data Integration Tasks
Subjects: Databases (cs.DB)
[37]  arXiv:2101.12158 [pdf, other]
Title: Beyond Equi-joins: Ranking, Enumeration and Factorization
Comments: 21 pages
Journal-ref: PVLDB, 14(11):2599-2612, 2021
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[38]  arXiv:2101.12289 [pdf, ps, other]
Title: Probabilistic Data with Continuous Distributions
Subjects: Databases (cs.DB)
[39]  arXiv:2101.12305 [pdf, other]
Title: Evaluating Complex Queries on Streaming Graphs
Comments: 18 pages; typos fixed; examples, experimental setup and analysis updated
Subjects: Databases (cs.DB)
[40]  arXiv:2101.12334 [pdf, other]
Title: sGrapp: Butterfly Approximation in Streaming Graphs
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[41]  arXiv:2101.12417 [pdf, other]
Title: Distributed Spatial-Keyword kNN Monitoring for Location-aware Pub/Sub
Comments: 10 pages
Subjects: Databases (cs.DB)
[42]  arXiv:2101.00172 (cross-list from cs.DS) [pdf, ps, other]
Title: Chunk List: Concurrent Data Structures
Comments: 20 pages, 3 figures A full implementation can be found at this https URL Update: Revised format to align closer to IEEE standards
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI)
[43]  arXiv:2101.00314 (cross-list from cs.DS) [pdf, other]
Title: SetSketch: Filling the Gap between MinHash and HyperLogLog
Authors: Otmar Ertl
Comments: VLDB 2021, extended version, 22 pages
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB)
[44]  arXiv:2101.01159 (cross-list from cs.DC) [pdf, other]
Title: New Directions in Cloud Programming
Journal-ref: CIDR 2021
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB); Operating Systems (cs.OS); Programming Languages (cs.PL)
[45]  arXiv:2101.01292 (cross-list from cs.LG) [pdf, other]
Title: GeCo: Quality Counterfactual Explanations in Real Time
Comments: 16 pages, 12 figures, 3 tables, 3 algorithms
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[46]  arXiv:2101.01898 (cross-list from cs.CR) [pdf, other]
Title: Connecting The Dots To Combat Collective Fraud
Authors: Mingxi Wu, Xi Chen
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[47]  arXiv:2101.01945 (cross-list from cs.DS) [pdf, other]
Title: Fine-Grained Complexity of Regular Path Queries
Journal-ref: Logical Methods in Computer Science, Volume 19, Issue 4 (November 27, 2023) lmcs:8625
Subjects: Data Structures and Algorithms (cs.DS); Computational Complexity (cs.CC); Databases (cs.DB); Formal Languages and Automata Theory (cs.FL)
[48]  arXiv:2101.02627 (cross-list from cs.CR) [pdf, other]
Title: Privacy-Preserving Data Publishing in Process Mining
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[49]  arXiv:2101.02969 (cross-list from cs.IR) [pdf, other]
Title: Spatial Object Recommendation with Hints: When Spatial Granularity Matters
Journal-ref: SIGIR Conference (2020) 781-790
Subjects: Information Retrieval (cs.IR); Databases (cs.DB); Machine Learning (cs.LG)
[50]  arXiv:2101.04102 (cross-list from cs.PL) [pdf, other]
Title: Query Lifting: Language-integrated query for heterogeneous nested collections
Comments: Full version of ESOP 2021 conference paper
Subjects: Programming Languages (cs.PL); Databases (cs.DB)
[51]  arXiv:2101.06126 (cross-list from cs.LG) [pdf, other]
Title: EAGER: Embedding-Assisted Entity Resolution for Knowledge Graphs
Comments: 10 pages, 7 figures
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[52]  arXiv:2101.06758 (cross-list from cs.DS) [pdf, ps, other]
Title: Data stream fusion for accurate quantile tracking and analysis
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[53]  arXiv:2101.06761 (cross-list from cs.CR) [pdf, other]
Title: A System for Efficiently Hunting for Cyber Threats in Computer Systems Using Threat Intelligence
Comments: Accepted paper at ICDE 2021 demonstrations track. arXiv admin note: substantial text overlap with arXiv:2010.13637
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Databases (cs.DB)
[54]  arXiv:2101.07026 (cross-list from cs.DC) [pdf, ps, other]
Title: Time-Efficient and High-Quality Graph Partitioning for Graph Dynamic Scaling
Comments: 21 pages, 15 figures. Under review
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB); Discrete Mathematics (cs.DM); Data Structures and Algorithms (cs.DS); Social and Information Networks (cs.SI)
[55]  arXiv:2101.07361 (cross-list from cs.LG) [pdf, other]
Title: Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification
Comments: Technical report of SIGMOD 2022 paper
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Databases (cs.DB)
[56]  arXiv:2101.07622 (cross-list from cs.DL) [pdf, other]
Title: Knowledge Graph for Microdata of Statistics Netherlands
Authors: Chang Sun
Subjects: Digital Libraries (cs.DL); Databases (cs.DB)
[57]  arXiv:2101.07731 (cross-list from cs.LG) [pdf, other]
Title: TC-DTW: Accelerating Multivariate Dynamic Time Warping Through Triangle Inequality and Point Clustering
Authors: Daniel Shen, Min Chi
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB)
[58]  arXiv:2101.07769 (cross-list from cs.CR) [pdf, other]
Title: A System for Automated Open-Source Threat Intelligence Gathering and Management
Comments: Accepted paper at SIGMOD 2021 demonstrations track
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Databases (cs.DB)
[59]  arXiv:2101.08167 (cross-list from cs.DC) [pdf, other]
Title: Neural-based Modeling for Performance Tuning of Spark Data Analytics
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB); Machine Learning (cs.LG)
[60]  arXiv:2101.08358 (cross-list from cs.LG) [pdf, other]
Title: Marius: Learning Massive Graph Embeddings on a Single Machine
Comments: Accepted into OSDI '21
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[61]  arXiv:2101.10231 (cross-list from cs.SE) [pdf, other]
Title: Creating a Virtuous Cycle in Performance Testing at MongoDB
Authors: David Daly
Comments: Author's copy and preprint. Accepted for publication at ICPE2021. 9 pages, 5 figures
Subjects: Software Engineering (cs.SE); Databases (cs.DB); Performance (cs.PF)
[62]  arXiv:2101.10699 (cross-list from cs.CR) [pdf, other]
Title: Measuring Decentralization in Bitcoin and Ethereum using Multiple Metrics and Granularities
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[63]  arXiv:2101.10905 (cross-list from cs.DS) [pdf, other]
Title: Sampling a Near Neighbor in High Dimensions -- Who is the Fairest of Them All?
Comments: arXiv admin note: text overlap with arXiv:1906.02640
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB); Machine Learning (cs.LG)
[64]  arXiv:2101.11727 (cross-list from cs.LO) [pdf, ps, other]
Title: Characterising Fixed Parameter Tractability of Query Evaluation over Guarded TGDs
Authors: Cristina Feier
Subjects: Logic in Computer Science (cs.LO); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Databases (cs.DB); Formal Languages and Automata Theory (cs.FL)
[65]  arXiv:2101.12602 (cross-list from cs.CR) [pdf, other]
Title: On the differential privacy of dynamic location obfuscation with personalized error bounds
Comments: 9 pages, 10 figures
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[66]  arXiv:2101.12631 (cross-list from cs.IR) [pdf, other]
Title: A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search
Comments: 28 pages, 21 figures, 24 tables, conference
Subjects: Information Retrieval (cs.IR); Databases (cs.DB)
[67]  arXiv:2101.12010 (cross-list from physics.soc-ph) [pdf, other]
Title: Modeling Spatial Nonstationarity via Deformable Convolutions for Deep Traffic Flow Prediction
Subjects: Physics and Society (physics.soc-ph); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Machine Learning (cs.LG); Systems and Control (eess.SY)
