Performance

Authors and titles for cs.PF in Apr 2021, skipping first 25

[ total of 51 entries: 1-25 | 26-50 | 51 ]
[26]  arXiv:2104.07097 (cross-list from cs.CG) [pdf, other]
Title: Novel Matrix Hit and Run for Sampling Polytopes and Its GPU Implementation
Subjects: Computational Geometry (cs.CG); Mathematical Software (cs.MS); Performance (cs.PF)
[27]  arXiv:2104.07582 (cross-list from cs.AR) [pdf, other]
Title: SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems
Comments: Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO'21), 2021
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Data Structures and Algorithms (cs.DS); Performance (cs.PF)
[28]  arXiv:2104.07857 (cross-list from cs.DC) [pdf, other]
Title: ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[29]  arXiv:2104.08364 (cross-list from cs.DC) [pdf, other]
Title: Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep Learning
Comments: 15 pages, 16 figures, 6 tables, ICDCS'21
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF)
[30]  arXiv:2104.08396 (cross-list from cs.DC) [pdf, ps, other]
Title: Yelp Dataset Analysis using Scalable Big Data
Comments: 4 pages, 11 figures, 4 tables
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[31]  arXiv:2104.08416 (cross-list from cs.MS) [pdf, other]
Title: Boosting Memory Access Locality of the Spectral Element Method with Hilbert Space-Filling Curves
Comments: 23 pages, 12 figures
Subjects: Mathematical Software (cs.MS); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[32]  arXiv:2104.08571 (cross-list from cs.DC) [pdf, other]
Title: Ripple : Simplified Large-Scale Computation on Heterogeneous Architectures with Polymorphic Data Layout
Comments: Preprint submitted to the Journal of Parallel and Distributed Computing
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[33]  arXiv:2104.09433 (cross-list from cs.SE) [pdf, ps, other]
Title: A Choreographed Outline Instrumentation Algorithm for Asynchronous Components
Subjects: Software Engineering (cs.SE); Performance (cs.PF)
[34]  arXiv:2104.10030 (cross-list from cs.SE) [pdf, other]
Title: Reproducibility Report for the Paper: QN-based Modeling and Analysis of Software Performance Antipatterns for Cyber-Physical Systems
Subjects: Software Engineering (cs.SE); Performance (cs.PF)
[35]  arXiv:2104.10873 (cross-list from cs.LG) [pdf, other]
Title: Mosaic Flows: A Transferable Deep Learning Framework for Solving PDEs on Unseen Domains
Comments: 23 pages, 10 figures
Subjects: Machine Learning (cs.LG); Performance (cs.PF); Computational Physics (physics.comp-ph)
[36]  arXiv:2104.11069 (cross-list from cs.SE) [pdf, other]
Title: Online GANs for Automatic Performance Testing
Comments: 5th International Workshop on Testing Extra-Functional Properties and Quality Characteristics of Software Systems
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG); Performance (cs.PF)
[37]  arXiv:2104.12893 (cross-list from cs.SE) [pdf, other]
Title: Performance Testing Using a Smart Reinforcement Learning-Driven Test Agent
Comments: 10 pages, IEEE Congress on Evolutionary Computation 2021
Subjects: Software Engineering (cs.SE); Performance (cs.PF)
[38]  arXiv:2104.13242 (cross-list from cs.LG) [pdf, other]
Title: Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization (extended version)
Comments: Submitted to CCPE journal. arXiv admin note: substantial text overlap with arXiv:2010.08040
Subjects: Machine Learning (cs.LG); Performance (cs.PF)
[39]  arXiv:2104.13732 (cross-list from cs.LG) [pdf, other]
Title: A Reinforcement Learning Environment for Polyhedral Optimizations
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Discrete Mathematics (cs.DM); Performance (cs.PF)
[40]  arXiv:2104.13774 (cross-list from cs.NI) [pdf, other]
Title: Scouting the Path to a Million-Client Server
Journal-ref: In: Hohlfeld O., Lutu A., Levin D. (eds) Passive and Active Measurement. PAM 2021. Lecture Notes in Computer Science, vol 12671. Springer, Cham
Subjects: Networking and Internet Architecture (cs.NI); Performance (cs.PF)
[41]  arXiv:2104.14050 (cross-list from cs.DC) [pdf, other]
Title: The Hidden cost of the Edge: A Performance Comparison of Edge and Cloud Latencies
Comments: 15 pages, 10 figures
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[42]  arXiv:2104.14246 (cross-list from cs.DC) [pdf, other]
Title: Legio: Fault Resiliency for Embarrassingly Parallel MPI Applications
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[43]  arXiv:2104.14392 (cross-list from cs.DC) [pdf, other]
Title: COSCO: Container Orchestration using Co-Simulation and Gradient Based Optimization for Fog Computing Environments
Comments: Accepted in IEEE Transactions on Parallel and Distributed Systems, 2021
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[44]  arXiv:2104.14447 (cross-list from math.NA) [pdf, other]
Title: Parallel implementation of a compatible high-order meshless method for the Stokes' equations
Subjects: Numerical Analysis (math.NA); Distributed, Parallel, and Cluster Computing (cs.DC); Mathematical Software (cs.MS); Performance (cs.PF); Analysis of PDEs (math.AP)
[45]  arXiv:2104.14677 (cross-list from cs.LG) [pdf, other]
Title: Search Algorithms for Automated Hyper-Parameter Tuning
Comments: 10 pages, 3 figures, 1 table
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Performance (cs.PF)
[46]  arXiv:2104.15109 (cross-list from cs.CR) [pdf, other]
Title: Memory-Efficient Deep Learning Inference in Trusted Execution Environments
Comments: To Appear in the 9th IEEE International Conference on Cloud Engineering (IC2E 21)
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Performance (cs.PF)
[47]  arXiv:2104.04579 (cross-list from physics.comp-ph) [pdf, ps, other]
Title: High Performance Implementation of Boris Particle Pusher on DPC++. A First Look at oneAPI
Subjects: Computational Physics (physics.comp-ph); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[48]  arXiv:2104.08037 (cross-list from math.PR) [pdf, other]
Title: The generalized join the shortest orbit queue system: Stability, exact tail asymptotics and stationary approximations
Subjects: Probability (math.PR); Performance (cs.PF)
[49]  arXiv:2104.10698 (cross-list from quant-ph) [pdf, other]
Title: Scalable Benchmarks for Gate-Based Quantum Computers
Comments: 54 pages, many figures
Subjects: Quantum Physics (quant-ph); Performance (cs.PF)
[50]  arXiv:2104.13350 (cross-list from math.DS) [pdf, other]
Title: Queues with Updating Information: Finding the Amplitude of Oscillations
Subjects: Dynamical Systems (math.DS); Performance (cs.PF); Probability (math.PR)