
Performance

Authors and titles for recent submissions

[ total of 13 entries: 1-13 ]

Thu, 14 Nov 2019

[1]  arXiv:1911.05181 (cross-list from cs.LG) [pdf, other]
Title: 92c/MFlops/s, Ultra-Large-Scale Neural-Network Training on a PIII Cluster
Comments: SC '00: Proceedings of the 2000 ACM/IEEE Conference on Supercomputing
Journal-ref: ACM/IEEE SC 2000 Conference (SC00)
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Machine Learning (stat.ML)
[2]  arXiv:1911.05146 (cross-list from cs.DC) [pdf, other]
Title: HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training using TensorFlow
Comments: 15 pages, 16 figures, under double-blind review at a conference
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)

Wed, 13 Nov 2019

[3]  arXiv:1911.04946 (cross-list from cs.LG) [pdf, other]
Title: Optimizing Deep Learning Inference on Embedded Systems Through Adaptive Model Selection
Comments: Accepted to be published at ACM TECS. arXiv admin note: substantial text overlap with arXiv:1805.04252
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[4]  arXiv:1911.04650 (cross-list from cs.DC) [pdf, other]
Title: Throughput Prediction of Asynchronous SGD in TensorFlow
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF)

Tue, 12 Nov 2019

[5]  arXiv:1911.04200 (cross-list from cs.CE) [pdf, other]
Title: Communication-Efficient Jaccard Similarity for High-Performance Distributed Genome Comparisons
Subjects: Computational Engineering, Finance, and Science (cs.CE); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Genomics (q-bio.GN)
[6]  arXiv:1911.03456 (cross-list from cs.DC) [pdf, other]
Title: Parallel Data Distribution Management on Shared-Memory Multiprocessors
Comments: Accepted for publication in the ACM Transactions on Modeling and Computer Simulation (ACM TOMACS). arXiv admin note: text overlap with arXiv:1703.06680
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Data Structures and Algorithms (cs.DS); Performance (cs.PF)

Mon, 11 Nov 2019

[7]  arXiv:1911.03282 [pdf, ps, other]
Title: nanoBench: A Low-Overhead Tool for Running Microbenchmarks on x86 Systems
Subjects: Performance (cs.PF)
[8]  arXiv:1911.02987 [pdf, other]
Title: The Pitfall of Evaluating Performance on Emerging AI Accelerators
Subjects: Performance (cs.PF); Machine Learning (cs.LG)
[9]  arXiv:1911.03062 (cross-list from physics.comp-ph) [pdf, other]
Title: Digital Blood in Massively Parallel CPU/GPU Systems for the Study of Platelet Transport
Subjects: Computational Physics (physics.comp-ph); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[10]  arXiv:1911.03011 (cross-list from cs.LG) [pdf, other]
Title: Adaptive Kernel Value Caching for SVM Training
Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
Subjects: Machine Learning (cs.LG); Performance (cs.PF); Machine Learning (stat.ML)

Thu, 7 Nov 2019

[11]  arXiv:1911.02430 [pdf, other]
Title: Graph-based Approach for Buffer-aware Timing Analysis of Heterogeneous Wormhole NoCs under Bursty Traffic
Comments: 21 pages, 22 figures, 5 tables
Subjects: Performance (cs.PF)
[12]  arXiv:1911.02549 (cross-list from cs.LG) [pdf, other]
[13]  arXiv:1911.02373 (cross-list from cs.DC) [pdf, other]
Title: KLARAPTOR: A Tool for Dynamically Finding Optimal Kernel Launch Parameters Targeting CUDA Programs
Comments: 10 pages. arXiv admin note: text overlap with arXiv:1906.00142
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
