Performance

Authors and titles for cs.PF in Mar 2023

[ total of 39 entries: 1-25 | 26-39 ]
[1]  arXiv:2303.05016 [pdf, other]
Title: Performance Characterization of using Quantization for DNN Inference on Edge Devices: Extended Version
Comments: Extended version of accepted short paper by ICFEC 2023
Subjects: Performance (cs.PF); Signal Processing (eess.SP)
[2]  arXiv:2303.05919 [pdf, other]
Title: eBPF-based Working Set Size Estimation in Memory Management
Comments: 8 pages, 6 figures
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[3]  arXiv:2303.06073 [pdf, other]
Title: I Tag, You Tag, Everybody Tags!
Comments: 8 pages, 8 figures
Subjects: Performance (cs.PF); Computers and Society (cs.CY)
[4]  arXiv:2303.06153 [pdf, other]
Title: CXLMemSim: A pure software simulated CXL.mem for performance characterization
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Operating Systems (cs.OS)
[5]  arXiv:2303.10844 [pdf, other]
Title: Analyzing the Performance of the Inter-Blockchain Communication Protocol
Comments: Accepted at the 53rd IEEE/IFIP DSN 2023
Subjects: Performance (cs.PF); Distributed, Parallel, and Cluster Computing (cs.DC)
[6]  arXiv:2303.11110 [pdf, other]
Title: Runtime-Adaptable Selective Performance Instrumentation
Comments: To be published in the proceedings of the 28th International Workshop on High-Level Parallel Programming Models and Supportive Environments
Subjects: Performance (cs.PF)
[7]  arXiv:2303.11733 [pdf, other]
Title: DIPPM: a Deep Learning Inference Performance Predictive Model using Graph Neural Networks
Subjects: Performance (cs.PF); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[8]  arXiv:2303.12258 [pdf, other]
Title: How does SSD Cluster Perform for Distributed File Systems: An Empirical Study
Comments: Accepted by Concurrency and Computation: Practice and Experience
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Applications (stat.AP)
[9]  arXiv:2303.15375 [pdf, other]
Title: Demystifying CXL Memory with Genuine CXL-Ready Systems and Devices
Comments: This paper has been accepted by MICRO'23. Please refer to this https URL for the official version of this paper
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR)
[10]  arXiv:2303.15763 [pdf, other]
Title: Characterizing the Performance of Emerging Deep Learning, Graph, and High Performance Computing Workloads Under Interference
Subjects: Performance (cs.PF)
[11]  arXiv:2303.01243 (cross-list from cs.LG) [pdf, other]
Title: Poster: Sponge ML Model Attacks of Mobile Apps
Comments: 2 pages, 6 figures. Proceedings of the 24th International Workshop on Mobile Computing Systems and Applications (HotMobile). Feb. 2023
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Performance (cs.PF)
[12]  arXiv:2303.01845 (cross-list from cs.DC) [pdf, other]
Title: Extreme-scale many-against-many protein similarity search
Comments: 2022 ACM Gordon Bell Prize Finalist
Journal-ref: SC'22: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, 2022
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Genomics (q-bio.GN)
[13]  arXiv:2303.04274 (cross-list from cs.LG) [pdf, other]
Title: Amplitude-Varying Perturbation for Balancing Privacy and Utility in Federated Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Performance (cs.PF)
[14]  arXiv:2303.04739 (cross-list from cs.CV) [pdf, other]
Title: Advancing Direct Convolution using Convolution Slicing Optimization and ISA Extensions
Comments: 15 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR); Machine Learning (cs.LG); Performance (cs.PF)
[15]  arXiv:2303.04769 (cross-list from cs.MS) [pdf, other]
Title: SMaLL: A Software Framework for portable Machine Learning Libraries
Comments: 14 pages, 12 figures
Subjects: Mathematical Software (cs.MS); Performance (cs.PF)
[16]  arXiv:2303.04878 (cross-list from cs.LG) [pdf, other]
Title: DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks
Subjects: Machine Learning (cs.LG); Performance (cs.PF); Software Engineering (cs.SE)
[17]  arXiv:2303.05098 (cross-list from cs.LG) [pdf, other]
Title: Optimizing Sparse Linear Algebra Through Automatic Format Selection and Machine Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Numerical Analysis (math.NA)
[18]  arXiv:2303.05295 (cross-list from cs.LG) [pdf, other]
Title: Dynamic Stashing Quantization for Efficient Transformer Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Performance (cs.PF)
[19]  arXiv:2303.05732 (cross-list from cs.SE) [pdf, ps, other]
Title: Securing Safety in Collaborative Cyber-Physical Systems through Fault Criticality Analysis
Comments: This paper is an extended version of an article submitted to KCSE-2021
Journal-ref: KIPS Transactions on Software and Data Engineering, vol. 10, no. 8, pp. 287-300, 2021
Subjects: Software Engineering (cs.SE); Performance (cs.PF); Systems and Control (eess.SY)
[20]  arXiv:2303.06318 (cross-list from cs.LG) [pdf, other]
Title: A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[21]  arXiv:2303.06851 (cross-list from cs.DC) [pdf, other]
Title: On the Regret of Online Edge Service Hosting
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF)
[22]  arXiv:2303.06865 (cross-list from cs.LG) [pdf, other]
Title: FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Performance (cs.PF)
[23]  arXiv:2303.07523 (cross-list from cs.MM) [pdf, other]
Title: Investigating the Characteristics and Performance of Augmented Reality Applications on Head-Mounted Displays: A Study of the Hololens Application Store
Comments: This paper has been accepted for publication by IEEE ICC workshops 2023
Subjects: Multimedia (cs.MM); Performance (cs.PF); Software Engineering (cs.SE)
[24]  arXiv:2303.08253 (cross-list from cs.LG) [pdf, other]
Title: R2 Loss: Range Restriction Loss for Model Compression and Quantization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF); Image and Video Processing (eess.IV)
[25]  arXiv:2303.10001 (cross-list from cs.NI) [pdf, other]
Title: Improving Data Transfer Efficiency for AIs in the DareFightingICE using gRPC
Comments: The paper is made publicly available for prospective participants of the 2023 DareFightingICE Competition (this https URL). It has been accepted for presentation at the 2023 8th International Conference on Business and Industrial Research (this https URL).
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Performance (cs.PF)