We gratefully acknowledge support from
the Simons Foundation and member institutions.

Hardware Architecture

Authors and titles for cs.AR in Jan 2022

[ total of 63 entries: 1-25 | 26-50 | 51-63 ]
[ showing 25 entries per page: fewer | more | all ]
[1]  arXiv:2201.00485 [pdf, other]
Title: Freeway to Memory Level Parallelism in Slice-Out-of-Order Cores
Subjects: Hardware Architecture (cs.AR)
[2]  arXiv:2201.00774 [pdf, ps, other]
Title: Energy-efficient Non Uniform Last Level Caches for Chip-multiprocessors Based on Compression
Subjects: Hardware Architecture (cs.AR)
[3]  arXiv:2201.01089 [pdf, other]
Title: A Heterogeneous In-Memory Computing Cluster For Flexible End-to-End Inference of Real-World Deep Neural Networks
Comments: 14 pages (not including final biography page), 13 figures (excluded authors pictures)
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[4]  arXiv:2201.01385 [pdf, other]
Title: DR-STRaNGe: End-to-End System Design for DRAM-based True Random Number Generators
Subjects: Hardware Architecture (cs.AR)
[5]  arXiv:2201.01509 [pdf, ps, other]
Title: ADRA: Extending Digital Computing-in-Memory with Asymmetric Dual-Row-Activation
Subjects: Hardware Architecture (cs.AR)
[6]  arXiv:2201.02855 [pdf, other]
Title: A System-Level Framework for Analytical and Empirical Reliability Exploration of STT-MRAM Caches
Subjects: Hardware Architecture (cs.AR)
[7]  arXiv:2201.03558 [pdf, other]
Title: Studying the Potential of Automatic Optimizations in the Intel FPGA SDK for OpenCL
Comments: Presented in FPGA'20 as a poster. Proceedings of the 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. 2020
Subjects: Hardware Architecture (cs.AR)
[8]  arXiv:2201.04373 [pdf, other]
Title: TA-LRW: A Replacement Policy for Error Rate Reduction in STT-MRAM Caches
Subjects: Hardware Architecture (cs.AR)
[9]  arXiv:2201.04562 [pdf, other]
Title: Reduced Softmax Unit for Deep Neural Network Accelerators
Authors: Raghuram S
Subjects: Hardware Architecture (cs.AR)
[10]  arXiv:2201.05072 [pdf, other]
Title: SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems
Comments: To appear in the Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS) 2022 and the ACM SIGMETRICS 2022 conference
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[11]  arXiv:2201.05232 [pdf, other]
Title: FARSI: Facebook AR System Investigator for Agile Domain-Specific System-on-Chip Exploration
Subjects: Hardware Architecture (cs.AR)
[12]  arXiv:2201.05698 [pdf, ps, other]
Title: Overview of contemporary systems driven by open-design movement
Comments: 27 pages, 10 Figures, 1 Table
Subjects: Hardware Architecture (cs.AR); Computers and Society (cs.CY); Software Engineering (cs.SE)
[13]  arXiv:2201.06077 [pdf, ps, other]
Title: PolicyCLOUD: A prototype of a Cloud Serverless Ecosystem for Policy Analytics
Comments: 18 pages + 5 reference pages
Subjects: Hardware Architecture (cs.AR)
[14]  arXiv:2201.06853 [pdf, other]
Title: VAR-DRAM: Variation-Aware Framework for Efficient Dynamic Random Access Memory Design
Subjects: Hardware Architecture (cs.AR)
[15]  arXiv:2201.07498 [pdf, other]
Title: A Mixed Precision, Multi-GPU Design for Large-scale Top-K Sparse Eigenproblems
Subjects: Hardware Architecture (cs.AR)
[16]  arXiv:2201.07634 [pdf, other]
Title: FAT: An In-Memory Accelerator with Fast Addition for Ternary Weight Neural Networks
Comments: 14 pages
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[17]  arXiv:2201.08022 [pdf, other]
Title: HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks
Comments: 5 pages, 2022 IEEE International Symposium on Circuits and Systems (ISCAS)
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[18]  arXiv:2201.08357 [pdf, ps, other]
Title: The Specialized High-Performance Network on Anton 3
Comments: Accepted by the 28th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2022)
Subjects: Hardware Architecture (cs.AR)
[19]  arXiv:2201.08603 [pdf, other]
Title: Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration
Comments: 20 pages
Subjects: Hardware Architecture (cs.AR)
[20]  arXiv:2201.08656 [pdf, other]
Title: Dustin: A 16-Cores Parallel Ultra-Low-Power Cluster with 2b-to-32b Fully Flexible Bit-Precision and Vector Lockstep Execution Mode
Comments: 13 pages, 17 figures, 2 tables, Journal
Subjects: Hardware Architecture (cs.AR)
[21]  arXiv:2201.08830 [pdf, other]
Title: APack: Off-Chip, Lossless Data Compression for Efficient Deep Learning Inference
Authors: Alberto Delmas Lascorz (1), Mostafa Mahmoud (1), Andreas Moshovos (1 and 2) ((1) University of Toronto (2) Vector Institute)
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[22]  arXiv:2201.08916 [pdf, other]
Title: Enabling Flexibility for Sparse Tensor Acceleration via Heterogeneity
Subjects: Hardware Architecture (cs.AR)
[23]  arXiv:2201.08978 [pdf, other]
Title: Rosebud: Making FPGA-Accelerated Middlebox Development More Pleasant
Comments: 20 pages. Final version, to appear in ASPLOS23
Subjects: Hardware Architecture (cs.AR)
[24]  arXiv:2201.09189 [pdf, other]
Title: Hardware/Software Co-Programmable Framework for Computational SSDs to Accelerate Deep Learning Service on Large-Scale Graphs
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[25]  arXiv:2201.09668 [pdf, ps, other]
Title: Variability aware Golden Reference Free methodology for Hardware Trojan Detection Using Robust Delay Analysis
Comments: 17 pages, 10 figures, 3 algorithms
Subjects: Hardware Architecture (cs.AR); Cryptography and Security (cs.CR)
[ total of 63 entries: 1-25 | 26-50 | 51-63 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help  (Access key information)