We gratefully acknowledge support from
the Simons Foundation and member institutions.

Hardware Architecture

Authors and titles for cs.AR in Jan 2022

[ total of 63 entries: 1-50 | 51-63 ]
[ showing 50 entries per page: fewer | more | all ]
[1]  arXiv:2201.00485 [pdf, other]
Title: Freeway to Memory Level Parallelism in Slice-Out-of-Order Cores
Subjects: Hardware Architecture (cs.AR)
[2]  arXiv:2201.00774 [pdf]
Title: Energy-efficient Non Uniform Last Level Caches for Chip-multiprocessors Based on Compression
Subjects: Hardware Architecture (cs.AR)
[3]  arXiv:2201.01089 [pdf, other]
Title: A Heterogeneous In-Memory Computing Cluster For Flexible End-to-End Inference of Real-World Deep Neural Networks
Comments: 14 pages (not including final biography page), 13 figures (excluded authors pictures)
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[4]  arXiv:2201.01385 [pdf, other]
Title: DR-STRaNGe: End-to-End System Design for DRAM-based True Random Number Generators
Subjects: Hardware Architecture (cs.AR)
[5]  arXiv:2201.01509 [pdf]
Title: ADRA: Extending Digital Computing-in-Memory with Asymmetric Dual-Row-Activation
Subjects: Hardware Architecture (cs.AR)
[6]  arXiv:2201.02855 [pdf, other]
Title: A System-Level Framework for Analytical and Empirical Reliability Exploration of STT-MRAM Caches
Subjects: Hardware Architecture (cs.AR)
[7]  arXiv:2201.03558 [pdf, other]
Title: Studying the Potential of Automatic Optimizations in the Intel FPGA SDK for OpenCL
Comments: Presented in FPGA'20 as a poster. Proceedings of the 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. 2020
Subjects: Hardware Architecture (cs.AR)
[8]  arXiv:2201.04373 [pdf, other]
Title: TA-LRW: A Replacement Policy for Error Rate Reduction in STT-MRAM Caches
Subjects: Hardware Architecture (cs.AR)
[9]  arXiv:2201.04562 [pdf, other]
Title: Reduced Softmax Unit for Deep Neural Network Accelerators
Authors: Raghuram S
Subjects: Hardware Architecture (cs.AR)
[10]  arXiv:2201.05072 [pdf, other]
Title: SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems
Comments: To appear in the Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS) 2022 and the ACM SIGMETRICS 2022 conference
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[11]  arXiv:2201.05232 [pdf, other]
Title: FARSI: Facebook AR System Investigator for Agile Domain-Specific System-on-Chip Exploration
Subjects: Hardware Architecture (cs.AR)
[12]  arXiv:2201.05698 [pdf]
Title: Overview of contemporary systems driven by open-design movement
Comments: 27 pages, 10 Figures, 1 Table
Subjects: Hardware Architecture (cs.AR); Computers and Society (cs.CY); Software Engineering (cs.SE)
[13]  arXiv:2201.06077 [pdf]
Title: PolicyCLOUD: A prototype of a Cloud Serverless Ecosystem for Policy Analytics
Comments: 18 pages + 5 reference pages
Subjects: Hardware Architecture (cs.AR)
[14]  arXiv:2201.06853 [pdf, other]
Title: VAR-DRAM: Variation-Aware Framework for Efficient Dynamic Random Access Memory Design
Subjects: Hardware Architecture (cs.AR)
[15]  arXiv:2201.07498 [pdf, other]
Title: A Mixed Precision, Multi-GPU Design for Large-scale Top-K Sparse Eigenproblems
Subjects: Hardware Architecture (cs.AR)
[16]  arXiv:2201.07634 [pdf, other]
Title: FAT: An In-Memory Accelerator with Fast Addition for Ternary Weight Neural Networks
Comments: 14 pages
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[17]  arXiv:2201.08022 [pdf, other]
Title: HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[18]  arXiv:2201.08357 [pdf]
Title: The Specialized High-Performance Network on Anton 3
Comments: Accepted by the 28th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2022)
Subjects: Hardware Architecture (cs.AR)
[19]  arXiv:2201.08603 [pdf, other]
Title: Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration
Comments: 20 pages
Subjects: Hardware Architecture (cs.AR)
[20]  arXiv:2201.08656 [pdf, other]
Title: Dustin: A 16-Cores Parallel Ultra-Low-Power Cluster with 2b-to-32b Fully Flexible Bit-Precision and Vector Lockstep Execution Mode
Comments: 13 pages, 17 figures, 2 tables, Journal
Subjects: Hardware Architecture (cs.AR)
[21]  arXiv:2201.08830 [pdf, other]
Title: APack: Off-Chip, Lossless Data Compression for Efficient Deep Learning Inference
Authors: Alberto Delmas Lascorz (1), Mostafa Mahmoud (1), Andreas Moshovos (1 and 2) ((1) University of Toronto (2) Vector Institute)
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[22]  arXiv:2201.08916 [pdf, other]
Title: Enabling Flexibility for Sparse Tensor Acceleration via Heterogeneity
Subjects: Hardware Architecture (cs.AR)
[23]  arXiv:2201.08978 [pdf, other]
Title: Shire: Making FPGA-accelerated Middlebox Development More Pleasant
Subjects: Hardware Architecture (cs.AR)
[24]  arXiv:2201.09189 [pdf, other]
Title: Hardware/Software Co-Programmable Framework for Computational SSDs to Accelerate Deep Learning Service on Large-Scale Graphs
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[25]  arXiv:2201.09668 [pdf, ps, other]
Title: Variability aware Golden Reference Free methodology for Hardware Trojan Detection Using Robust Delay Analysis
Comments: 17 pages, 10 figures, 3 algorithms
Subjects: Hardware Architecture (cs.AR); Cryptography and Security (cs.CR)
[26]  arXiv:2201.09670 [pdf]
Title: Low hardware consumption, resolution-configurable Gray code oscillator time-to-digital converters implemented in 16nm, 20nm and 28nm FPGAs
Comments: 9 pages, 9 figures
Subjects: Hardware Architecture (cs.AR)
[27]  arXiv:2201.11409 [pdf, ps, other]
Title: On the RTL Implementation of FINN Matrix Vector Compute Unit
Comments: 22 pages, 7 tables, 16 figures
Subjects: Hardware Architecture (cs.AR)
[28]  arXiv:2201.11638 [pdf, other]
Title: Reuse-Aware Cache Partitioning Framework for Data-Sharing Multicore Systems
Comments: 2 pages. 7th IEEE International Symposium on Smart Electronic Systems (iSES) 2021
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[29]  arXiv:2201.11978 [pdf]
Title: Testable Array Multipliers for a Better Utilization of C-Testability and Bijectivity
Comments: 6 pages,8 figures
Subjects: Hardware Architecture (cs.AR); Logic in Computer Science (cs.LO)
[30]  arXiv:2201.12027 [pdf, other]
Title: Puppeteer: A Random Forest-based Manager for Hardware Prefetchers across the Memory Hierarchy
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); Performance (cs.PF)
[31]  arXiv:2201.12480 [pdf, other]
Title: Interconnect Parasitics and Partitioning in Fully-Analog In-Memory Computing Architectures
Comments: 5 pages, 6 figures
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[32]  arXiv:2201.12861 [pdf, other]
Title: Neural-PIM: Efficient Processing-In-Memory with Neural Approximation of Peripherals
Comments: 14 pages, 13 figures, Published in IEEE Transactions on Computers
Journal-ref: IEEE Transactions on Computers, 2021
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[33]  arXiv:2201.13056 [pdf, ps, other]
Title: The complexity gap in the static analysis of cache accesses grows if procedure calls are added
Authors: David Monniaux (VERIMAG - IMAG)
Subjects: Hardware Architecture (cs.AR); Computational Complexity (cs.CC); Formal Languages and Automata Theory (cs.FL); Programming Languages (cs.PL)
[34]  arXiv:2201.00594 (cross-list from cs.NI) [pdf, other]
Title: A Priority-Aware Multiqueue NIC Design
Comments: The 37th ACM/SIGAPP Symposium on Applied Computing (SAC '22)
Subjects: Networking and Internet Architecture (cs.NI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Operating Systems (cs.OS)
[35]  arXiv:2201.01130 (cross-list from cs.CR) [pdf, other]
Title: Reusing Verification Assertions as Security Checkers for Hardware Trojan Detection
Comments: 6 pages, 6 figures
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[36]  arXiv:2201.01834 (cross-list from cs.CR) [pdf, other]
Title: Secure Remote Attestation with Strong Key Insulation Guarantees
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[37]  arXiv:2201.01863 (cross-list from cs.LG) [pdf, other]
Title: CFU Playground: Full-Stack Open-Source Framework for Tiny Machine Learning (tinyML) Acceleration on FPGAs
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[38]  arXiv:2201.02789 (cross-list from cs.DC) [pdf, other]
Title: A Compiler Framework for Optimizing Dynamic Parallelism on GPUs
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[39]  arXiv:2201.02986 (cross-list from cs.CR) [pdf, other]
Title: A Retrospective and Futurespective of Rowhammer Attacks and Defenses on DRAM
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[40]  arXiv:2201.03166 (cross-list from cs.IT) [pdf, other]
Title: Spatiotemporal 2-D Channel Coding for Very Low Latency Reliable MIMO Transmission
Subjects: Information Theory (cs.IT); Hardware Architecture (cs.AR)
[41]  arXiv:2201.03386 (cross-list from cs.SD) [pdf, other]
Title: Sub-mW Keyword Spotting on an MCU: Analog Binary Feature Extraction and Binary Neural Networks
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Audio and Speech Processing (eess.AS)
[42]  arXiv:2201.03861 (cross-list from cs.DC) [pdf, other]
Title: HEROv2: Full-Stack Open-Source Research Platform for Heterogeneous Computing
Comments: 14 pages, 9 figures, 3 tables
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Performance (cs.PF)
[43]  arXiv:2201.03950 (cross-list from cs.DC) [pdf, other]
Title: High Throughput Multidimensional Tridiagonal Systems Solvers on FPGAs
Comments: Under review
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[44]  arXiv:2201.05884 (cross-list from cs.PF) [pdf, other]
Title: Calipers: A Criticality-aware Framework for Modeling Processor Performance
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR)
[45]  arXiv:2201.06703 (cross-list from cs.ET) [pdf, other]
Title: Design Space Exploration of Dense and Sparse Mapping Schemes for RRAM Architectures
Comments: Accepted at 2022 IEEE International Symposium on Circuits and Systems (ISCAS). [v2] Fixed incorrectly labeled author affiliations for Chenqi Li, Amirali Amirsoleimani, and Roman Genov
Subjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[46]  arXiv:2201.06848 (cross-list from cs.LG) [pdf, other]
Title: High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[47]  arXiv:2201.07375 (cross-list from cs.CR) [pdf, other]
Title: A 333.9uW 0.158mm$^2$ Saber Learning with Rounding based Post-Quantum Crypto Accelerator
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[48]  arXiv:2201.08413 (cross-list from cs.LG) [pdf, other]
Title: Unicorn: Reasoning about Configurable System Performance through the lens of Causality
Comments: EuroSys 2022 (camera-ready)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[49]  arXiv:2201.08442 (cross-list from cs.LG) [pdf, other]
Title: Neural Network Quantization with AI Model Efficiency Toolkit (AIMET)
Comments: arXiv admin note: substantial text overlap with arXiv:2106.08295
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Performance (cs.PF); Software Engineering (cs.SE)
[50]  arXiv:2201.08455 (cross-list from cs.LG) [pdf, other]
Title: Hybrid Graph Models for Logic Optimization via Spatio-Temporal Information
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[ total of 63 entries: 1-50 | 51-63 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2208, contact, help  (Access key information)