Hardware Architecture

Authors and titles for cs.AR in Nov 2019

[ total of 23 entries: 1-23 ]
[1]  arXiv:1911.03364 [pdf, other]
Title: AMOEBA: A Coarse Grained Reconfigurable Architecture for Dynamic GPU Scaling
Subjects: Hardware Architecture (cs.AR)
[2]  arXiv:1911.05101 [pdf, other]
Title: Coordinated Management of DVFS and Cache Partitioning under QoS Constraints to Save Energy in Multi-Core Systems
Comments: Submitted to the Journal of Parallel and Distributed Computing (Nov 2019)
Subjects: Hardware Architecture (cs.AR)
[3]  arXiv:1911.05114 [pdf, other]
Title: Coordinated Management of Processor Configuration and Cache Partitioning to Optimize Energy under QoS Constraints
Comments: Submitted to the 34th IEEE International Parallel & Distributed Processing Symposium (IPDPS2020)
Subjects: Hardware Architecture (cs.AR)
[4]  arXiv:1911.06859 [pdf, other]
Title: NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[5]  arXiv:1911.07187 [pdf, other]
Title: FPGA Energy Efficiency by Leveraging Thermal Margin
Comments: Accepted in IEEE International Conference on Computer Design (ICCD) 2019
Subjects: Hardware Architecture (cs.AR)
[6]  arXiv:1911.08356 [pdf, other]
Title: Stream Semantic Registers: A Lightweight RISC-V ISA Extension Achieving Full Compute Utilization in Single-Issue Cores
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[7]  arXiv:1911.10349 [pdf, ps, other]
Title: Arsenal of Hardware Prefetchers
Subjects: Hardware Architecture (cs.AR)
[8]  arXiv:1911.11768 [pdf, ps, other]
Title: 3D IC optimal layout design. A parallel and distributed topological approach
Comments: 26 pages, 9 figures
Subjects: Hardware Architecture (cs.AR)
[9]  arXiv:1911.00267 (cross-list from cs.DC) [pdf, other]
Title: Optimal Metastability-Containing Sorting via Parallel Prefix Computation
Comments: This article generalizes and extends work presented at DATE 2018
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[10]  arXiv:1911.01258 (cross-list from cs.LG) [pdf, other]
Title: SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Network
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Neural and Evolutionary Computing (cs.NE); Performance (cs.PF)
[11]  arXiv:1911.02038 (cross-list from cs.CR) [pdf, other]
Title: Using Name Confusion to Enhance Security
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[12]  arXiv:1911.03451 (cross-list from cs.DC) [pdf, other]
Title: Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design
Comments: To appear in the 2020 26th International Symposium on High-Performance Computer Architecture (HPCA 2020)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[13]  arXiv:1911.03458 (cross-list from cs.DC) [pdf, other]
Title: MERIT: Tensor Transform for Memory-Efficient Vision Processing on Parallel Architectures
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[14]  arXiv:1911.04378 (cross-list from cs.CR) [pdf, other]
Title: DRAB-LOCUS: An Area-Efficient AES Architecture for Hardware Accelerator Co-Location on FPGAs
Comments: 16 pages, initial submission
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[15]  arXiv:1911.05289 (cross-list from cs.LG) [pdf, ps, other]
Title: The Deep Learning Revolution and Its Implications for Computer Architecture and Chip Design
Authors: Jeffrey Dean
Comments: Companion paper to accompany a keynote talk at ISSCC 2020
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Machine Learning (stat.ML)
[16]  arXiv:1911.05662 (cross-list from cs.DC) [pdf, other]
Title: Communication Lower Bound in Convolution Accelerators
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[17]  arXiv:1911.05664 (cross-list from cs.CR) [pdf, other]
Title: A Brief Review on Some Architectures Providing Support for DIFT
Authors: Ali Jahanshahi
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[18]  arXiv:1911.08384 (cross-list from cs.CR) [pdf, other]
Title: MuonTrap: Preventing Cross-Domain Spectre-Like Attacks by Capturing Speculative State
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[19]  arXiv:1911.09925 (cross-list from cs.DC) [pdf, other]
Title: Gemmini: Enabling Systematic Deep-Learning Architecture Evaluation via Full-Stack Integration
Comments: To appear at the 58th IEEE/ACM Design Automation Conference (DAC), December 2021, San Francisco, CA, USA
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Machine Learning (cs.LG); Performance (cs.PF)
[20]  arXiv:1911.10741 (cross-list from cs.ET) [pdf, other]
Title: Shenjing: A low power reconfigurable neuromorphic accelerator with partial-sum and spike networks-on-chip
Subjects: Emerging Technologies (cs.ET); Hardware Architecture (cs.AR); Neural and Evolutionary Computing (cs.NE)
[21]  arXiv:1911.11642 (cross-list from cs.PF) [pdf, other]
Title: System Performance with varying L1 Instruction and Data Cache Sizes: An Empirical Analysis
Comments: 5 Figures and 3 Tables
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR)
[22]  arXiv:1911.12815 (cross-list from cs.LG) [pdf, other]
Title: Neural Network-Inspired Analog-to-Digital Conversion to Achieve Super-Resolution with Low-Precision RRAM Devices
Comments: 7 pages, ICCAD 2019
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Signal Processing (eess.SP)
[23]  arXiv:1911.08097 (cross-list from eess.SP) [pdf, ps, other]
Title: AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers
Comments: 14 pages
Subjects: Signal Processing (eess.SP); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)