We gratefully acknowledge support from
the Simons Foundation and member institutions.

Hardware Architecture

Authors and titles for cs.AR in Dec 2020

[ total of 56 entries: 1-50 | 51-56 ]
[ showing 50 entries per page: fewer | more | all ]
[1]  arXiv:2012.00050 [pdf, other]
Title: Aging-Aware Request Scheduling for Non-Volatile Main Memory
Comments: To appear in ASP-DAC 2021
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[2]  arXiv:2012.00102 [pdf]
Title: HeM3D: Heterogeneous Manycore Architecture Based on Monolithic 3D Vertical Integration
Comments: This work has been accepted in ACM Transactions on Design Automation of Electronic Systems
Subjects: Hardware Architecture (cs.AR)
[3]  arXiv:2012.00158 [pdf, other]
Title: Accelerating Bandwidth-Bound Deep Learning Inference with Main-Memory Accelerators
Subjects: Hardware Architecture (cs.AR)
[4]  arXiv:2012.01267 [pdf, other]
Title: Multivalued circuits and Interconnect issues
Authors: Daniel Etiemble
Comments: 6 pages, 12 figures, preprint
Subjects: Hardware Architecture (cs.AR)
[5]  arXiv:2012.01353 [pdf, other]
Title: Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines
Subjects: Hardware Architecture (cs.AR)
[6]  arXiv:2012.01571 [pdf, other]
Title: Online Model Swapping in Architectural Simulation
Subjects: Hardware Architecture (cs.AR)
[7]  arXiv:2012.02037 [pdf, other]
Title: Characteristics of Reversible Circuits for Error Detection
Comments: 6 pages, 9 figures
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[8]  arXiv:2012.02791 [pdf, other]
Title: A Unified Model for Gate Level Propagation Analysis
Authors: Jeremy Blackstone (University of California, San Diego, USA), Wei Hu (Northwestern Polytechnical University, China), Alric Althoff, Armaiti Ardeshiricham (University of California, San Diego, USA), Lu Zhang (Northwestern Polytechnical University, China), Ryan Kastner (University of California, San Diego, USA)
Subjects: Hardware Architecture (cs.AR)
[9]  arXiv:2012.02890 [pdf, other]
Title: Towards a Domain Specific Solution for a New Generation of Wireless Modems
Comments: 49 pages
Subjects: Hardware Architecture (cs.AR)
[10]  arXiv:2012.02973 [pdf, other]
Title: MemPool: A Shared-L1 Memory Many-Core Cluster with a Low-Latency Interconnect
Comments: Accepted for publication in the Design, Automation and Test in Europe (DATE) Conference 2021
Subjects: Hardware Architecture (cs.AR)
[11]  arXiv:2012.03112 [pdf, other]
Title: A Modern Primer on Processing in Memory
Comments: arXiv admin note: substantial text overlap with arXiv:1903.03988
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[12]  arXiv:2012.03177 [pdf, other]
Title: Systolic-CNN: An OpenCL-defined Scalable Run-time-flexible FPGA Accelerator Architecture for Accelerating Convolutional Neural Network Inference in Cloud/Edge Computing
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[13]  arXiv:2012.03481 [pdf, other]
Title: BinArray: A Scalable Hardware Accelerator for Binary Approximated CNNs
Authors: Mario Fischer, Juergen Wassner (Department of Engineering and Architecture, Lucerne University of Applied Sciences and Arts, Switzerland)
Comments: 11 pages, 11 figures
Subjects: Hardware Architecture (cs.AR); Signal Processing (eess.SP)
[14]  arXiv:2012.03672 [pdf, other]
Title: FPGA deep learning acceleration based on convolutional neural network
Authors: Xiong Jun
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[15]  arXiv:2012.04559 [pdf, other]
Title: DeepNVM++: Cross-Layer Modeling and Optimization Framework of Non-Volatile Memories for Deep Learning
Comments: 12 pages, 10 figures
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[16]  arXiv:2012.05079 [pdf, other]
Title: Page Tables: Keeping them Flat and Hot (Cached)
Authors: Chang Hyun Park (1), Ilias Vougioukas (2), Andreas Sandberg (2), David Black-Schaffer (1) ((1) Uppsala University, (2) Arm Research)
Comments: 13 pages, 13 figures
Subjects: Hardware Architecture (cs.AR)
[17]  arXiv:2012.05136 [pdf, other]
Title: Efficient Bypass in Mesh and Torus NoCs
Authors: Iván Pérez (1), Enrique Vallejo (1), Ramón Beivide (1) ((1) University of Cantabria)
Comments: 14 pages, 16 figures, LaTeX; this review is an update of the preprint to the accepted manuscript of the paper; the final version of this work has been published in the Journal of SystemArchitecture, DOI: this https URL
Journal-ref: Journal of Systems Architecture Volume 108, September 2020, 101832
Subjects: Hardware Architecture (cs.AR)
[18]  arXiv:2012.05181 [pdf, ps, other]
Title: Virtual-Link: A Scalable Multi-Producer, Multi-Consumer Message Queue Architecture for Cross-Core Communication
Subjects: Hardware Architecture (cs.AR)
[19]  arXiv:2012.05419 [src]
Title: A Custom 7nm CMOS Standard Cell Library for Implementing TNN-based Neuromorphic Processors
Comments: This work is dated and will be superseded by a forthcoming work
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[20]  arXiv:2012.08071 [pdf]
Title: Optimization Techniques to Improve Inference Performance of a Forward Propagating Neural Network on an FPGA
Comments: 7 pages, 6 figures
Subjects: Hardware Architecture (cs.AR)
[21]  arXiv:2012.08320 [pdf, other]
Title: A Comparative Study between HLS and HDL on SoC for Image Processing Applications
Subjects: Hardware Architecture (cs.AR); Programming Languages (cs.PL)
[22]  arXiv:2012.09852 [pdf, other]
Title: SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Comments: Published as a conference paper in HPCA 2021; 15 pages, 23 figures
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[23]  arXiv:2012.10597 [pdf, other]
Title: MAVIREC: ML-Aided Vectored IR-DropEstimation and Classification
Comments: 6 pages paper. This has been reviewed at Design Automation and Test Conference 2021 and has been accepted as a four page paper. This is a longer version of that
Subjects: Hardware Architecture (cs.AR)
[24]  arXiv:2012.10848 [pdf, other]
Title: IntersectX: An Efficient Accelerator for Graph Mining
Subjects: Hardware Architecture (cs.AR)
[25]  arXiv:2012.11233 [pdf, other]
Title: Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Comments: Accepted for publication in IEEE Access
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[26]  arXiv:2012.11331 [pdf, other]
Title: FantastIC4: A Hardware-Software Co-Design Approach for Efficiently Running 4bit-Compact Multilayer Perceptrons
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[27]  arXiv:2012.11334 [pdf, other]
Title: Cognitive Computing in Data-centric Paradigm
Subjects: Hardware Architecture (cs.AR)
[28]  arXiv:2012.11473 [pdf, other]
Title: PALMED: Throughput Characterization for Superscalar Architectures -- Extended Version
Subjects: Hardware Architecture (cs.AR); Performance (cs.PF)
[29]  arXiv:2012.11890 [pdf, ps, other]
Title: SIMDRAM: A Framework for Bit-Serial SIMD Processing Using DRAM
Comments: Extended abstract of the full paper to appear in ASPLOS 2021
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Emerging Technologies (cs.ET)
[30]  arXiv:2012.12178 [pdf, other]
Title: Reducing Solid-State Drive Read Latency by Optimizing Read-Retry (Extended Abstract)
Comments: Extended abstract of the full paper to appear in ASPLOS 2021
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[31]  arXiv:2012.12381 [pdf]
Title: Intelligent Architectures for Intelligent Computing Systems
Authors: Onur Mutlu
Comments: To appear as an invited talk and accompanying summary paper at DATE 2021 conference. arXiv admin note: substantial text overlap with arXiv:2008.06112
Subjects: Hardware Architecture (cs.AR)
[32]  arXiv:2012.12563 [pdf, other]
Title: Architecture, Dataflow and Physical Design Implications of 3D-ICs for DNN-Accelerators
Subjects: Hardware Architecture (cs.AR)
[33]  arXiv:2012.13600 [pdf, other]
Title: EdgeDRNN: Recurrent Neural Network Accelerator for Edge Inference
Journal-ref: in IEEE Journal on Emerging and Selected Topics in Circuits and Systems, vol. 10, no. 4, pp. 419-432, Dec. 2020
Subjects: Hardware Architecture (cs.AR)
[34]  arXiv:2012.13645 [pdf, other]
Title: Fundamental Limits on Energy-Delay-Accuracy of In-memory Architectures in Inference Applications
Comments: 14 pages, 13 figures
Subjects: Hardware Architecture (cs.AR); Signal Processing (eess.SP)
[35]  arXiv:2012.00581 (cross-list from cs.IT) [pdf, other]
Title: Hardware Implementation of Iterative Projection-Aggregation Decoding of Reed-Muller Codes
Subjects: Information Theory (cs.IT); Hardware Architecture (cs.AR); Signal Processing (eess.SP)
[36]  arXiv:2012.01114 (cross-list from cs.LG) [pdf, other]
Title: Parallel Scheduling Self-attention Mechanism: Generalization and Optimization
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[37]  arXiv:2012.01153 (cross-list from eess.SY) [pdf, other]
Title: Towards Intelligent Reconfigurable Wireless Physical Layer (PHY)
Journal-ref: OJCAS Special Section on Circuits, Systems, and Algorithms for Beyond 5G and towards 6G, 2020
Subjects: Systems and Control (eess.SY); Hardware Architecture (cs.AR); Signal Processing (eess.SP)
[38]  arXiv:2012.02453 (cross-list from cs.LG) [pdf, other]
Title: Optimising Design Verification Using Machine Learning: An Open Source Solution
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[39]  arXiv:2012.02695 (cross-list from cs.ET) [pdf, other]
Title: A Single-Cycle MLP Classifier Using Analog MRAM-based Neurons and Synapses
Authors: Ramtin Zand
Comments: arXiv admin note: substantial text overlap with arXiv:2006.01238
Subjects: Emerging Technologies (cs.ET); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[40]  arXiv:2012.02715 (cross-list from cs.CR) [pdf, other]
Title: Efficient Sealable Protection Keys for RISC-V
Comments: 7 pages, 5 figures
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[41]  arXiv:2012.04105 (cross-list from cs.LG) [pdf, other]
Title: The Tribes of Machine Learning and the Realm of Computer Architecture
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[42]  arXiv:2012.04210 (cross-list from cs.LG) [pdf, other]
Title: The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
Comments: To appear in the proceedings of the 6th Workshop on Energy Efficient Machine Learning and Cognitive Computing (EMC2) 2020
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[43]  arXiv:2012.04240 (cross-list from cs.LG) [pdf, other]
Title: Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Comments: Accepted by High-Performance Computer Architecture (HPCA'2021)
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[44]  arXiv:2012.05550 (cross-list from cs.DM) [pdf, ps, other]
Title: Constructing Depth-Optimum Circuits for Adders and AND-OR Paths
Subjects: Discrete Mathematics (cs.DM); Hardware Architecture (cs.AR)
[45]  arXiv:2012.06272 (cross-list from cs.LG) [pdf, other]
Title: Hard-ODT: Hardware-Friendly Online Decision Tree Learning Algorithm and System
Comments: arXiv admin note: substantial text overlap with arXiv:2009.01431
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[46]  arXiv:2012.06373 (cross-list from cs.LG) [pdf, other]
Title: Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment
Comments: 6 pages, 2 figures, 1 table. Oral at the Beyond Backpropagation Workshop, NeurIPS 2020
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[47]  arXiv:2012.06959 (cross-list from cs.DC) [pdf, other]
Title: Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[48]  arXiv:2012.07242 (cross-list from cs.CR) [pdf, other]
Title: Neighbors From Hell: Voltage Attacks Against Deep Learning Accelerators on Multi-Tenant FPGAs
Comments: Published in the 2020 proceedings of the International Conference of Field-Programmable Technology (ICFPT)
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[49]  arXiv:2012.09646 (cross-list from cs.DC) [pdf, other]
Title: DAG-based Scheduling with Resource Sharing for Multi-task Applications in a Polyglot GPU Runtime
Comments: 10 pages, to be published in 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[50]  arXiv:2012.09918 (cross-list from cs.ET) [pdf, other]
Title: Sorting in Memristive Memory
Subjects: Emerging Technologies (cs.ET); Hardware Architecture (cs.AR)
[ total of 56 entries: 1-50 | 51-56 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2208, contact, help  (Access key information)