We gratefully acknowledge support from
the Simons Foundation and member institutions.

Hardware Architecture

Authors and titles for cs.AR in Oct 2020

[ total of 56 entries: 1-50 | 51-56 ]
[ showing 50 entries per page: fewer | more | all ]
[1]  arXiv:2010.00627 [pdf]
Title: CARLA: A Convolution Accelerator with a Reconfigurable and Low-Energy Architecture
Comments: 12 pages
Subjects: Hardware Architecture (cs.AR)
[2]  arXiv:2010.02017 [pdf, other]
Title: Synchronizer-Free Digital Link Controller
Comments: 12 page journal article
Journal-ref: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I (REGULAR PAPERS), VOLUME 67, NUMBER 10, OCTOBER 2020
Subjects: Hardware Architecture (cs.AR)
[3]  arXiv:2010.02079 [pdf, other]
Title: NATSA: A Near-Data Processing Accelerator for Time Series Analysis
Comments: To appear in the 38th IEEE International Conference on Computer Design (ICCD 2020)
Subjects: Hardware Architecture (cs.AR)
[4]  arXiv:2010.02825 [pdf, other]
Title: WoLFRaM: Enhancing Wear-Leveling and Fault Tolerance in Resistive Memories using Programmable Address Decoders
Comments: To appear in ICCD 2020
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET)
[5]  arXiv:2010.03397 [pdf, other]
Title: A Hardware-Aware Heuristic for the Qubit Mapping Problem in the NISQ Era
Authors: Siyuan Niu (LIRMM), Adrien Suau (LIRMM, CERFACS), Gabriel Staffelbach (CERFACS), Aida Todri-Sanial (LIRMM, CNRS)
Comments: IEEE Transactions on Quantum Engineering, 2020
Subjects: Hardware Architecture (cs.AR); Quantum Physics (quant-ph)
[6]  arXiv:2010.04073 [pdf, other]
Title: A Mixed-Precision RISC-V Processor for Extreme-Edge DNN Inference
Comments: 6 pages, 6 figures, 2 tables, conference
Subjects: Hardware Architecture (cs.AR)
[7]  arXiv:2010.04566 [pdf, other]
Title: An Energy-Efficient Low-Voltage Swing Transceiver for mW-Range IoT End-Nodes
Comments: ISCAS2020
Subjects: Hardware Architecture (cs.AR)
[8]  arXiv:2010.05037 [pdf, other]
Title: Cross-Stack Workload Characterization of Deep Recommendation Systems
Comments: Published in 2020 IEEE International Symposium on Workload Characterization (IISWC)
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Information Retrieval (cs.IR)
[9]  arXiv:2010.05197 [pdf, other]
Title: TaxoNN: A Light-Weight Accelerator for Deep Neural Network Training
Comments: Accepted to ISCAS 2020. 5 pages, 5 figures
Journal-ref: 2020 IEEE International Symposium on Circuits and Systems (ISCAS), 2020, pp. 1-5
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[10]  arXiv:2010.05894 [pdf, other]
Title: MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions
Comments: Accepted by MLSys'21 (the 4th Conference on Machine Learning and Systems)
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[11]  arXiv:2010.06075 [pdf, other]
Title: When HLS Meets FPGA HBM: Benchmarking and Bandwidth Optimization
Subjects: Hardware Architecture (cs.AR)
[12]  arXiv:2010.06156 [pdf, other]
Title: High Area/Energy Efficiency RRAM CNN Accelerator with Kernel-Reordering Weight Mapping Scheme Based on Pattern Pruning
Comments: 6 pages, 7 figures
Subjects: Hardware Architecture (cs.AR)
[13]  arXiv:2010.06277 [pdf, other]
[14]  arXiv:2010.07185 [pdf, other]
Title: Effective Algorithm-Accelerator Co-design for AI Solutions on Edge Devices
Comments: GLSVLSI, September 7-9, 2020
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[15]  arXiv:2010.08065 [pdf, other]
Title: FPRaker: A Processing Element For Accelerating Neural Network Training
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI)
[16]  arXiv:2010.08667 [pdf, other]
Title: Combinatorics and Geometry for the Many-ported, Distributed and Shared Memory Architecture
Subjects: Hardware Architecture (cs.AR)
[17]  arXiv:2010.09330 [pdf, other]
Title: Enabling High-Capacity, Latency-Tolerant, and Highly-Concurrent GPU Register Files via Software/Hardware Cooperation
Comments: To Appear in ACM Transactions on Computer Systems (TOCS)
Subjects: Hardware Architecture (cs.AR)
[18]  arXiv:2010.09457 [pdf, other]
Title: Closed-Loop Neural Interfaces with Embedded Machine Learning
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[19]  arXiv:2010.10119 [pdf, other]
Title: A RISC-V SystemC-TLM simulator
Authors: Màrius Montón
Comments: 4 pages. Presented at CARRV 2020
Subjects: Hardware Architecture (cs.AR)
[20]  arXiv:2010.10233 [pdf, other]
Title: Eliminating the Barriers: Demystifying Wi-Fi Baseband Design and Introducing the PicoScenes Wi-Fi Sensing Platform
Subjects: Hardware Architecture (cs.AR)
[21]  arXiv:2010.10683 [pdf, other]
Title: Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability
Journal-ref: Proceedings of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'18), 2018
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI)
[22]  arXiv:2010.12114 [pdf, other]
Title: The nanoPU: Redesigning the CPU-Network Interface to Minimize RPC Tail Latency
Comments: 10 pages
Subjects: Hardware Architecture (cs.AR); Networking and Internet Architecture (cs.NI)
[23]  arXiv:2010.12376 [pdf, other]
Title: Efficient Floating-Point Givens Rotation Unit
Comments: 25 pages, 11 figures, this is a pre-print version of an article that has been accepted for publication in the journal Circuits, Systems, and Signal Processing
Subjects: Hardware Architecture (cs.AR)
[24]  arXiv:2010.12861 [pdf]
Title: MARS: Multi-macro Architecture SRAM CIM-Based Accelerator with Co-designed Compressed Neural Networks
Comments: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2021
Subjects: Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[25]  arXiv:2010.12869 [pdf, other]
Title: ExPAN(N)D: Exploring Posits for Efficient Artificial Neural Network Design in FPGA-based Systems
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Performance (cs.PF)
[26]  arXiv:2010.13100 [pdf, other]
Title: Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Information Retrieval (cs.IR); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[27]  arXiv:2010.14145 [pdf, other]
Title: hXDP: Efficient Software Packet Processing on FPGA NICs
Comments: Accepted at USENIX OSDI'20
Subjects: Hardware Architecture (cs.AR)
[28]  arXiv:2010.14934 [pdf, other]
Title: Analysis of Energy Consumption in a Precision Beekeeping System
Authors: Hugo Hadjur (AVALON), Doreid Ammar, Laurent Lefèvre (AVALON)
Comments: IoT '20: 10th International Conference on the Internet of Things, Oct 2020, Malm{\"o}, Sweden
Subjects: Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC)
[29]  arXiv:2010.16171 [pdf, other]
Title: RVCoreP-32IM: An effective architecture to implement mul/div instructions for five stage RISC-V soft processors
Subjects: Hardware Architecture (cs.AR)
[30]  arXiv:2010.00289 (cross-list from cs.DC) [pdf, other]
Title: Weighing up the new kid on the block: Impressions of using Vitis for HPC software development
Authors: Nick Brown
Comments: Pre-print of Weighing up the new kid on the block: Impressions of using Vitis for HPC software development, paper in 30th International Conference on Field Programmable Logic and Applications
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[31]  arXiv:2010.02075 (cross-list from cs.LG) [pdf, other]
Title: Learned Hardware/Software Co-Design of Neural Accelerators
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (stat.ML)
[32]  arXiv:2010.04017 (cross-list from cs.LG) [pdf, other]
Title: DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Journal-ref: MICRO 2020
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Programming Languages (cs.PL)
[33]  arXiv:2010.04212 (cross-list from cs.PF) [pdf, other]
Title: Machine Learning Enabled Scalable Performance Prediction of Scientific Codes
Comments: Under review at ACM TOMACS 2020 for a special issue
Subjects: Performance (cs.PF); Hardware Architecture (cs.AR)
[34]  arXiv:2010.04633 (cross-list from cs.PL) [pdf, other]
Title: C for a tiny system
Subjects: Programming Languages (cs.PL); Hardware Architecture (cs.AR)
[35]  arXiv:2010.08262 (cross-list from cs.NE) [pdf, other]
Title: Local plasticity rules can learn deep representations using self-supervised contrastive predictions
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[36]  arXiv:2010.08412 (cross-list from cs.CL) [pdf, other]
Title: Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications
Comments: To appear at the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP '20), November 16-20, 2020, NMT, AI accelerators, co-design, TPU, OPU, 10 pages, 3 figures, 4 tables
Subjects: Computation and Language (cs.CL); Hardware Architecture (cs.AR)
[37]  arXiv:2010.08440 (cross-list from cs.CR) [pdf, other]
Title: Elasticlave: An Efficient Memory Model for Enclaves
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[38]  arXiv:2010.08916 (cross-list from cs.DC) [pdf, ps, other]
Title: Optimizing Memory Performance of Xilinx FPGAs under Vitis
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
[39]  arXiv:2010.09308 (cross-list from cs.RO) [pdf, other]
Title: NimbRo-OP2X: Affordable Adult-sized 3D-printed Open-Source Humanoid Robot for Research
Subjects: Robotics (cs.RO); Hardware Architecture (cs.AR)
[40]  arXiv:2010.09852 (cross-list from cs.DC) [pdf, other]
Title: Evaluating the Cost of Atomic Operations on Modern Architectures
Journal-ref: Proceedings of the 24th International Conference on Parallel Architectures and Compilation (PACT'15), 2015
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Data Structures and Algorithms (cs.DS); Performance (cs.PF)
[41]  arXiv:2010.10370 (cross-list from eess.SY) [pdf, other]
Title: Monitoring Large Crowds With WiFi: A Privacy-Preserving Approach
Subjects: Systems and Control (eess.SY); Hardware Architecture (cs.AR); Cryptography and Security (cs.CR)
[42]  arXiv:2010.10416 (cross-list from cs.CR) [pdf, other]
Title: Composite Enclaves: Towards Disaggregated Trusted Execution
Journal-ref: IACR Transactions on Cryptographic Hardware and Embedded Systems, 2022 (1)
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[43]  arXiv:2010.10562 (cross-list from cs.NI) [pdf, other]
Title: NV-Fogstore : Device-aware hybrid caching in fog computing environments
Subjects: Networking and Internet Architecture (cs.NI); Hardware Architecture (cs.AR)
[44]  arXiv:2010.11686 (cross-list from cs.LG) [pdf]
Title: A Very Compact Embedded CNN Processor Design Based on Logarithmic Computing
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2010.13103 (cross-list from cs.DC) [pdf, other]
Title: LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[46]  arXiv:2010.13155 (cross-list from cs.CR) [pdf, other]
Title: Security Assessment of Interposer-based Chiplet Integration
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)
[47]  arXiv:2010.13216 (cross-list from cs.DC) [pdf, other]
Title: Performance Analysis of Scientific Computing Workloads on Trusted Execution Environments
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Cryptography and Security (cs.CR)
[48]  arXiv:2010.13311 (cross-list from cs.NE) [pdf]
Title: RNNAccel: A Fusion Recurrent Neural Network Accelerator for Edge Intelligence
Comments: This is a paper summited in vlsicad2020 conference in Taiwan. For more information about RNNAccel, see this https URL
Subjects: Neural and Evolutionary Computing (cs.NE); Hardware Architecture (cs.AR)
[49]  arXiv:2010.13619 (cross-list from cs.DB) [pdf, other]
Title: Exploring Memory Access Patterns for Graph Processing Accelerators
Subjects: Databases (cs.DB); Hardware Architecture (cs.AR)
[50]  arXiv:2010.14246 (cross-list from cs.LG) [pdf, other]
Title: $μ$NAS: Constrained Neural Architecture Search for Microcontrollers
Comments: $\mu$NAS is available at this https URL
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[ total of 56 entries: 1-50 | 51-56 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2205, contact, help  (Access key information)