Hardware Architecture

Authors and titles for recent submissions, skipping first 11

[ total of 27 entries: 1-25 | 12-27 ]

Wed, 24 Apr 2024 (continued, showing last 1 of 7 entries)

[12]  arXiv:2404.14754 (cross-list from cs.LG) [pdf, other]
Title: Skip the Benchmark: Generating System-Level High-Level Synthesis Data using Generative Machine Learning
Comments: Accepted at Great Lakes Symposium on VLSI 2024 (GLSVLSI 24)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)

Tue, 23 Apr 2024

[13]  arXiv:2404.14069 [pdf, ps, other]
Title: On the Systematic Creation of Faithfully Rounded Commutative Truncated Booth Multipliers
Subjects: Hardware Architecture (cs.AR)
[14]  arXiv:2404.14010 [pdf, other]
Title: A Stochastic Rounding-Enabled Low-Precision Floating-Point MAC for DNN Training
Authors: Sami Ben Ali (TARAN), Silviu-Ioan Filip (TARAN), Olivier Sentieys (TARAN)
Journal-ref: DATE 2024 - 27th IEEE/ACM Design, Automation and Test in Europe, Mar 2024, Valencia, Spain. pp.1-6
Subjects: Hardware Architecture (cs.AR)
[15]  arXiv:2404.13062 [pdf, other]
Title: EasyACIM: An End-to-End Automated Analog CIM with Synthesizable Architecture and Agile Design Space Exploration
Subjects: Hardware Architecture (cs.AR); Neural and Evolutionary Computing (cs.NE)
[16]  arXiv:2404.13061 [pdf, other]
Title: FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning
Comments: accepted by ISEDA2024
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[17]  arXiv:2404.13049 [pdf, ps, other]
Title: DG-RePlAce: A Dataflow-Driven GPU-Accelerated Analytical Global Placement Framework for Machine Learning Accelerators
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[18]  arXiv:2404.14279 (cross-list from cs.CV) [pdf, other]
Title: Co-designing a Sub-millisecond Latency Event-based Eye Tracking System with Submanifold Sparse CNN
Comments: Accepted to CVPR 2024 workshop, AIS: Vision, Graphics, and AI for Streaming
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
[19]  arXiv:2404.14110 (cross-list from eess.SY) [pdf, other]
Title: HomeLabGym: A real-world testbed for home energy management systems
Comments: 3 pages, 2 figures, conference
Subjects: Systems and Control (eess.SY); Hardware Architecture (cs.AR)
[20]  arXiv:2404.13477 (cross-list from cs.CR) [pdf, other]
Title: Leveraging Adversarial Detection to Enable Scalable and Low Overhead RowHammer Mitigations
Subjects: Cryptography and Security (cs.CR); Hardware Architecture (cs.AR)

Mon, 22 Apr 2024

[21]  arXiv:2404.12503 [pdf, other]
Title: STRELA: STReaming ELAstic CGRA Accelerator for Embedded Systems
Authors: Daniel Vazquez (1), Jose Miranda (2), Alfonso Rodriguez (1), Andres Otero (1), Pascuale Davide Schiavone (2), David Atienza (2) ((1) Centro de Electronica Industrial, Universidad Politecnica de Madrid (UPM), (2) Embedded Systems Laboratory, Ecole Polytechnique Federale de Lausanne (EPFL))
Comments: 14 pages, 11 figures
Subjects: Hardware Architecture (cs.AR)

Fri, 19 Apr 2024

[22]  arXiv:2404.12336 [pdf, other]
Title: Combining Power and Arithmetic Optimization via Datapath Rewriting
Subjects: Hardware Architecture (cs.AR)
[23]  arXiv:2404.12306 [pdf, ps, other]
Title: Switchable Single/Dual Edge Registers for Pipeline Architecture
Subjects: Hardware Architecture (cs.AR)
[24]  arXiv:2404.11887 [pdf, other]
Title: EN-TensorCore: Advancing TensorCores Performance through Encoder-Based Methodology
Comments: 7 pages, 6 figures
Subjects: Hardware Architecture (cs.AR)
[25]  arXiv:2404.11852 [pdf, other]
Title: Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations
Subjects: Hardware Architecture (cs.AR); Graphics (cs.GR)
[26]  arXiv:2404.11788 [pdf, other]
Title: NonGEMM Bench: Understanding the Performance Horizon of the Latest ML Workloads with NonGEMM Workloads
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); Performance (cs.PF)
[27]  arXiv:2404.11721 [pdf, other]
Title: Functionality Locality, Mixture & Control = Logic = Memory
Authors: Xiangjun Peng
Subjects: Hardware Architecture (cs.AR)