Near-DRAM Accelerated Matrix Multiplications

Aman Sinha*, Bo Cheng Lai

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

General matrix multiplication (GeMM) is a fundamental computing operation underpinning various applications in machine learning, data science, computer graphics, and scientific simulations. However, its performance on von Neumann computers, such as CPUs and GPUs, is bottlenecked due to challenges such as insufficient memory access bandwidth, low cache efficiency, hardware under-utilization, and massive power consumption. The performance of GPUs suffers drastically for complex sequential analysis often occurring along with GeMMs in various big data pipelines. Furthermore, GeMM-optimized Tensor cores available in modern Nvidia GPUs offer no extensibility to non-GeMM tasks. Various Near-Memory Computing (NMC) architectures have recently been explored to alleviate the data-intensive nature of analyses such as GeMM. This work evaluates the performance potential of GeMM using NMC through clusters of simple interconnected processing cores on a stacked DRAM platform. The organized design shows efficiency comparable to high-end Nvidia GPUs while consuming lower power and being highly extensible to various non-GeMM logically complex workloads.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 17th International Symposium on Embedded Multicore/Many-core Systems-on-Chip, MCSoC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages245-248
Number of pages4
ISBN (Electronic)9798331530471
DOIs
StatePublished - 2024
Event17th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, MCSoC 2024 - Kuala Lumpur, Malaysia
Duration: 16 Dec 202419 Dec 2024

Publication series

NameProceedings - 2024 IEEE 17th International Symposium on Embedded Multicore/Many-core Systems-on-Chip, MCSoC 2024

Conference

Conference17th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, MCSoC 2024
Country/TerritoryMalaysia
CityKuala Lumpur
Period16/12/2419/12/24

Keywords

  • DNNs
  • GeMM
  • Matrix Multiplication
  • Near-Memory Computing
  • RISC-V
  • Stacked Memory

Fingerprint

Dive into the research topics of 'Near-DRAM Accelerated Matrix Multiplications'. Together they form a unique fingerprint.

Cite this