Abstract: Numerous studies have proposed hardware architectures to accelerate sparse matrix multiplication, but these approaches often incur substantial area and power overhead, significantly ...
In industrial recommendation systems, the shift toward Generative Retrieval (GR) is replacing traditional embedding-based nearest neighbor search with Large Language Models (LLMs). These models ...
How Effective is Matrix Reordering for Improving Performance of Sparse Matrix-Vector Multiplication?
Abstract: This work evaluates the impact of matrix reordering on the performance of sparse matrix-vector multiplication across different multicore CPU platforms. Reordering can enhance performance by ...
Kolmogorov-Arnold Networks (KANs) training and inference are accelerated by orders of magnitude through exploiting the structure of the uniform linear (C⁰) B-spline (see Fig. 1). Because the intervals ...