Optimize the gemv_t_vector.c kernel for RISCV64_ZVL256B target #5427

yuanjia111 · 2025-08-22T09:06:38Z

1. The following changes were made:
(1) Changed LMUL from 2 to 8.
(2) Optimized the inc_x = 1 case, mainly by processing more data in parallel along the n direction.
(3) Cleaned up the code formatting.
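
For readers unfamiliar with the idea behind (1) and (2), here is a rough scalar sketch in plain C. It is not the code in gemv_t_vector.c: the function names (`vlmax`, `gemv_t_ref`, `gemv_t_cols4`), the block width of four columns, and the use of `double` are all assumptions made for illustration. `vlmax` applies the standard RVV relation VLMAX = VLEN / SEW * LMUL for integer LMUL, which shows why a larger LMUL lets one instruction cover more elements. `gemv_t_cols4` shows the general shape of working on several columns of A (the n direction) in one pass over a unit-stride x, so each x element is loaded once and reused.

```c
#include <assert.h>
#include <stddef.h>

/* Elements per vector register group for integer LMUL: VLEN / SEW * LMUL. */
size_t vlmax(size_t vlen_bits, size_t sew_bits, size_t lmul)
{
    return vlen_bits / sew_bits * lmul;
}

/* Reference: y[j] += alpha * dot(A[:, j], x), A is m x n column-major,
 * x has unit stride (inc_x == 1). */
void gemv_t_ref(size_t m, size_t n, double alpha, const double *a,
                size_t lda, const double *x, double *y)
{
    assert(lda >= m);
    for (size_t j = 0; j < n; j++) {
        double acc = 0.0;
        for (size_t i = 0; i < m; i++)
            acc += a[i + j * lda] * x[i];
        y[j] += alpha * acc;
    }
}

/* Illustrative variant: four columns per pass (hypothetical block width).
 * Each x[i] is read once and used for four independent accumulators. */
void gemv_t_cols4(size_t m, size_t n, double alpha, const double *a,
                  size_t lda, const double *x, double *y)
{
    assert(lda >= m);
    size_t j = 0;
    for (; j + 4 <= n; j += 4) {
        const double *a0 = a + (j + 0) * lda;
        const double *a1 = a + (j + 1) * lda;
        const double *a2 = a + (j + 2) * lda;
        const double *a3 = a + (j + 3) * lda;
        double s0 = 0.0, s1 = 0.0, s2 = 0.0, s3 = 0.0;
        for (size_t i = 0; i < m; i++) {
            double xi = x[i];
            s0 += a0[i] * xi;
            s1 += a1[i] * xi;
            s2 += a2[i] * xi;
            s3 += a3[i] * xi;
        }
        y[j + 0] += alpha * s0;
        y[j + 1] += alpha * s1;
        y[j + 2] += alpha * s2;
        y[j + 3] += alpha * s3;
    }
    /* Remaining columns (n not a multiple of 4) fall back to one at a time. */
    for (; j < n; j++) {
        double acc = 0.0;
        for (size_t i = 0; i < m; i++)
            acc += a[i + j * lda] * x[i];
        y[j] += alpha * acc;
    }
}
```

With VLEN = 256 and 32-bit elements, `vlmax(256, 32, 2)` gives 16 and `vlmax(256, 32, 8)` gives 64. In a vectorized kernel the scalar accumulators above would be vector register groups, and a larger LMUL leaves fewer groups available, so the actual column count and register allocation in the PR may differ from this sketch.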

2. All BLAS tests passed:

3. Performance was verified on K1 [C908, vlen = 256].
(1) Tested with the built-in benchmark; the performance data after optimization is as follows:

(2) The complete performance comparison before and after optimization is as follows:

ChipKerchner · 2025-08-25T13:09:12Z

Great job!

riscv64: optimize gemv_t_vector.c

c2cc7a3

yuanjia111 changed the title ~~Optimize the gemv_t_vector.c kernel for RISCV64_ZVL256B targets~~ Optimize the gemv_t_vector.c kernel for RISCV64_ZVL256B target Aug 22, 2025

martin-frbg added this to the 0.3.31 milestone Aug 24, 2025

martin-frbg merged commit da7d0f4 into OpenMathLib:develop Aug 25, 2025
91 of 95 checks passed
