heyujiao99
Contributor

This patch implements the MPI_Op reduction operations using the RISC-V Vector (RVV) extension.

Performance results:

  • On older compilers (without RVV auto-vectorization):
    • RVV-optimized ops are 1.5-4x faster than the C implementations.
  • With GCC 14+ (RVV auto-vectorization enabled):
    • 2-buffer RVV ops perform similarly to auto-vectorized C code.
    • 3-buffer RVV ops are still 1.5-4x faster.
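
For readers unfamiliar with the 2-buffer and 3-buffer terminology, the following is a minimal sketch of the two forms for an int32 sum. It is not the code from this patch: the function names `sum_int32_2buff` and `sum_int32_3buff` are hypothetical, the choice of LMUL=4 is an assumption, and the RVV path is only compiled when the toolchain defines `__riscv_vector`. On other targets the plain C loop is used, which is the kind of loop a compiler with RVV auto-vectorization may vectorize on its own.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__riscv_vector)
#include <riscv_vector.h>
#endif

/* Hypothetical 2-buffer form: inout[i] = in[i] + inout[i]. */
void sum_int32_2buff(const int32_t *in, int32_t *inout, size_t n)
{
#if defined(__riscv_vector)
    /* Strip-mined loop: vsetvl picks how many elements to process per pass. */
    while (n > 0) {
        size_t vl = __riscv_vsetvl_e32m4(n);
        vint32m4_t a = __riscv_vle32_v_i32m4(in, vl);
        vint32m4_t b = __riscv_vle32_v_i32m4(inout, vl);
        __riscv_vse32_v_i32m4(inout, __riscv_vadd_vv_i32m4(a, b, vl), vl);
        in += vl;
        inout += vl;
        n -= vl;
    }
#else
    /* Scalar fallback. */
    for (size_t i = 0; i < n; ++i) {
        inout[i] += in[i];
    }
#endif
}

/* Hypothetical 3-buffer form: out[i] = in1[i] + in2[i]. */
void sum_int32_3buff(const int32_t *in1, const int32_t *in2,
                     int32_t *out, size_t n)
{
#if defined(__riscv_vector)
    while (n > 0) {
        size_t vl = __riscv_vsetvl_e32m4(n);
        vint32m4_t a = __riscv_vle32_v_i32m4(in1, vl);
        vint32m4_t b = __riscv_vle32_v_i32m4(in2, vl);
        __riscv_vse32_v_i32m4(out, __riscv_vadd_vv_i32m4(a, b, vl), vl);
        in1 += vl;
        in2 += vl;
        out += vl;
        n -= vl;
    }
#else
    /* Scalar fallback. */
    for (size_t i = 0; i < n; ++i) {
        out[i] = in1[i] + in2[i];
    }
#endif
}
```

The 2-buffer form accumulates into one of its inputs, while the 3-buffer form writes to a separate output buffer.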

Signed-off-by: heyujiao99 <he.yujiao@sanechips.com.cn>
@jsquyres
Member

Per the conversation on #13324, has this code been validated to be correct?

@heyujiao99
Contributor Author

> Per the conversation on #13324, has this code been validated to be correct?

I added an MPI_OP test case to the CI pipeline: "add mpi_op test in CI".
