add scattermoe kernel for fast MoE training #40365

Draft: wants to merge 5 commits into main
Conversation

mayank31398
Contributor

No description provided.

@Rocketknight1
Member

cc @ArthurZucker

@ArthurZucker
Collaborator

Hello @mayank31398! Nice PR, happy to add something like that. Do you mind using kernels, like what we do for GPT_OSS? That way we keep a slow path that is compatible with all torch versions, all hardware, etc., we don't touch the core modeling code, and we just have the kernel on the Hub!

WDYT? 🤗
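
For reference, the hub-kernel pattern being suggested looks roughly like the sketch below. This is only an illustration: the repo id `kernels-community/scattermoe`, the kernel call signature, and the tensor shapes are assumptions, not part of this PR.

```python
import torch

try:
    from kernels import get_kernel                           # HF `kernels` package
    scattermoe = get_kernel("kernels-community/scattermoe")  # hypothetical Hub repo id
except Exception:
    scattermoe = None                                         # kernel unavailable -> slow path


def moe_forward(hidden_states, expert_weights, routing_weights, selected_experts):
    """hidden_states: (tokens, in); expert_weights: (experts, in, out);
    routing_weights / selected_experts: (tokens, top_k)."""
    if scattermoe is not None and hidden_states.is_cuda:
        # fast path: fused Triton kernel pulled from the Hub (assumed signature)
        return scattermoe.forward(hidden_states, expert_weights,
                                  routing_weights, selected_experts)

    # slow path: plain PyTorch, works on any device and any torch version
    out = torch.zeros(hidden_states.shape[0], expert_weights.shape[-1],
                      device=hidden_states.device, dtype=hidden_states.dtype)
    for e in range(expert_weights.shape[0]):
        token_idx, k_idx = torch.where(selected_experts == e)
        if token_idx.numel() == 0:
            continue
        expert_out = hidden_states[token_idx] @ expert_weights[e]
        out.index_add_(0, token_idx,
                       expert_out * routing_weights[token_idx, k_idx, None])
    return out
```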

@mayank31398
Contributor Author

@ArthurZucker scattermoe doesn't support bias for now; I will add that soon!
Meanwhile, supporting every model is hard, since some models store their expert weights as a ModuleList instead of a single 3D tensor :/
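
To illustrate the two layouts being contrasted here (a generic sketch; the shapes and module names are assumptions, not taken from any particular model): a fused kernel such as scattermoe wants the stacked 3D form, while ModuleList-style experts have to be stacked first.

```python
import torch
import torch.nn as nn

num_experts, hidden, intermediate = 8, 64, 256

# Layout A: one nn.Linear per expert, as many MoE modeling files do
experts_list = nn.ModuleList(
    nn.Linear(hidden, intermediate, bias=False) for _ in range(num_experts)
)

# Layout B: a single 3D parameter of shape (experts, in, out), which a
# batched/fused expert kernel can consume directly
experts_3d = nn.Parameter(torch.empty(num_experts, hidden, intermediate))

# Bridging A -> B means materialising a stacked copy (extra memory, and the
# copy is detached from the original parameters unless handled carefully)
stacked = torch.stack([lin.weight.t() for lin in experts_list])  # (8, 64, 256)
```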

Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: granitemoe

@shawntan
Contributor

> Hello @mayank31398! Nice PR, happy to add something like that. Do you mind using kernels, like what we do for GPT_OSS? That way we keep a slow path that is compatible with all torch versions, all hardware, etc., we don't touch the core modeling code, and we just have the kernel on the Hub!
>
> WDYT? 🤗

Is there an existing Triton kernel you could point me to that I could follow?
