Skip to content

Pull requests: pytorch/ao

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[mxfp8 moe training] refactor all var names with suffix _mx to _fp8 for clarity CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2879 opened Aug 26, 2025 by danielvegamyhre Loading…
fix ci import error CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2876 opened Aug 25, 2025 by liangel-02 Loading…
release notes script: keep not user facing rows CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2875 opened Aug 25, 2025 by vkuzo Loading…
TorchAOBaseTensor __tensor_flatten__ and __tensor_unflatten__ use… CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2874 opened Aug 25, 2025 by jerryzh168 Loading…
Move CPU kernels out of experimental CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2868 opened Aug 25, 2025 by metascroy Loading…
Split implements and implements_torch_function CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2866 opened Aug 24, 2025 by Krishn1412 Loading…
[moe fp8 training] fused reduction kernel along dim1 for 3d expert weights in backward CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2865 opened Aug 23, 2025 by danielvegamyhre Loading…
[moe fp8 training] use transpose method when quantizing to avoid uncoalesced gmem accesses CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2864 opened Aug 23, 2025 by danielvegamyhre Loading…
[moe fp8 training] test and bench new faster method for per group rowwise scaling CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2863 opened Aug 23, 2025 by danielvegamyhre Loading…
Update IntxUnpackedTensor to support dynamic activation CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2861 opened Aug 22, 2025 by metascroy Loading…
Port metadata from the linear node onto the reference custom op for int4 CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. pt2e_quant pt2 export quantization (prepare_pt2e, convert_pt2e)
#2860 opened Aug 22, 2025 by kimishpatel Loading…
Enable quantizing local checkpoints in model release script CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2859 opened Aug 22, 2025 by jerryzh168 Loading…
[moe training] add test case for shared expert in distributed tests CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2856 opened Aug 22, 2025 by danielvegamyhre Loading…
use gcnArchName to get gpu_arch
#2853 opened Aug 22, 2025 by sidamd Loading…
[mxfp8 moe] add support for fbgemm 2d-3d mx8mx8bf16 grouped gemm CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2848 opened Aug 22, 2025 by danielvegamyhre Loading…
Fix Llama4 example CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2846 opened Aug 22, 2025 by yiliu30 Loading…
Add Int4XPUTensorIntZP CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: new feature Use this tag if this PR adds a new feature
#2845 opened Aug 22, 2025 by liangan1 Loading…
test fsdp2 Moe CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2842 opened Aug 21, 2025 by weifengpy Loading…
[test only] testing adding optioanl tensor arg to float8 tensor CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2840 opened Aug 21, 2025 by jerryzh168 Loading…
Conditional ROCm kernel build ciflow/rocm CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. module: rocm topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2839 opened Aug 21, 2025 by petrex Loading…
Add MyPy support CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2838 opened Aug 21, 2025 by khushi-411 Loading…
[wip] mx: expose a fast path for casting to fp4x2 CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2832 opened Aug 21, 2025 by vkuzo Loading…
[fp8 blockwise] wrap triton quantization kernels in custom ops for torch.compile compatibility CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2829 opened Aug 21, 2025 by danielvegamyhre Loading…
[fp8 blockwise] load 2d chunks for groupwise quant to enable coalesced gmem accesses CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2827 opened Aug 21, 2025 by danielvegamyhre Loading…
use shared bench + profile utils in blockwise fwd bwd bench script CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: not user facing Use this tag if you don't want this PR to show up in release notes
#2826 opened Aug 21, 2025 by danielvegamyhre Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.