tengomucho
Collaborator

What does this PR do?

This bumps the optimum-neuron version to 0.3.0, moving completely to NxD for inference.

tengomucho and others added 4 commits July 21, 2025 12:51
Dependencies were changed accordingly, because Neuron SDK was updated to
v2.24.
Also modify the temperature in decode test to avoid granite early
stopping.
@tengomucho tengomucho requested review from Narsil and dacorvo August 4, 2025 15:29
Collaborator

@dacorvo dacorvo left a comment

LGTM, thanks!

@tengomucho tengomucho merged commit 8801ba1 into main Aug 26, 2025
26 of 33 checks passed
@tengomucho tengomucho deleted the optimum-neuron-0.3.0 branch August 26, 2025 09:07
2 participants