Add tokenizer_kwargs argument to the text generation pipeline #40364

Open · Joshua-Chin wants to merge 5 commits into main from text-generation-pipeline-tokenizer-kwargs

Conversation

@Joshua-Chin (Author)

What does this PR do?

This PR adds a tokenizer_kwargs argument to the TextGenerationPipeline, allowing users to pass arbitrary arguments to the tokenizer during preprocessing. In particular, this lets users set chat template arguments, such as the enable_thinking flag for Qwen3 or SmolLM3.
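
A usage sketch of the intended behavior (not taken from the PR itself; the model name and exact forwarding semantics are assumptions based on the description above, and the argument was later renamed to `tokenizer_encode_kwargs` during review, see below):

```python
from transformers import pipeline

# Sketch only: assumes the pipeline forwards these kwargs to the tokenizer /
# chat template during preprocessing, as described in the PR summary.
generator = pipeline("text-generation", model="Qwen/Qwen3-0.6B")

messages = [{"role": "user", "content": "Briefly explain beam search."}]
outputs = generator(
    messages,
    max_new_tokens=64,
    tokenizer_kwargs={"enable_thinking": False},  # Qwen3 / SmolLM3 chat-template flag
)
print(outputs[0]["generated_text"])
```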

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

@Joshua-Chin (Author)

The test failure seems to be an unrelated flake:

self = <test_accelerate_examples.ExamplesTestsNoTrainer testMethod=test_run_swag_no_trainer>

    @mock.patch.dict(os.environ, {"WANDB_MODE": "offline", "DVCLIVE_TEST": "true"})
    def test_run_swag_no_trainer(self):
        tmp_dir = self.get_auto_remove_tmp_dir()
        testargs = f"""
            {self.examples_dir}/pytorch/multiple-choice/run_swag_no_trainer.py
            --model_name_or_path google-bert/bert-base-uncased
            --train_file tests/fixtures/tests_samples/swag/sample.json
            --validation_file tests/fixtures/tests_samples/swag/sample.json
            --output_dir {tmp_dir}
            --max_train_steps=20
            --num_warmup_steps=2
            --learning_rate=2e-4
            --per_device_train_batch_size=2
            --per_device_eval_batch_size=1
            --with_tracking
        """.split()
    
        run_command(self._launch_args + testargs)
        result = get_results(tmp_dir)
>       self.assertGreaterEqual(result["eval_accuracy"], 0.8)
E       AssertionError: 0.4 not greater than or equal to 0.8

examples/pytorch/test_accelerate_examples.py:225: AssertionError

Pushing an empty commit to re-run the CI.

@Joshua-Chin (Author)

A disjoint set of tests has failed in the re-run.

@Joshua-Chin (Author)

@Rocketknight1 Please review this PR when you have a chance. The CI failures seem to be caused by unrelated, flaky tests.

@Joshua-Chin force-pushed the text-generation-pipeline-tokenizer-kwargs branch from 92b4d49 to 2fe0979 on August 22, 2025 at 01:03
@Rocketknight1 (Member) left a comment

This LGTM! cc @gante just in case you have opinions about the max_length generate kwarg clash.
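
(For context, the clash presumably refers to `max_length` doubling as both a tokenizer truncation argument and a `generate()` argument; a hedged illustration, reusing the hypothetical `generator` from the sketch above:)

```python
# Assumed illustration of the potential clash, not a confirmed failure mode:
# `max_length` caps prompt truncation when given to the tokenizer, but caps
# total output length when given to generate(), so the two should not be mixed.
outputs = generator(
    "A very long prompt ...",
    max_length=128,  # picked up as a generate() kwarg
    tokenizer_kwargs={"truncation": True, "max_length": 512},  # tokenizer-side limit
)
```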

@Rocketknight1 (Member)

Also @Joshua-Chin you may need to rebase to fix some conflicts before we can merge the PR! That should also clear up the CI issues.

@gante (Member) left a comment

One question about variable names, otherwise lgtm :)

@@ -285,6 +289,9 @@ def __call__(self, text_inputs, **kwargs):
- `None` : default strategy where nothing in particular happens
- `"hole"`: Truncates left of input, and leaves a gap wide enough to let generation happen (might
truncate a lot of the prompt and not suitable when generation exceed the model capacity)
tokenizer_kwargs (`dict`, *optional*):
@gante (Member) commented on Aug 22, 2025:

perhaps tokenizer_encode_kwargs? There are also kwargs used at decode time, and we don't want to mix the two

cc @Rocketknight1

@Joshua-Chin (Author)

@gante I updated the argument to tokenizer_encode_kwargs. Please take another look when you have a chance.
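
For illustration, a rough sketch of how the renamed argument could be threaded through preprocessing (simplified and hypothetical; not the PR's actual diff):

```python
# Hypothetical, simplified sketch of TextGenerationPipeline.preprocess
# forwarding a `tokenizer_encode_kwargs` dict; not the PR's actual code.
def preprocess(self, prompt_text, tokenizer_encode_kwargs=None, **other_preprocess_params):
    tokenizer_encode_kwargs = tokenizer_encode_kwargs or {}
    if isinstance(prompt_text, list):
        # Chat-style input: forward the kwargs to the chat template,
        # e.g. {"enable_thinking": False} for Qwen3 / SmolLM3.
        inputs = self.tokenizer.apply_chat_template(
            prompt_text,
            add_generation_prompt=True,
            return_dict=True,
            return_tensors=self.framework,
            **tokenizer_encode_kwargs,
        )
    else:
        # Plain-string input: forward the kwargs to the regular encode call.
        inputs = self.tokenizer(prompt_text, return_tensors=self.framework, **tokenizer_encode_kwargs)
    inputs["prompt_text"] = prompt_text
    return inputs
```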

@Joshua-Chin force-pushed the text-generation-pipeline-tokenizer-kwargs branch from 2fe0979 to e484dbb on August 22, 2025 at 17:26
@Joshua-Chin (Author)

The CI is currently failing because of the following test, added by a recently merged change (HunYuan opensource #39606):

FAILED tests/models/hunyuan_v1_moe/test_modeling_hunyuan_v1_moe.py::HunYuanMoEV1ModelTest::test_generate_compile_model_forward_fullgraph - torch._dynamo.exc.Unsupported: Dynamic shape operator

@Joshua-Chin force-pushed the text-generation-pipeline-tokenizer-kwargs branch from d80a814 to f1d1dc1 on August 22, 2025 at 20:46