main add ascend scheduler support multimodal #2844

fan2956 · 2025-09-09T15:09:35Z

What this PR does / why we need it?

On main, AscendScheduler does not support Multimodels, becuse of lacking of scheduled_encoder_inputs which is need on multimodels inference

Does this PR introduce any user-facing change?

No

How was this patch tested?

vLLM version: main@93e28e6862669e3b5cf47cea9f782a65ec47e155

vLLM version: v0.10.2rc2
vLLM main: vllm-project/vllm@15b8fef

gemini-code-assist

Code Review

This pull request adds support for multimodal models in the AscendScheduler by implementing the scheduling of encoder inputs. The changes correctly remove the previous restriction and add the necessary logic for handling encoder inputs in both prefill and decode paths. However, I've identified a significant code duplication for the encoder cache allocation logic, which is present in both the prefill and decode scheduling loops. I've left a comment suggesting to refactor this duplicated code into a helper method to improve maintainability.

vllm_ascend/core/scheduler.py

github-actions · 2025-09-09T15:15:13Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

github-actions · 2025-09-10T00:50:48Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

wangxiyuan · 2025-09-10T10:04:50Z

please rebase to main to fix the merge conflict

fan2956 · 2025-09-10T10:54:16Z

fix the merge conflict

codecov · 2025-09-12T07:07:02Z

Codecov Report

❌ Patch coverage is 52.00000% with 12 lines in your changes missing coverage. Please review.
✅ Project coverage is 75.02%. Comparing base (1bbb20e) to head (d90bde6).
⚠️ Report is 28 commits behind head on main.

Files with missing lines	Patch %	Lines
vllm_ascend/core/scheduler.py	45.45%	12 Missing ⚠️

❌ Your patch check has failed because the patch coverage (52.00%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2844      +/-   ##
==========================================
+ Coverage   74.76%   75.02%   +0.26%     
==========================================
  Files         150      154       +4     
  Lines       20891    21297     +406     
==========================================
+ Hits        15620    15979     +359     
- Misses       5271     5318      +47

Flag	Coverage Δ
unittests	`75.02% <52.00%> (+0.26%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2025-09-12T15:21:21Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: fan2956 <zhoufan53@huawei.com>

fan2956 changed the title ~~add ascend scheduler support multimodel~~ main add ascend scheduler support multimodel Sep 9, 2025

gemini-code-assist bot reviewed Sep 9, 2025

View reviewed changes

vllm_ascend/core/scheduler.py Show resolved Hide resolved

vllm_ascend/core/scheduler.py Show resolved Hide resolved

github-actions bot added the merge-conflicts label Sep 10, 2025

fan2956 changed the title ~~main add ascend scheduler support multimodel~~ main add ascend scheduler support multimodal Sep 10, 2025

fan2956 force-pushed the my_main branch from 23861d1 to 52c11d5 Compare September 10, 2025 09:54

github-actions bot added module:tests and removed merge-conflicts labels Sep 10, 2025

fan2956 force-pushed the my_main branch from 5208843 to 0e9b7a0 Compare September 10, 2025 10:50

fan2956 force-pushed the my_main branch from 098bc25 to d90bde6 Compare September 12, 2025 07:18

github-actions bot added the merge-conflicts label Sep 12, 2025

zhoufan2956 and others added 9 commits September 13, 2025 17:16

add ascend scheduler support multimodel

77994f4

Signed-off-by: fan2956 <zhoufan53@huawei.com>

add scheduler config UT

2562b39

Signed-off-by: fan2956 <zhoufan53@huawei.com>

fix lint

d748488

Signed-off-by: fan2956 <zhoufan53@huawei.com>

fix lint

e374b6d

Signed-off-by: fan2956 <zhoufan53@huawei.com>

add ascend_scheduler_ut

4fe40ca

Signed-off-by: fan2956 <zhoufan53@huawei.com>

add ascend_scheduler_ut

12bcc97

Signed-off-by: fan2956 <zhoufan53@huawei.com>

add ascend_scheduler_ut

50ca079

Signed-off-by: fan2956 <zhoufan53@huawei.com>

add ascend_scheduler_ut

88b887d

Signed-off-by: fan2956 <zhoufan53@huawei.com>

fix UT

4097334

Signed-off-by: fan2956 <zhoufan53@huawei.com>

fan2956 force-pushed the my_main branch from 2d98935 to 4097334 Compare September 13, 2025 09:19

github-actions bot removed the merge-conflicts label Sep 13, 2025

wangxiyuan approved these changes Sep 13, 2025

View reviewed changes

wangxiyuan added ready read for review ready-for-test start test by label for PR labels Sep 13, 2025

wangxiyuan merged commit c5a502f into vllm-project:main Sep 14, 2025
44 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

main add ascend scheduler support multimodal #2844

main add ascend scheduler support multimodal #2844

Uh oh!

fan2956 commented Sep 9, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Sep 9, 2025

Uh oh!

github-actions bot commented Sep 10, 2025

Uh oh!

wangxiyuan commented Sep 10, 2025

Uh oh!

fan2956 commented Sep 10, 2025

Uh oh!

codecov bot commented Sep 12, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 12, 2025

Uh oh!

Uh oh!

Uh oh!

main add ascend scheduler support multimodal #2844

main add ascend scheduler support multimodal #2844

Uh oh!

Conversation

fan2956 commented Sep 9, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Sep 9, 2025

Uh oh!

github-actions bot commented Sep 10, 2025

Uh oh!

wangxiyuan commented Sep 10, 2025

Uh oh!

fan2956 commented Sep 10, 2025

Uh oh!

codecov bot commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Sep 12, 2025

Uh oh!

Uh oh!

Uh oh!

fan2956 commented Sep 9, 2025 •

edited by github-actions bot

Loading

codecov bot commented Sep 12, 2025 •

edited

Loading