
[Tests][Eagle3] Extend vLLM test cases with conversion step #93

Merged
dsikka merged 6 commits into main from extend_vllm_test
Aug 27, 2025

Conversation

Collaborator

@dsikka dsikka commented Aug 25, 2025

Summary

  • Updates the vLLM token assertion
  • Updates how vLLM is detected, if installed
  • Extends the vLLM test flow to add an additional test with conversion
  • Converts a speculators model and a model from the Eagle3 repo
  • Runs the converted models in vLLM and asserts that valid tokens are returned
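The "asserts that valid tokens are returned" step could be sketched as a small helper. This is a sketch only; `assert_valid_token_count` and the bound of 20 are illustrative assumptions, not the repo's actual code:

```python
def assert_valid_token_count(token_ids, max_tokens: int = 20) -> None:
    """Accept any generation length in (1, max_tokens].

    vLLM may stop early at an EOS/stop token, so asserting that
    exactly max_tokens were returned is brittle.
    """
    n = len(token_ids)
    assert 1 < n <= max_tokens, f"expected 1 < tokens <= {max_tokens}, got {n}"
```

In a real test this would be called on the generated token IDs of each request output (e.g. the `token_ids` of a vLLM completion) after running the converted model.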

github-actions bot commented Aug 25, 2025

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/vllm-project/speculators/actions/runs/17244541644/artifacts/3855588296.
They will be retained for up to 30 days.
Commit: d99db49

@dsikka dsikka marked this pull request as ready for review August 25, 2025 18:34
@dsikka dsikka requested review from markurtz and rahul-tuli August 25, 2025 18:34
@dsikka dsikka changed the title from [Tests] Extend vLLM test cases with conversion step to [Tests][Eagle3] Extend vLLM test cases with conversion step Aug 25, 2025
Collaborator

@shanjiaz shanjiaz left a comment


🎉

Collaborator

@markurtz markurtz left a comment


One major issue that I left as a comment on the code; otherwise, there are several minor cleanups that we could get in for this or for a later PR:

  • The sampling params use temperature and top_p, which introduces extra, unneeded variability into the tests
  • The test asserts that the number of returned tokens is exactly 20. It should be updated to >1 and <=20, or we should null out the stop tokens so generation is forced to reach 20
  • There's no cleanup of the vLLM resources (del llm, gc.collect, empty_cache, etc.). A parameterized fixture with a yield would ensure both setup and teardown always run
  • Currently we only check that vLLM can be imported; we likely want an extra check for torch.cuda / GPU availability so we don't waste resources
  • We could require hf_transfer in the dev environments and set HF_HUB_ENABLE_HF_TRANSFER for faster downloads
  • _run_vllm_engine runs as a helper, so failures inside it won't surface properly in pytest
  • Enabling a test config/settings file longer term would be good, so we can pull things like gpu_memory_utilization into a central place and update them for the workers running tests
  • Add baseline/interface comparisons for what the conversion output should look like before running it in vLLM
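The yield-based fixture suggested above could look like the following sketch. `FakeLLM` is a stand-in for `vllm.LLM` so the setup/teardown pattern can be shown without a GPU; the model name and all helper names are illustrative assumptions:

```python
import gc


class FakeLLM:
    """Stand-in for vllm.LLM so the pattern can be shown in isolation.

    In a real suite this would be the actual engine; the point here is
    only the guaranteed-teardown shape of the fixture.
    """

    def __init__(self, model: str):
        self.model = model
        self.closed = False

    def shutdown(self) -> None:
        self.closed = True


def llm_fixture(model: str = "example/model"):  # hypothetical model name
    """Yield-based setup/teardown.

    In a real suite, decorate with @pytest.fixture(params=[...]) and
    skip when no GPU is present, e.g. via
    pytest.mark.skipif(not torch.cuda.is_available(), reason="needs GPU").
    """
    llm = FakeLLM(model)
    try:
        yield llm  # the test body runs here
    finally:
        # teardown always runs, even if the test body raised
        llm.shutdown()
        del llm
        gc.collect()
        # in a real suite: torch.cuda.empty_cache()
```

Because the teardown sits in a `finally` block after the `yield`, the engine is released whether the test passes, fails, or errors, which addresses the missing-cleanup point above.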
Collaborator

@markurtz markurtz left a comment


I missed one of the pytest skips in the original review; approving with a note that the potential future improvements should go onto the backlog.

@dsikka dsikka merged commit b5030c5 into main Aug 27, 2025
12 checks passed
@dsikka dsikka deleted the extend_vllm_test branch August 27, 2025 17:05
