feat: Expand Eagle Speculator to Support Multiple Transformer Layer Types #49
Merged
Conversation
markurtz reviewed Jul 11, 2025
Fynn Schmitt-Ulms (Author, Collaborator): Hi @markurtz, I've implemented the requested changes. Let me know what you think.
shanjiaz pushed a commit that referenced this pull request on Jul 14, 2025:
feat: Expand Eagle Speculator to Support Multiple Transformer Layer Types (#49)

Closes #44

Expands support for other decoder layer types (`MistralDecoderLayer`, `Qwen3DecoderLayer`, etc.)

> # Tasks
> * [x] Research and identify transformer layer classes within Hugging Face for each targeted architecture.
> * [x] Update `EagleSpeculatorConfig` to include architecture type selection. This is handled using the existing `transformer_layer_architecture` field to specify the decoder layer class. The `transformer_layer_config` must also match the decoder layer type: to use `LlamaDecoderLayer`, `transformer_layer_config` must be an instance of `LlamaConfig`; for `MistralDecoderLayer` it must be an instance of `MistralConfig`, and so on.
> * [x] Update `EagleSpeculator` to construct the selected transformer layer type correctly. We find the corresponding decoder layer class (and layer norm class) by using the config class to determine the model type and import path. This generalizes the approach so that we can use **any** decoder layer and config combination in the transformers library.
> * [x] Update or create relevant tests in:
>   * `tests/unit/models/test_eagle_config.py`
>   * `tests/unit/models/test_eagle_model.py`
>
>   Added explicit tests for the architectures listed in #44 (Llama, Mistral, Qwen, DeepSeek, Gemma, Granite).
> * [x] Ensure compatibility with `SpeculatorModelConfig.from_pretrained` and `SpeculatorModel.from_pretrained`.

Signed-off-by: Fynn Schmitt-Ulms <fschmitt@redhat.com>
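The pairing constraint described in the task list is easiest to see in code. Below is a minimal sketch of building a Mistral-based drafter config; it assumes `EagleSpeculatorConfig` accepts these fields as keyword arguments, and the import path shown is hypothetical, not taken from this PR's diff:

```python
from transformers import MistralConfig

# Hypothetical import path; the actual module layout may differ.
from speculators.models.eagle import EagleSpeculatorConfig

# The decoder layer class and its config must match: MistralDecoderLayer
# pairs with MistralConfig, LlamaDecoderLayer with LlamaConfig, etc.
config = EagleSpeculatorConfig(
    transformer_layer_architecture="MistralDecoderLayer",
    transformer_layer_config=MistralConfig(),
)
```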
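The second task says the decoder layer (and layer norm) class is located from the config class via the model type and import path. A rough sketch of that resolution strategy, assuming transformers' usual `transformers.models.<model_type>.modeling_<model_type>` module naming convention (illustrative only, not the PR's exact code):

```python
import importlib

from transformers import PretrainedConfig


def resolve_transformer_class(config: PretrainedConfig, class_name: str) -> type:
    """Illustrative sketch: look up a class such as "MistralDecoderLayer" or
    "MistralRMSNorm" in the modeling module matching the config's model_type."""
    # e.g. model_type "mistral" -> transformers.models.mistral.modeling_mistral
    module = importlib.import_module(
        f"transformers.models.{config.model_type}.modeling_{config.model_type}"
    )
    return getattr(module, class_name)


# Example: resolve_transformer_class(MistralConfig(), "MistralDecoderLayer")
```

Because the lookup is driven entirely by the config's `model_type`, any decoder layer and config pair that follows this convention resolves without per-architecture special cases.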
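For the testing task, one natural shape for per-architecture coverage is a parametrized test. The sketch below is hypothetical (the actual tests live in `tests/unit/models/test_eagle_config.py` and `tests/unit/models/test_eagle_model.py`, and the `EagleSpeculatorConfig` import is assumed as above):

```python
import pytest
from transformers import LlamaConfig, MistralConfig

# Hypothetical import path, as in the sketch above.
from speculators.models.eagle import EagleSpeculatorConfig


@pytest.mark.parametrize(
    ("architecture", "layer_config"),
    [
        ("LlamaDecoderLayer", LlamaConfig()),
        ("MistralDecoderLayer", MistralConfig()),
    ],
)
def test_layer_architecture_round_trip(architecture, layer_config):
    # Each architecture is constructed with its matching config class.
    config = EagleSpeculatorConfig(
        transformer_layer_architecture=architecture,
        transformer_layer_config=layer_config,
    )
    assert config.transformer_layer_architecture == architecture
```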