Skip to content

[WIP] phase 1 of peagle support#343

Draft
shanjiaz wants to merge 10 commits intomainfrom
add-peagle-support
Draft

[WIP] phase 1 of peagle support#343
shanjiaz wants to merge 10 commits intomainfrom
add-peagle-support

Conversation

@shanjiaz
Copy link
Copy Markdown
Collaborator

@shanjiaz shanjiaz commented Mar 13, 2026

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.

Purpose

P-EAGLE (Parallel EAGLE) is a parallel speculative decoding method that extends EAGLE-3 with multi-token prediction, potentially offering 2-3x speedups over sequential EAGLE-3 drafting. Adding this feature would enable us to train a P-EAGLE speculator model.

See RFC 292

Description

  • Added peagle model & config definition with peagle specific features like learnable hidden_mask
  • COD sampling for geometric decar across depth
  • Use flex attention for peagle attention mask construction
  • training & trainer logging support

Related Issue

See RFC 292

Tests

Still have to verify acceptance length for qwen 30b model.

I have filled in:

  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan/results, such as providing test command and pasting the results.
  • (Optional) The necessary documentation update.
  • I (a human) have written or reviewed the code in this pr to the best of my ability.
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
@mergify
Copy link
Copy Markdown

mergify bot commented Mar 13, 2026

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @shanjiaz.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Mar 13, 2026
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
@shanjiaz shanjiaz changed the title phase 1 of peagle support Mar 13, 2026
@mergify mergify bot removed the needs-rebase label Mar 13, 2026
@github-actions
Copy link
Copy Markdown

github-actions bot commented Mar 13, 2026

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/vllm-project/speculators/actions/runs/23116052974/artifacts/5933194505.
They will be retained for up to 30 days.
Commit: 99cc20e

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
@mergify
Copy link
Copy Markdown

mergify bot commented Mar 17, 2026

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @shanjiaz.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Mar 17, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

1 participant