Pull requests: vllm-project/vllm
Pull requests list
[Bugfix] validate grammar and throw 400 error instead of crashing the engine when xgrammar validation fails
structured-output
v1
#17623
opened May 4, 2025 by
Jason-CKY
Enable Pydantic mypy checks and convert configs to Pydantic dataclasses
frontend
structured-output
tpu
Related to Google TPUs
#17599
opened May 2, 2025 by
hmellor
[V1] Disable pickle by default for new serial_utils usage
v1
#17596
opened May 2, 2025 by
russellb
[Security] Document StatelessProcessGroup security concerns
documentation
Improvements or additions to documentation
#17591
opened May 2, 2025 by
russellb
[Model] 1.58bits BitNet Model Support
documentation
Improvements or additions to documentation
#17588
opened May 2, 2025 by
Alex4210987
Make key optional for rotary embedding
ready
ONLY add when PR is ready to merge/full CI is needed
#17566
opened May 1, 2025 by
sarckk
AMD experimental all tests updated EXPERIMENT (no need to merge)
ci/build
needs-rebase
#17556
opened May 1, 2025 by
Alexei-V-Ivanov-AMD
[prototype] prioritized block soft pinning/evictions
documentation
Improvements or additions to documentation
frontend
v1
[V1] Add num_cached_tokens stats for request output
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#17519
opened May 1, 2025 by
simon-mo
[Bugfix][Model] vllm-v0 engine run eagle algo with qwen2.5 model, KeyError: 'norm.weight' bugfix
#17518
opened May 1, 2025 by
Greatpanc
[Bugfix][V1][Spec Dec] Add generator to request even when no seed is provided.
speculative-decoding
v1
#17509
opened May 1, 2025 by
luyuzhe111
[BugFix] Qwen3 tool calling failed using qwen3 reasoning parser.
documentation
Improvements or additions to documentation
frontend
tool-calling
#17506
opened Apr 30, 2025 by
Xu-Wenqing