Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[PERF] Speed up of prepare_inputs / mrope
#17617 opened May 3, 2025 by vadiklyutiy Loading…
Use git-path commit in hook
#17616 opened May 3, 2025 by thomasjpfan Loading…
[BugFix] Fix --disable-log-stats in V1 server mode bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed v1
#17600 opened May 2, 2025 by njhill Loading…
[Security] Document StatelessProcessGroup security concerns documentation Improvements or additions to documentation
#17591 opened May 2, 2025 by russellb Loading…
[Model] 1.58bits BitNet Model Support documentation Improvements or additions to documentation
#17588 opened May 2, 2025 by Alex4210987 Loading…
Make key optional for rotary embedding ready ONLY add when PR is ready to merge/full CI is needed
#17566 opened May 1, 2025 by sarckk Loading…
AMD tests updated experiment ci/build
#17563 opened May 1, 2025 by Concurrensee Loading…
[WIP][V1][Spec Decode] EAGLE tree-attention v1
#17560 opened May 1, 2025 by wwl2755 Draft
3 of 9 tasks
[FEAT][ROCm]: Support AITER MLA on V1 Engine ci/build rocm Related to AMD ROCm v1
#17523 opened May 1, 2025 by vllmellm Loading…
[prototype] prioritized block soft pinning/evictions documentation Improvements or additions to documentation frontend v1
#17520 opened May 1, 2025 by simon-mo Draft
[V1] Add num_cached_tokens stats for request output ready ONLY add when PR is ready to merge/full CI is needed v1
#17519 opened May 1, 2025 by simon-mo Loading…
[BugFix] Qwen3 tool calling failed using qwen3 reasoning parser. documentation Improvements or additions to documentation frontend tool-calling
#17506 opened Apr 30, 2025 by Xu-Wenqing Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.
X