Pull requests: intel/auto-round

Pull requests list

add llmc ut back
#1974 opened Jul 1, 2026 by yiliu30 Contributor
4 tasks
[ARK] Support gemm using sycl-tla
#1968 opened Jun 30, 2026 by Zhenzhong1 Contributor Draft
fix few bugs cause less accuracy of gguf format
#1960 opened Jun 29, 2026 by n1ck-guo Contributor
4 tasks
feat: add --dry-run VRAM/size estimation mode
#1958 opened Jun 26, 2026 by mvanhorn
Fix UltraChat chat-template handling for Transformers v5
#1941 opened Jun 22, 2026 by Copilot AI Draft
2 of 4 tasks
Add quantization support for DiffusionGemma
#1935 opened Jun 17, 2026 by lvliang-intel Contributor
1 of 4 tasks
Added prefill strategy benchmarking script and results
#1923 opened Jun 15, 2026 by jijiaz
[draft]refine device
#1900 opened Jun 9, 2026 by wenhuach21 Contributor Draft
4 tasks
feat: add overlap function for multi-blocks compression
#1850 opened May 25, 2026 by ZaneMark Contributor
3 tasks
feat: support Nemotron-H / Nemotron-Cascade-2 (#1711)
#1712 opened Apr 20, 2026 by michael-rabe
4 of 9 tasks
Continuously optimize AutoScheme RAM consumption
#1703 opened Apr 17, 2026 by lvliang-intel Contributor
2 of 9 tasks