-
Notifications
You must be signed in to change notification settings - Fork 465
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Refine DeciLM dtype handling in HF PTQ
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1869
opened Jun 30, 2026 by
realAsma
Contributor
Loading…
Add recipe used for Qwen3.5 397B NVFP4 V2 checkpoint
#1868
opened Jun 30, 2026 by
sugunav14
Contributor
Loading…
fix(skills): unblock recurring day0 eval/deploy failures (judge 401, AA-LCR ctx, sm_103/cu130, native-quant baseline)
#1863
opened Jun 30, 2026 by
Edwardf0t1
Contributor
Loading…
fix: prevent UnboundLocalError masking real errors in fsdp2_aware_weight_update
#1860
opened Jun 30, 2026 by
dinhxuanvu
Loading…
Reintroduce AIPerf for performance benchmarking, clean up docs and text
#1855
opened Jun 29, 2026 by
nfasfous
Loading…
[Fix]: Add Final Norm for vLLM Hidden Extractor
#1846
opened Jun 28, 2026 by
h-guo18
Contributor
Loading…
docs(eval): add NEL v0.3.0 migration guide + example configs
#1845
opened Jun 28, 2026 by
hychiang-git
Contributor
Loading…
launcher: fix host=None when _factory_ is dropped by nemo_run --yaml path
#1842
opened Jun 27, 2026 by
ChenhanYu
Collaborator
Loading…
3 tasks
refactor(examples): consolidate puzzletron examples under examples/pruning/puzzletron
#1841
opened Jun 27, 2026 by
valter-silva-au
Loading…
specdec(recipe): add MiniMax-M2.7-DFlash streaming multi-node pipeline
#1835
opened Jun 26, 2026 by
yeyu-nvidia
Contributor
Loading…
3 tasks
feat(export): quant-aware reverse weight conversion for unified HF export
#1833
opened Jun 26, 2026 by
Edwardf0t1
Contributor
•
Draft
Add Qwen-Image DMD2 PTQ support; save quantizer state (amax) without weights
#1827
opened Jun 25, 2026 by
jingyu-ml
Contributor
Loading…
Emit VisualGen-compatible sparse_attention_config for diffusion skip-softmax export
#1816
opened Jun 24, 2026 by
jingyu-ml
Contributor
Loading…
Add NVFP4 Conv3d export for diffusers VAE (Wan 2.2)
#1809
opened Jun 23, 2026 by
jingyu-ml
Contributor
Loading…
Support FP8 per block (weight + dynamic per token activation) export
#1807
opened Jun 23, 2026 by
sugunav14
Contributor
Loading…
MiniMax-M3 mixed MXFP8-base + NVFP4-experts PTQ export
#1806
opened Jun 23, 2026 by
chadvoegele
Contributor
Loading…
Puzzletron tutorial fixes for runtime optimization
#1803
opened Jun 23, 2026 by
grzegorz-k-karch
Contributor
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.