This module is used to export Megatron Core models to different inference frameworks. Currently we support export to TensorRT-LLM; support for vLLM and other frameworks will be added in the future.
Follow the TensorRT Model Optimizer examples to perform post-training quantization (PTQ), followed by an export to an HF-like checkpoint for TensorRT-LLM, vLLM, and SGLang deployment. A minimal sketch of this flow is shown below.
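As a rough illustration, a PTQ-and-export sketch with Model Optimizer might look like the following. The model and calibration dataloader are placeholders you must supply, and the specific quantization config (`mtq.FP8_DEFAULT_CFG` here) and `export_hf_checkpoint` usage should be checked against the Model Optimizer examples for your model:

```python
# Minimal sketch, assuming the TensorRT Model Optimizer PTQ APIs.
import modelopt.torch.quantization as mtq
from modelopt.torch.export import export_hf_checkpoint

model = ...  # placeholder: your model, loaded in eval mode
calib_dataloader = ...  # placeholder: a small set of representative batches

def forward_loop(model):
    # Run calibration data through the model so ModelOpt can collect
    # activation statistics for quantization.
    for batch in calib_dataloader:
        model(batch)

# Quantize in place using one of the predefined configs (FP8 shown here).
model = mtq.quantize(model, mtq.FP8_DEFAULT_CFG, forward_loop)

# Export an HF-like checkpoint consumable by TensorRT-LLM, vLLM, or SGLang.
export_hf_checkpoint(model, export_dir="/path/to/export_dir")
```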
Follow the instructions in trtllm_export to export to the TensorRT-LLM checkpoint format alone.
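For orientation, a hypothetical single-device sketch following the `TRTLLMHelper` pattern from `megatron.core.export` is shown below. Constructor arguments beyond those shown are omitted, and the exact signatures are assumptions; defer to the trtllm_export example scripts:

```python
# Hypothetical sketch of converting a Megatron Core GPT model's weights to the
# TensorRT-LLM checkpoint format; exact arguments may differ per model.
from megatron.core.export.data_type import DataType
from megatron.core.export.model_type import ModelType
from megatron.core.export.trtllm.trtllm_helper import TRTLLMHelper

trtllm_helper = TRTLLMHelper(
    transformer_config=transformer_config,  # the TransformerConfig used to build the model
    model_type=ModelType.gpt,
    # ... additional model-specific arguments omitted here
)

# Convert the Megatron Core state dict into TensorRT-LLM weights and config.
weights, config = trtllm_helper.get_trtllm_pretrained_config_and_model_weights(
    model_state_dict=model.state_dict(),
    dtype=DataType.bfloat16,
)
```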