Skip to content

aws/deep-learning-containers

Repository files navigation

AWS Logo

AWS Deep Learning Containers

One stop shop for running AI/ML on AWS

Docs Β· Available Images Β· Tutorials

Auto Release - vLLM EC2 Auto Release - vLLM SageMaker Auto Release - vLLM-Omni Auto Release - Ray Auto Release - SGLang EC2 Auto Release - SGLang SageMaker


About

AWS Deep Learning Containers (DLCs) are pre-built Docker images for running AI/ML workloads on AWS. Each image is tested and patched for security vulnerabilities. For more details, visit our documentation.


πŸ”₯ What's New

πŸš€ Release Highlights

  • [2026/06/29] SGLang Server v1.1 (AL2023) β€” EC2: server-cuda-v1.1 Β· SageMaker: server-sagemaker-cuda-v1.1 Β· SGLang 0.5.13; adds NIXL KV connector for prefill/decode disaggregation and runai-model-streamer[s3,gcs,azure] for fast weight streaming from object storage; starlette CVE patch.
  • [2026/06/29] SGLang v0.5.14 β€” EC2: 0.5.14-gpu-py312-ec2 Β· SageMaker: 0.5.14-gpu-py312 Β· GLM 5.2, LiquidAI LFM2.5, Kimi-K2.7-Code, DeepSeek V4 on GB300.
  • [2026/06/14] vLLM v0.23.0 β€” EC2: 0.23.0-gpu-py312-ec2 Β· SageMaker: 0.23.0-gpu-py312 Β· Step-3.7-Flash, Cosmos3 Reasoner, Gemma 4 Unified (encoder-free), Granite Speech Plus, Cohere Mini Code; Anthropic Messages API structured output.
  • [2026/06/13] SGLang v0.5.13 β€” EC2: 0.5.13-gpu-py312-ec2 Β· SageMaker: 0.5.13-gpu-py312 Β· DeepSeek V4 (BCG, HiSparse PD, PP+PD), Kimi-K2.5, MiMo-V2, Ideogram 4 (FP8/NVFP4); SM120 + FP4 indexer support.
  • [2026/06/12] SGLang Server v1.0 (AL2023) β€” EC2: server-cuda-v1.0 Β· SageMaker: server-sagemaker-cuda-v1.0 Β· First Amazon Linux 2023 SGLang Server images, built from upstream source; OpenAI-compatible API (port 30000 EC2/EKS, 8080 SageMaker); CUDA 13.0 for H100 + Blackwell; PyTorch 2.11.0; EFA, DeepEP, and Mooncake KV-cache bundled.
  • [2026/06/05] vLLM v0.22.1 β€” EC2: 0.22.1-gpu-py312-ec2 Β· SageMaker: 0.22.1-gpu-py312 Β· JetBrains Mellum v2; DeepSeek-V4, OlmoHybrid, HyperCLOVAX fixes; AMD Zen CPU zentorch kernels.
  • [2026/05/30] vLLM v0.22.0 β€” EC2: 0.22.0-gpu-py312-ec2 Β· SageMaker: 0.22.0-gpu-py312 Β· MiniCPM-V 4.6, InternS2 Preview, OpenVLA, EXAONE-4.5; DeepSeek V4 maturity (NVFP4 fused MoE, MTP speculative decoding); Blackwell SM12x support.

πŸ“’ Support Updates

  • [2026/04/28] We cannot guarantee security patching on Ubuntu-based vLLM and SGLang images due to the lack of Ubuntu Pro licensing. Customers may continue using these images at their own discretion and risk. We recommend migrating to our Amazon Linux-based images.
  • [2026/02/10] Extended support for PyTorch 2.6 Inference containers until June 30, 2026
    • PyTorch 2.6 Inference images will continue to receive security patches and updates through end of June 2026
    • For complete framework support timelines, see our Support Policy

πŸ“ Blog Posts

πŸŽ“ Workshop


License

This project is licensed under the Apache-2.0 License.

About

One stop shop for running AI/ML on AWS.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Contributors