Scaling LLM/GenAI deployment with NVIDIA Triton on Amazon EKS
Triton is open source inference serving software that simplifies the inference serving process and provides high inference performance. This session explores the synergy of NVIDIA Triton and Amazon...
Watch Video