Extreme Performance Series 2024: Enabling and Optimizing GenAI Workloads with LLMs

The Extreme Performance Series is back for 2024! This video blog series covers the highlights of recent performance work on VMware technology.

In this video blog, Todd Muirhead talks with Lan Vu about running generative AI workloads based on large language models (LLMs) on VMware Cloud Foundation and optimizing their performance.

Links to additional resources:

Power of Virtualized ML/AI delivers near bare metal performance with MLPerf Inference 4.0
VMware vSphere 8 Performance is in the Goldilocks Zone for AI/ML Training and Inference
VMware Explore 2024
Extreme Performance Series 2024