There is no denying the potential benefit AI capabilities will have on business outcomes as the technology and its use mature. On the one hand, this is an amazing technical capability with tremendous potential. On the other hand, it is another form of compute resource that must be managed like earlier compute resources, is intrinsically tied to financial measures, and brings renewed data security challenges.
Because AI is an emerging technology, a lack of standardization, automation, and IT skills, along with privacy concerns, is slowing time to value for AI infrastructure requests. This creates a bottleneck that slows the deployment of AI workloads and makes it difficult to manage and optimize the resources allocated to them. To help tackle these challenges, VMware announced the launch of VMware Private AI. This architectural approach for AI services enables privacy and control of corporate data, choice of open source and commercial AI solutions, quick time-to-value, and integrated security and management. With VMware Private AI, you get the flexibility to run a range of AI solutions in your environment, and you can deploy with confidence, knowing that VMware has built partnerships with the leading AI providers.
VMware and NVIDIA also announced plans to collaborate on a fully integrated Generative AI platform called VMware Private AI Foundation with NVIDIA. This platform will enable enterprises to fine-tune LLMs and run inference workloads in their data centers, addressing privacy, choice, cost, performance, and compliance concerns. The platform will include the NVIDIA NeMo™ framework, NVIDIA LLMs, and other community models (such as those from Hugging Face) running on VMware Cloud Foundation. We will share more information on this exciting platform, which will launch in early 2024.
In this tenth episode of the Multi-Cloud Expedition, join host Alexander Romero, Senior Director of Cross-Cloud Services at VMware, as he leads a discussion around VMware Private AI and our partnership with NVIDIA enabling IT teams to embrace generative AI with privacy, choice, cost management, performance, and compliance. The Multi-Cloud Expedition is a Livestream series airing monthly on LinkedIn providing opportunities for IT professionals to learn more about the best practices, technologies, and solutions VMware offers for multi-cloud management. With each broadcast, VMware and industry experts tackle a specific topic, providing real-life scenarios and product demonstrations that showcase the challenges that customers face and how VMware can help overcome these challenges.
Join us and learn how VMware and NVIDIA can accelerate AI model deployments, enhance productivity, optimize performance, offer flexibility, and safeguard privacy.
Follow this event on LinkedIn to receive notifications!
The Multi-Cloud Expedition Episode 10: VMware’s Take on AI Ready Infrastructure Recap
Chapter Segments:
0:14 – Host Welcome, Alexander Romero, Sr. Director Cross-Cloud Services: What is the Multi-Cloud Expedition? Customer Journey from cloud first to cloud smart.
2:09 – Overview of this episode: VMware’s Take on AI Ready Infrastructure.
2:52 – Industry Expert Intro: Charlie Huang, NVIDIA Product Management & Marketing; Accelerated Computing & Artificial Intelligence
4:46 – Evolution of AI from Predictive to Generative AI. Example: ChatGPT.
7:14 – Industry drivers as the catalyst for exponential interest in Generative AI – finding cancer faster or personalizing shopping experiences.
10:36 – AI and the use of GPUs is not new. VMware and NVIDIA innovation history.
14:07 – How do enterprises use Hugging Face and OpenAI models? And how do NVIDIA solutions fit in the industry?
17:23 – Keeping enterprise data secure and private for regulations and competitive advantage.
18:55 – SME Intro: Shobhit Bhutani, VMware Principal Product Marketing Manager
20:48 – Challenges of Generative AI today, mainly privacy. Avoiding intellectual property breaches while seeing the benefits of LLMs and AI.
26:04 – Avoiding lock-in: customers want the same choice in their AI journey that they have with multi-cloud, along with performance and compliance, while managing costs.
31:55 – How VMware helps. Announcement of VMware Private AI.
25:44 – VMware Private AI Foundation with NVIDIA announced at VMware Explore Las Vegas (available early 2024). Generative AI easy button.
40:29 – Customer example: Helping build a Generative AI model for a large format retailer.
43:16 – Demo: NVIDIA and VMware Generative AI Integration into Enterprise Clouds.
46:57 – Demo: How allocating GPUs within vSphere looks for VI Admins. Q&A with moderator Leanne Jones, Director Cross-Cloud Services
48:45 – Q&A: What other ecosystem partners does VMware work with beyond NVIDIA?
51:22 – Q&A: Customer examples making progress today with AI.
53:40 – How to get started today? Reference architecture for VMware Private AI.
55:48 – Wrap up and topic review including key links to learn more around VMware AI solutions:
- https://www.vmware.com/products/vsphere/ai-ml.html
- https://core.vmware.com/blog/using-nvidias-aiml-frameworks-generative-ai-vmware-vsphere
- https://blogs.vmware.com/vsphere/2023/08/introducing-vmware-private-ai-foundation.html
Special thank you to Justin Murray for creating the demos and Shannon Waddell for producing.
The Multi-Cloud Expedition Livestream Series Continues!
The Multi-Cloud Expedition started in February 2023. Miss an episode? Check out our blog site for episode details and recording links: https://crosscloud.vmware.com/multi-cloud-expedition
Wondering what’s next? Here’s what we’re planning (subject to change):
- September 27: VMware’s Take on AI Ready Infrastructure
- October 25: Optimizing Multi-Cloud Workload Placement
- November 8: Live from VMware Explore Barcelona 2023 – More about VMware Private AI!