Broadcom Announces VMware Cloud Foundation 9.1
PALO ALTO, Calif., May 5, 2026 — Broadcom Inc., a global technology leader that designs, develops, and supplies semiconductor and infrastructure software solutions, today announced VMware Cloud Foundation (VCF) 9.1, a secure and cost-effective infrastructure platform for production AI workloads. VCF 9.1 delivers an AI and Kubernetes native private cloud platform with integrated security and mixed compute infrastructure support across AMD, Intel, and NVIDIA. This enables enterprises to deploy inference and agentic AI applications with significantly lower costs, enhanced security, and freedom to choose best-of-breed GPU and CPU hardware.
A preview of Broadcom’s Private Cloud Outlook 2026 report reveals private cloud continues to be the preferred platform for production AI. More than half of organizations surveyed (56%) are running or planning to run production inferencing in a private cloud. More importantly, public cloud use for production inference was 41%, down 15% year over year. Additionally, 62% of IT leaders reported being very or extremely concerned about generative AI infrastructure costs, while 36% reported that AI is driving new requirements for data protection, privacy, security controls, and risk management.
VMware Cloud Foundation provides a better alternative to public cloud for production workloads through intelligent software that maximizes infrastructure efficiency on existing servers while providing architectural control and regulatory compliance capabilities essential for production AI deployments. VMware Cloud Foundation 9.1 will enable enterprises to deploy production workloads including inference and agentic AI with:
- Up to 40% reduction in server costs through intelligent memory tiering for clusters running a mix of AI and non-AI workloads
- Up to 39% lower storage TCO through enhanced compression and deduplication for AI data pipelines
- Up to 46% reduction in Kubernetes operational costs for running AI workloads at scale
- 4x faster cluster upgrades and 2x increased fleet capacity to rapidly scale AI infrastructure
“As more enterprises turn to AI for driving competitive advantage, they face three critical challenges: data and IP privacy concerns, surging infrastructure costs, and their readiness for the world of agentic AI,” said Krish Prasad, senior vice president and general manager, VMware Cloud Foundation Division, Broadcom. “VCF 9.1 is a single unified platform that addresses all three and delivers one of the most advanced infrastructures for Private AI. We enable zero-trust security for AI, reduce costs through intelligent infrastructure optimization and hardware choice, and provide the flexibility to run both agentic workflows and accelerated inferencing on the same platform.”
Efficient Infrastructure at Scale for AI Workloads
VCF 9.1 maximizes density for both VM and containerized AI workloads on existing infrastructure while dramatically reducing operational complexity. Through intelligent resource management and automated operations, enterprises can deploy more production workloads on current servers, scale efficiently across distributed environments, and eliminate the need for costly infrastructure expansion during a period of hardware shortage and rising costs. Key capabilities include:
- Intelligent resource optimization that maximizes infrastructure utilization through advanced memory tiering and next-generation storage compression for AI data pipelines, enabling higher AI workload density without performance compromises or expensive hardware refresh.
- Automated fleet operations at scale that double management capacity to 5,000 hosts and deliver 4x faster cluster upgrades across distributed and air-gapped environments, eliminating manual patching overhead while supporting rapid AI infrastructure expansion.
- Multi-tenant infrastructure for AI isolation that enables enterprises and service providers to run multiple AI projects and customers on shared infrastructure with strict security boundaries, maximizing utilization of expensive GPU and CPU resources while supporting data sovereignty for sensitive models.
- Open ecosystem integration that delivers multi-accelerator GPU choice across AMD and NVIDIA, support for leading AMD and Intel CPU platforms, and standards-based EVPN and VXLAN interoperability with Arista Universal Cloud Network, demonstrating VCF’s commitment to providing the high-performance connectivity and compute flexibility production AI demands.
- High-speed networking for AI workloads through VCF support for NVIDIA ConnectX-7 NICs and NVIDIA BlueField-3 with Enhanced DirectPath I/O. This enhancement enables high-speed, multi-host AI model training and data transfer, both crucial for demanding generative AI workloads.
- Virtualized load balancing and security with VMware Avi Load Balancer and VMware vDefend that eliminate hardware appliance requirements for AI inference endpoints and agentic applications, reducing capital expense while providing enterprise-grade resilience and automated lifecycle management.
High Velocity App Delivery: Modern Workload Platform for AI, Containers, and VMs
VCF 9.1 delivers a unified platform that accelerates AI application deployment by running inference workloads, agentic applications, containerized services, and traditional VMs on a single infrastructure layer. This eliminates operational fragmentation and the cost of managing separate stacks while providing the developer velocity and platform governance that production AI requires. Key capabilities include:
- Kubernetes scale and performance for AI that delivers 2.6x increased cluster scale, 70% faster deployments, and 75% shorter upgrade windows compared to previous versions, along with seamless scaling that enables zero downtime for production AI services.
- Mixed compute management that efficiently handles both CPU-intensive agentic AI workflows and GPU-accelerated inference on a unified platform, addressing the reality that agentic workloads demand significantly more CPU than GPU capacity for workflow execution and decision orchestration.
- AI observability and governance that provides detailed metrics for time to first token, token throughput, and GPU utilization across multiple accelerator types, enabling enterprises to maximize infrastructure ROI through precise hardware utilization monitoring while centralized policy injection and data sovereignty controls enable AI compliance enforcement and secure model access.
- Live application stack blueprints that capture multi-VM applications as reusable templates for rapid environment deployment, eliminating manual configuration errors and preventing configuration drift across development, test, and production environments while accelerating infrastructure delivery velocity.
Zero-Trust Architecture for AI Data Sovereignty and Governance
VCF 9.1 integrates security at the infrastructure layer to protect AI workloads, proprietary models, and training data from hypervisor to application. By delivering zero-trust segmentation, sovereign recovery, and continuous patching without bolt-on tools, VCF strengthens the security posture essential for production AI deployments that public cloud environments cannot match. Key capabilities include:
- On-premises ransomware recovery that provides isolated recovery environments and integrated validation tools, including new CrowdStrike Falcon Endpoint Security support, to protect AI models and training data (significant intellectual property) from cross-border movement while avoiding massive bandwidth fees during crisis restoration.
- Continuous compliance enforcement that maintains regulatory adherence through centralized monitoring and automated desired state remediation for workloads and VCF stack components, enabling enterprises to demonstrate audit readiness for production AI deployments without manual overhead or separate compliance tools.
- Zero-downtime live patching that supports up to 80% of use cases without host evacuation or maintenance windows, eliminating disruption to production AI inference services and agentic applications that require continuous availability for service level agreements.
- Zero-trust lateral security that extends distributed IDS/IPS protection to Kubernetes AI workloads for the first time, delivering 9 Tbps threat inspection performance for distributed inference and 5x increased application identification for private cloud and internet applications.
- Self-service security with automation that provides centralized tagging, pre-defined security profiles, delegated firewall configurations and ingress web application security, enabling enterprises and service providers to secure AI deployments without operational complexity or fragmented security toolchains.
About Broadcom
Broadcom Inc. (NASDAQ: AVGO) is a technology leader that designs, develops, and supplies semiconductors and infrastructure software for global organizations’ complex, mission-critical needs. Broadcom combines long-term R&D investment with superb execution to deliver the best technology, at scale. Broadcom is a Delaware corporation headquartered in Palo Alto, CA. For more information, visit www.broadcom.com.
Source: Broadcom