IBM and Intel announced the general availability of Intel Gaudi 3 AI accelerators on IBM Cloud.
IBM has announced the general availability of Intel® Gaudi® 3 AI accelerators on IBM Cloud, enhancing enterprise capabilities to deploy and scale generative AI workloads efficiently. Revealed during Intel Vision 2025, this collaboration makes Intel Gaudi 3 accessible through a public cloud environment suitable for production-level AI tasks. This strategic partnership underscores both companies’ commitment to optimizing cost-effective enterprise AI deployment, addressing critical infrastructure expenditure and scalability challenges.
Initially announced in August 2024, Intel Gaudi 3 AI accelerators will be available in the IBM Cloud regions of Frankfurt (eu-de) and Washington, D.C. (us-east), with availability in the Dallas (us-south) region expected in Q2 2025. This geographic flexibility enables global organizations to manage their AI workloads according to regional and operational requirements.
IBM’s recent “AI in Action 2024” report highlights the business value of AI, revealing that 67% of surveyed executives experienced revenue growth of at least 25% after integrating AI. Despite these encouraging figures, organizations encounter significant costs associated with deploying and maintaining high-performance AI infrastructure. Intel Gaudi 3 on IBM Cloud directly addresses these challenges by allowing enterprises to experiment, scale, and innovate with generative AI solutions more cost-effectively.
According to Saurabh Kulkarni, Vice President of Data Center AI Strategy and Product Management at Intel, this collaboration significantly boosts performance for inferencing and fine-tuning tasks, thus accelerating generative AI adoption across diverse industries.
The long-standing partnership between IBM Cloud and Intel has historically facilitated scalable and versatile infrastructure solutions, allowing clients to dynamically align computing resources with their business needs. Intel Gaudi 3 continues this legacy by offering multiple deployment options tailored to enterprise requirements:
Clients seeking isolated, secure environments with high resiliency can provision standalone Intel Gaudi 3 servers within the IBM Cloud Virtual Private Cloud (VPC). The VPC setup enables a fully customizable combination of compute, storage, and networking resources, enhanced by Red Hat Enterprise Linux AI image options. This strategy supports enterprises that require precise control over their AI infrastructure and specialized software stacks.
In Q2 2025, IBM Cloud will offer Intel Gaudi 3 as managed worker nodes integrated into Red Hat OpenShift AI clusters and Red Hat OpenShift on IBM Cloud for organizations adopting containerized infrastructures.
Additionally, clients seeking comprehensive stack management—from infrastructure to AI workloads—can leverage IBM’s watsonx.ai software on Intel Gaudi 3-enabled IBM Cloud VPC virtual servers. IBM watsonx.ai offers an integrated AI studio environment with a robust developer toolkit and comprehensive lifecycle management, facilitating AI development and deployment.
IBM simplifies the rapid adoption of Intel Gaudi 3 capabilities with Deployable Architectures (DAs). These predefined modules enable development and operations teams to expedite deployments and system updates, reducing manual intervention. Planned DAs will support IBM watsonx software, IBM Cloud Virtual Servers for VPC, and Red Hat OpenShift platforms on IBM Cloud, with expected availability in the second half of 2025.
Intel Gaudi 3 AI accelerators on IBM Cloud
Engage with StorageReview
Newsletter | YouTube | Podcast iTunes/Spotify | Instagram | Twitter | TikTok | RSS Feed