Oracle unveils new cloud infrastructure boosted by NVIDIA and Ampere hardware
Wed 20 Sep 2023
Oracle has announced the upcoming availability for new Oracle Cloud Infrastructure (OCI) Compute instances powered by NVIDIA and Ampere hardware.
The new OCI Compute instances aim to make running a variety of workloads — from training, fine-tuning, and AI model inferencing to running cloud-native applications and video transcoding applications — in the cloud more accessible to organisations.
“OCI was one of the first cloud providers to offer bare metal instances natively, which is a key part of our ability to make high-performance computing more accessible to organisations everywhere,” said Donald Lu, Senior Vice President for Software Development at Oracle Cloud Infrastructure.
Oracle and NVIDIA Target AI Workloads
The upcoming OCI Compute instances based on next-generation NVIDIA GPUs will include bare metal instances powered by NVIDIA H100 GPUs and NVIDIA L40S GPUs.
“The collaboration between NVIDIA and Oracle is helping democratise access to cutting-edge GPUs on Oracle Cloud Infrastructure,” said Ian Buck, Vice President of Hyperscale and High Performance Computing at NVIDIA.
Organisations using NVIDIA H100 Tensor Core GPUs have seen as much as a 30x improvement in performance for AI inference use cases and 4x better performance training AI models compared to using the previous generation of NVIDIA A100 Tensor Core GPUs. These instances are planned to be generally available in the Oracle Cloud London Region and Oracle Cloud Chicago Region later this year, with others expected to follow.
“The upcoming OCI Compute instances, powered by NVIDIA GPUs, will give us the power we need to train and serve the next generation of industry-leading Cohere enterprise AI models,” said Martin Kon, President and COO of Cohere.
The NVIDIA L40S GPUs provide an alternative option for workloads such as AI inferencing or training small to medium AI models. These instances have been tested to deliver up to a 20% improvement in performance for generative AI workloads and up to a 70% improvement in fine-tuning models over the previous generation of NVIDIA A100 GPUs. The instances are planned to be available within the next year.
“Training large language models on the MosaicML Platform requires thousands of NVIDIA GPUs running on OCI’s bare metal compute instances, which leverage high-performance storage and ultrafast cluster networking,” said Naveen Rao, Vice President of Generative AI at Databricks.
Oracle and Ampere Instances to Reduce Data Centre Footprint
The upcoming OCI Compute Instances based on Ampere Computing CPUs will include instances powered by Ampere AmpereOne CPUs. These are expected to deliver the highest available processor core count in the industry at 320 cores in the bare metal shape and up to 156 cores in the flexible VM shape.
“Oracle was the first cloud services provider to globally deploy compute instances based on Ampere processors,” said Jeff Wittich, Chief Product Officer at Ampere Computing.
The high core count available in these instances can support increased levels of performance, virtual machine density, and scaling while reducing data centre footprint and power consumption. These instances are planned to be coming next year.
“Uber is revolutionising the way people and things move around cities. As part of a multicloud architecture, we leverage Oracle Cloud Infrastructure for critical workloads. We use Standard and Dense I/O instances based on AMD processors, and plan to use OCI Compute with NVIDIA GPUs in the future,” said Kamran Zargahi, Senior Director of Tech Strategy at Uber.
Hungry for more tech news?
Sign up for your weekly tech briefings!