Paul Krill
Editor at Large

Nvidia H100 Tensor Core GPUs come to Oracle Cloud

news
Sep 21, 20232 mins

Oracle Cloud now offers bare-metal instances with Nvidia H100 GPUs for AI training and inference and high-performance computing applications. Nvidia L40S GPUs are on the way.

AI on chip
Credit: Getty Images

In response to growing demand for generative AI applications and large language models (LLM), Oracle Cloud Infrastructure (OCI) has made Nvidia H100 Tensor Core GPUs available on the OCI Compute platform. Nvidia L40S GPUs also will be coming to the platform soon.

Oracle said OCI Compute now offers bare-metal instances with Nvidia H100 GPUs, powered by the Nvidia Hopper architecture for AI, thus enabling an “order-of-magnitude performance leap” for large-scale AI and high-performance computing applications. The Nvidia H100 GPU is designed for resource-intensive computing tasks, including training LLM models.

Organizations using Nvidia H100 GPUs obtain as much as a 30x increase in AI inference performance and a 4x boost in AI training compared with tapping Nvidia A100 Tensor Core GPUs, Oracle said. The BM.GPU H100.8 OCI Compute shape includes eight Nvidia H100 GPUs, each with 80GB of HBM2 GPU memory.

OCI Compute bare-metal instances with Nvidia L40S GPUs will be available for early access later this year, with general availability early in 2024. Nvidia L40S GPUs, based on the Nvidia Ada Lovelace architecture for graphics, AI, and gaming, serve as a universal GPU for the data center, providing multi-workload acceleration for LLM inference and training, visual computing, and video applications.

Paul Krill

Paul Krill is editor at large at InfoWorld. Paul has been covering computer technology as a news and feature reporter for more than 35 years, including 30 years at InfoWorld. He has specialized in coverage of software development tools and technologies since the 1990s, and he continues to lead InfoWorld’s news coverage of software development platforms including Java and .NET and programming languages including JavaScript, TypeScript, PHP, Python, Ruby, Rust, and Go. Long trusted as a reporter who prioritizes accuracy, integrity, and the best interests of readers, Paul is sought out by technology companies and industry organizations who want to reach InfoWorld’s audience of software developers and other information technology professionals. Paul has won a “Best Technology News Coverage” award from IDG.

More from this author