Join prime executives in San Francisco on July 11-12, to listen to how leaders are integrating and optimizing AI investments for fulfillment. Learn More
At this yr’s GPU Technology Conference (GTC), Nvidia continued its AI {hardware} push with a particular give attention to making its expertise extra accessible to enterprises throughout industries and streamlining the event of generative AI functions like ChatGPT.
The following is a every day recap of main bulletins that the Santa Clara, California-based firm made with hyperlinks to in-depth protection.
Rent AI supercomputing infrastructure with DGX Cloud
While Nvidia has been constructing {hardware} for AI for fairly a while, the expertise has taken a while to see mass adoption — partly owing to excessive prices. Back in 2020, its DGX A100 server field was bought for $199,000. To change this, the corporate at present introduced DGX Cloud, a service that may permit enterprises to entry its AI supercomputing infrastructure and software program via an internet browser.
>>Follow VentureBeat’s ongoing Nvidia GTC spring 2023 protection<<
Event
Transform 2023
Join us in San Francisco on July 11-12, the place prime executives will share how they’ve built-in and optimized AI investments for fulfillment and prevented frequent pitfalls.
DGX Cloud rents DGX Server packing containers, every with eight Nvidia H100 or A100 GPUs and 640GB of reminiscence, and prices $36,999 a month for a single node.
Leveraging the facility of DGX Cloud, the corporate additionally introduced the launch of AI Foundations to assist enterprises create and use customized generative AI fashions. The providing, Nvidia stated, gives three cloud companies: Nvidia NeMo for big language fashions (LLMs), Nvidia Picasso for picture, video and 3D functions, and BioNeMO to generate scientific texts based mostly on organic knowledge.
New {hardware} for AI inference and proposals
Alongside DGX and AI Foundations, Nvidia additionally debuted 4 inference platforms designed to assist builders shortly construct specialised generative AI functions. This consists of Nvidia L4 for producing AI video; Nvidia L40 for 2D/3D picture technology; Nvidia H100 NVL for deploying massive language fashions; and Nvidia Grace Hopper — which connects the Grace CPU and Hopper GPU over a high-speed 900GB/sec coherent chip-to-chip interface — for advice techniques constructed on large datasets.
The firm says L4 can ship 120x extra AI-powered video efficiency than CPUs, mixed with 99% higher vitality effectivity; whereas L40 serves because the engine of Omniverse, delivering 7x the inference efficiency for Stable Diffusion and 12x Omniverse efficiency over the earlier technology.
Chipmakers get cuLitho at Nvidia GTC
At the occasion, Nvidia CEO Jenson Huang took the stage to announce Nvidia cuLitho software program library for computational lithography. The providing, as Huang defined, will allow semiconductor enterprises to design and develop chips with ultrasmall transistors and wires whereas accelerating time to market and boosting the vitality effectivity of the large knowledge facilities that run 24/7 to drive the semiconductor manufacturing course of.
“The chip industry is the foundation of nearly every other industry in the world,” stated Huang. “With lithography at the limits of physics, NVIDIA’s introduction of cuLitho and collaboration with our partners TSMC, ASML and Synopsys allows fabs to increase throughput, reduce their carbon footprint and set the foundation for 2nm and beyond.”
Finally, the corporate additionally introduced partnerships with Medtronic and Microsoft. The former, it stated, will result in the event of a typical AI platform for software-defined medical gadgets able to bettering affected person care. Meanwhile, the latter will see Microsoft Azure host Nvidia Omniverse and Nvidia DGX Cloud.
The 2023 Nvidia GTC occasion runs via March 23.
VentureBeat’s mission is to be a digital city sq. for technical decision-makers to realize information about transformative enterprise expertise and transact. Discover our Briefings.