Today, we’re announcing the general availability of Amazon Elastic Compute Cloud (Amazon EC2) P6-B200 instances powered by NVIDIA B200 to address customer needs for high performance and scalability in artificial intelligence (AI), machine learning (ML), and high performance computing (HPC) applications.
Amazon EC2 P6-B200 instances accelerate a broad range of GPU-enabled workloads but are especially well-suited for large-scale distributed AI training and inferencing for foundation models (FMs) with reinforcement learning (RL) and distillation, multimodal training and inference, and HPC applications such as climate modeling, drug discovery, seismic analysis, and insurance risk modeling.
When combined with Elastic Fabric Adapter (EFAv4) networking, hyperscale clustering by EC2 UltraClusters, and advanced virtualization and security capabilities by AWS Nitro System, you can train and serve FMs with increased speed, scale, and security. These instances also deliver up to two times the performance for AI training (time to train) and inference (tokens/sec) compared to EC2 P5en instances.
You can accelerate time-to-market for training FMs and deliver faster inference throughput, which lowers inference cost and helps increase adoption of generative AI applications. HPC applications also benefit from increased processing performance.
EC2 P6-B200 instances specifications
New EC2 P6-B200 instances provide eight NVIDIA B200 GPUs with 1440 GB of high bandwidth GPU memory, 5th Generation Intel Xeon Scalable processors (Emerald Rapids), 2 TiB of system memory, and 30 TB of local NVMe storage.
Here are the specs for EC2 P6-B200 instances:
Instance size | GPUs (NVIDIA B200) | GPU memory (GB) | vCPUs | GPU peer to peer (GB/s) | Instance storage (TB) | Network bandwidth (Gbps) | EBS bandwidth (Gbps) |
p6-b200.48xlarge | 8 | 1440 HBM3e | 192 | 1800 | 8 x 3.84 NVMe SSD | 8 x 400 | 100 |
These instances feature up to 125 percent improvement in GPU TFLOPs, 27 percent increase in GPU memory size, and 60 percent increase in GPU memory bandwidth compared to P5en instances.
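The aggregate figures in the table can be derived from its per-device entries. The following is a minimal sketch of that arithmetic in Python. The dictionary keys and variable names are illustrative only and are not part of any AWS API, and reading "8 x 400" as eight 400 Gbps links is an assumption based on the table's notation.

```python
# Values copied from the p6-b200.48xlarge row of the specification table.
spec = {
    "gpus": 8,
    "gpu_memory_gb": 1440,     # total HBM3e across all GPUs
    "nvme_drives": 8,
    "nvme_drive_tb": 3.84,     # capacity of each local NVMe SSD
    "network_links": 8,        # assumption: "8 x 400" means 8 links
    "network_link_gbps": 400,  # bandwidth of each link
}

# Per-GPU memory: total GPU memory divided by the GPU count.
memory_per_gpu_gb = spec["gpu_memory_gb"] / spec["gpus"]

# Total local storage: number of drives times capacity per drive.
total_storage_tb = spec["nvme_drives"] * spec["nvme_drive_tb"]

# Total network bandwidth: number of links times bandwidth per link.
total_network_gbps = spec["network_links"] * spec["network_link_gbps"]

print(f"GPU memory per GPU: {memory_per_gpu_gb:.0f} GB")
print(f"Local NVMe storage: {total_storage_tb:.2f} TB")
print(f"Network bandwidth:  {total_network_gbps} Gbps")
```

The storage total works out to slightly more than the rounded 30 TB quoted in the text.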
P6-B200 instances in action
You can use P6-B200 instances in the US West (Oregon) AWS Region through EC2 Capacity Blocks for ML. To reserve your EC2 Capacity Blocks, choose Capacity Reservations on the Amazon EC2 console.
Select Purchase Capacity Blocks for ML and then choose your total capacity and specify how long you need the EC2 Capacity Block for p6-b200.48xlarge instances. The total number of days that you can reserve EC2 Capacity Blocks is 1-14 days, 21 days, 28 days, or multiples of 7 up to 182 days. You can choose your earliest start date for up to 8 weeks in advance.
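The duration and start-date rules above can be checked locally before you open the console. This is a minimal sketch, assuming the limits are inclusive at both ends. The function names are hypothetical helpers, not part of the AWS CLI or SDKs, and the actual service may enforce additional constraints.

```python
from datetime import date, timedelta

MAX_DURATION_DAYS = 182             # longest reservation mentioned in the post
MAX_LEAD_TIME = timedelta(weeks=8)  # earliest start can be up to 8 weeks out


def is_valid_duration(days: int) -> bool:
    """Return True for 1-14 days, or any multiple of 7 up to 182 days."""
    if 1 <= days <= 14:
        return True
    # 21 and 28 days are themselves multiples of 7, so one check covers them.
    return days % 7 == 0 and 14 < days <= MAX_DURATION_DAYS


def is_valid_start(start: date, today: date) -> bool:
    """Return True if start is not in the past and at most 8 weeks ahead."""
    return today <= start <= today + MAX_LEAD_TIME


if __name__ == "__main__":
    for days in (10, 15, 21, 35, 182, 189):
        print(days, is_valid_duration(days))
```

Treating 21 and 28 days as ordinary multiples of 7 keeps the rule to a single modulo check.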
Now, your EC2 Capacity Block will be scheduled successfully. The total price of an EC2 Capacity Block is charged up front, and the price doesn’t change after purchase. The payment will be billed to your account within 12 hours after you purchase the EC2 Capacity Blocks. To learn more, visit Capacity Blocks for ML in the Amazon EC2 User Guide.
When launching P6-B200 instances, you can use AWS Deep Learning AMIs (DLAMI), which support EC2 P6-B200 instances. DLAMI provides ML practitioners and researchers with the infrastructure and tools to quickly build scalable, secure, distributed ML applications in preconfigured environments.
To run instances, you can use AWS Management Console, AWS Command Line Interface (AWS CLI) or AWS SDKs.
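As one illustration of the SDK route, the sketch below assembles a request for the EC2 RunInstances API using the AWS SDK for Python (boto3) and targets a previously purchased Capacity Block. It is a sketch under stated assumptions rather than a definitive launch procedure: the AMI ID and Capacity Reservation ID are placeholders, the helper names are hypothetical, and a real launch typically also needs networking, key pair, and IAM settings that are omitted here.

```python
def build_run_instances_params(image_id: str, reservation_id: str, count: int = 1) -> dict:
    """Assemble keyword arguments for the EC2 RunInstances API."""
    return {
        "ImageId": image_id,
        "InstanceType": "p6-b200.48xlarge",
        "MinCount": count,
        "MaxCount": count,
        # Capacity Blocks are launched with the 'capacity-block' market type.
        "InstanceMarketOptions": {"MarketType": "capacity-block"},
        # Target the reservation created by the Capacity Block purchase.
        "CapacityReservationSpecification": {
            "CapacityReservationTarget": {"CapacityReservationId": reservation_id}
        },
    }


def launch(params: dict, region: str = "us-west-2"):
    """Call RunInstances; requires boto3 and valid AWS credentials."""
    import boto3  # imported lazily so the helper above works without boto3

    ec2 = boto3.client("ec2", region_name=region)  # us-west-2 is US West (Oregon)
    return ec2.run_instances(**params)


# Placeholder IDs for illustration only; substitute your own values.
params = build_run_instances_params("ami-EXAMPLE", "cr-EXAMPLE")
print(params["InstanceType"], params["InstanceMarketOptions"]["MarketType"])
```

Separating the parameter builder from the API call lets you inspect or log the request before anything is launched.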
You can integrate EC2 P6-B200 instances seamlessly with various AWS managed services such as Amazon Elastic Kubernetes Service (Amazon EKS), Amazon Simple Storage Service (Amazon S3), and Amazon FSx for Lustre. Support for Amazon SageMaker HyperPod is also coming soon.
Now available
Amazon EC2 P6-B200 instances are available today in the US West (Oregon) Region and can be purchased as EC2 Capacity Blocks for ML.
Give Amazon EC2 P6-B200 instances a try in the Amazon EC2 console. To learn more, refer to the Amazon EC2 P6 instance page and send feedback to AWS re:Post for EC2 or through your usual AWS Support contacts.
— Channy