NVIDIA data center Available

NVIDIA H200 SXM

H-Series · Hopper Architecture

The NVIDIA H200 is an enhanced version of the H100 featuring 141GB of HBM3e memory with 4.8 TB/s bandwidth — nearly 1.4x the memory capacity and 1.4x the bandwidth of the H100. Designed as a drop-in upgrade for existing H100 SXM systems, it dramatically improves performance for memory-bound LLM inference workloads.

Key Features

141GB HBM3e 4.8 TB/s memory bandwidth NVLink 4.0 Drop-in H100 SXM replacement Optimized for LLM inference

Full Specifications

Compute

Architecture Hopper
Process Node 4nm TSMC
CUDA Cores 16,896
Tensor Cores 528
Base Clock 1095 MHz
Boost Clock 1830 MHz
FP32 Performance 66.91 TFLOPS
FP16 Performance 989.4 TFLOPS
BF16 Performance 989.4 TFLOPS
INT8 Performance 1978.9 TOPS

Memory

Memory Size 141 GB
Memory Type HBM3e
Memory Bus 6144-bit
Memory Bandwidth 4800 GB/s

Power & Physical

TDP 700W
Form Factor SXM5
Power Connectors SXM5 connector

Features & Connectivity

PCIe Version PCIe 5.0
NVLink Support Yes
Multi-GPU Support Yes

Availability

MSRP (USD) Contact for pricing
Release Date Jun 2024
Status Available

Industries

Use Cases

LLM Inference Large Model Training Generative AI Scientific Computing

Interested in the NVIDIA H200 SXM?

Get pricing, availability, and bulk discount information from our team.

Enquire Now

Related GPUs

NVIDIA data center

NVIDIA H100 SXM

Memory

80GB HBM3

FP32

66.91 TFLOPS

TDP

700W

FP16

989.4 TFLOPS

Available View Specs
NVIDIA data center

NVIDIA H100 PCIe

Memory

80GB HBM3

FP32

51.22 TFLOPS

TDP

350W

FP16

756 TFLOPS

Available View Specs
NVIDIA data center

NVIDIA B200

Memory

192GB HBM3e

FP32

90 TFLOPS

TDP

1000W

FP16

1800 TFLOPS

Available View Specs
NVIDIA data center

NVIDIA GB200 NVL72

Memory

192GB HBM3e (per GPU)

FP32

90 TFLOPS

TDP

2700W

FP16

1800 TFLOPS

Available View Specs