gpuhardware

Tensor Cores

Specs

| Spec | NVIDIA B200 | NVIDIA B100 | NVIDIA H200 | NVIDIA H100 | NVIDIA A100 |
| --- | --- | --- | --- | --- | --- |
| Architecture | Blackwell | Blackwell | Hopper | Hopper | Ampere |
| FP64 | 40 teraFLOPS | 30 teraFLOPS | 34 teraFLOPS | 34 teraFLOPS | 9.7 teraFLOPS |
| FP64 Tensor Core | 40 teraFLOPS | 30 teraFLOPS | 67 teraFLOPS | 67 teraFLOPS | 19.5 teraFLOPS |
| FP32 | 80 teraFLOPS | 60 teraFLOPS | 67 teraFLOPS | 67 teraFLOPS | 19.5 teraFLOPS |
| FP32 Tensor Core | 2.2 petaFLOPS | 1.8 petaFLOPS | 989 teraFLOPS | 989 teraFLOPS | 312 teraFLOPS |
| FP16/BF16 Tensor Core | 4.5 petaFLOPS | 3.5 petaFLOPS | 1979 teraFLOPS | 1979 teraFLOPS | 624 teraFLOPS |
| INT8 Tensor Core | 9 petaOPS | 7 petaOPS | 3958 teraOPS | 3958 teraOPS | 1248 teraOPS |
| FP8 Tensor Core | 9 petaFLOPS | 7 petaFLOPS | 3958 teraFLOPS | 3958 teraFLOPS | - |
| FP4 Tensor Core | 18 petaFLOPS | 14 petaFLOPS | - | - | - |
| GPU Memory | 192GB HBM3e | 192GB HBM3e | 141GB HBM3e | 80GB HBM3 | 80GB HBM2e |
| Memory Bandwidth | Up to 8TB/s | Up to 8TB/s | 4.8TB/s | 3.2TB/s | 2TB/s |
| Multi-Instance GPUs | Up to 7 MIGs @23GB | Up to 7 MIGs @23GB | Up to 7 MIGs @16.5GB | Up to 7 MIGs @16.5GB | Up to 7 MIGs @10GB |
| Interconnect | NVLink 1.8TB/s | NVLink 1.8TB/s | NVLink 900GB/s | NVLink 900GB/s | NVLink 600GB/s |