Question 1

What is the NVIDIA GB300 NVL72 and what&#x27;s included in a rack?

Accepted Answer

The GB300 NVL72 is a liquid-cooled, rack-scale AI supercomputer built on NVIDIA&#x27;s Blackwell Ultra GPU architecture. A single rack integrates 72 B300 Tensor Core GPUs, 36 NVIDIA Grace CPUs, up to 21TB of HBM3e GPU memory, fifth-generation NVLink (1.8 TB/s per GPU), high-speed NVMe storage, BlueField-3 DPUs, and ConnectX-8 SuperNICs&#x2014;allowing the entire rack to operate as one massive AI compute node.&#xD;&#xA;For a concise summary of nvl72 specs, see the "What is" section above.&#xD;&#xA;

Question 2

How does the NVLink and NVSwitch fabric improve performance and scalability?

Accepted Answer

Fifth-generation NVLink provides up to 1800 GB/s per GPU and full connectivity across all 72 GPUs, enabling the rack to function as a single compute domain. Nine NVSwitch trays deliver 130 TB/s of non-blocking bandwidth for direct GPU-to-GPU communication, eliminating bottlenecks. Together, this architecture powers up to 30x faster trillion-parameter inference and up to 50x performance over prior Hopper-based systems.

Question 3

What networking capabilities does the GB300 NVL72 provide for AI workloads?

Accepted Answer

: East-west (intra-cluster) traffic uses ConnectX-8 SuperNICs with up to 800 Gb/s per GPU and supports RDMA, RoCE, and GPUDirect for ultra-low latency training. North-south (storage/external) traffic is handled by BlueField-3 DPUs, delivering roughly 480 Gb/s throughput with storage acceleration, secure data pipelines, and zero-trust features for efficient, secure data movement.

Question 4

What are the power and cooling requirements, and what options are available?

Accepted Answer

: A GB300 NVL72 rack draws approximately 140--142 kW and relies on direct liquid cooling (DLC) for thermal stability at extreme density. Cooling options include in-rack CDUs (up to 250 kW), in-row CDUs (up to 1.8 MW), and sidecar air--liquid hybrid solutions&#x2014;reducing energy consumption and OPEX while enabling multi-rack scale-out.

Question 5

How does the memory architecture benefit large AI models and HPC workloads?

Accepted Answer

Each B300 GPU offers up to 288GB of HBM3e, totaling about 21TB of GPU memory per rack. This capacity allows extremely large models to be hosted in-memory, accelerating training and inference, minimizing reliance on external storage, and boosting performance for generative AI, LLM training, and HPC simulations.

Question 6

How is the system engineered for enterprise deployment, and how can RackmountNTS help?

Accepted Answer

The rack features 18 compute trays (each with 4 B300 GPUs and 2 Grace CPUs), 8 power shelves (33 kW each), redundancy across subsystems, built-in leakage detection, and enterprise-grade management nodes. It supports pre-integrated networking, AI software stacks, and modular expansion for "plug-and-play" AI factories. RackmountNTS provides custom configurations, GPU integration, networking/storage solutions, and end-to-end deployment and support to help design, deploy, and scale AI data centers.

NVIDIA GB300 NVL72 AI Supercomputer Rack | Rack-Scale GPU Server Solution

NVIDIA GB300 NVL72: The Ultimate Rack-Scale AI Supercomputer for Next-Gen Data Centers

What is NVIDIA GB300 NVL72 ?

Breakthrough Rack-Scale Architecture

Fully Integrated Compute Design

NVLink Fifth-Generation Fabric

9 NVSwitch Trays for Non-Blocking Communication

High-Speed Networking & Data Movement

North-South (Storage & External Data)

Direct Liquid Cooling for Extreme Density

Memory & Performance Advantage

Enterprise-Ready Design & Scalability

Use Cases: Where GB300 NVL72 Excels

Why Choose RackmountNTS for NVIDIA GB300 NVL72?

Conclusion

Partner with RackmountNTS

Power Advanced AI Training and Real-Time Inference

Frequently Asked Questions