As high-performance computing (HPC) and artificial intelligence (AI) applications become more complex, the demand for the most advanced high-speed networking is critical for extreme-scale systems. NVIDIA Quantum-2 is the industry-leading switch platform in power and density, with NDR 400 gigabit per second (Gb/s) InfiniBand throughput that provides AI developers and scientific researchers with the highest networking performance available to take on the world's most challenging problems.
Advanced computing needs advanced networking
The NVIDIA Quantum-2-based QM9700 switch system delivers an unprecedented 64 ports of NDR 400Gb/s InfiniBand per port in a 1U standard chassis design. A single switch carries an aggregated bidirectional throughput of 51.2 terabits per second (Tb/s), with a landmark of more than 66.5 billion packets per second (BPPS) capacity. Supporting the latest NDR technology, NVIDIA Quantum-2 brings a highspeed, extremely low-latency and scalable solution that incorporates state-of-the-art technologies such as Remote Direct Memory Access (RDMA), adaptive routing, and NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP). Unlike any other networking solution, NVIDIA InfiniBand provides self-healing network capabilities, as well as quality of service (QoS), enhanced virtual lane (VL) mapping, and congestion control to provide the highest overall application throughput. As an ideal rackmounted InfiniBand solution, the QM9700 InfiniBand fixed-configuration switch allows maximum flexibility, as it enables a variety of topologies, including Fat Tree, SlimFly, DragonFly+, multi-dimensional Torus, and more. It is also backwards compatible to previous generations and includes expansive software ecosystem support.
The era of data-driven computing
Today's complex research demands ultra-fast processing of high-resolution simulations, extreme-size datasets, and complex, highly parallelized algorithms that need to exchange information in real time. The QM9700 NDR InfiniBand switch extends NVIDIA In-Network Computing technologies and introduce the third generation of NVIDIA SHARP technology, SHARPv3. Creating virtually unlimited scalability for large data aggregation through the network, SHARPv3 enables support for up to 64 parallel flows - 32X higher AI acceleration power compared to the previous generation. SHARPv3 dramatically boosts application performance of complex computations while data moves through the data center network, participating in the application's runtime and reducing the amount of data needed to traverse the network.
Streamlining network design and topologies
By implementing NVIDIA port-split technology, the QM9700 switch provides a double-density radix for 200Gb/s (NDR200) data speeds, reducing the cost of network design and network topologies. Supporting up to 128 ports of 200Gb/s, NVIDIA delivers the densest top-of-rack (TOR) switch available on the market. The QM9700 family of switches enables small to medium-sized deployments to scale with a two-level Fat Tree topology while reducing power, latency, and space requirements.
Enhanced management
The internally managed QM9700 switch features an on-board subnet manager that enables simple, out-of-the-box bringup for up to 2,000 nodes. Running the NVIDIA MLNX-OS software package, the subnet manager delivers full chassis management through command-line interface (CLI), web-based user (WebUI), Simple Network Management Protocol (SNMP), or JavaScript Object Notation (JSON) interfaces.