NVIDIA Introduces NVSHMEM 3.0 with Enhanced GPU Communication Features

Jessie A Ellis
Sep 07, 2024 08:39

NVIDIA’s NVSHMEM 3.0 affords multi-node help, ABI backward compatibility, and CPU-assisted InfiniBand GPU Direct Async, enhancing GPU communication.

NVIDIA has introduced the discharge of NVSHMEM 3.0, the most recent model of its parallel programming interface designed to facilitate environment friendly and scalable communication for NVIDIA GPU clusters. This replace, a part of NVIDIA Magnum IO and primarily based on OpenSHMEM, goals to reinforce software portability and compatibility throughout varied platforms, in accordance with the NVIDIA Technical Weblog.

New Options and Interface Help

NVSHMEM 3.0 introduces a number of new options, together with multi-node, multi-interconnect help, host-device ABI backward compatibility, and CPU-assisted InfiniBand GPU Direct Async (IBGDA).

Multi-Node, Multi-Interconnect Help

The brand new model helps connectivity between a number of GPUs inside a node over P2P interconnects, equivalent to NVIDIA NVLink/PCIe, and throughout nodes utilizing RDMA interconnects like InfiniBand and RDMA over Converged Ethernet (RoCE). This enhancement contains platform help for a number of racks of NVIDIA GB200 NVL72 techniques related by way of RDMA networks.

Host-Machine ABI Backward Compatibility

NVSHMEM 3.0 introduces backward compatibility throughout minor variations, permitting functions linked to an older model of NVSHMEM to run on techniques with newer variations. This function facilitates smoother updates and reduces the necessity for recompiling functions with every new launch.

CPU-Assisted InfiniBand GPU Direct Async

The newest launch additionally helps CPU-assisted IBGDA, which divides management aircraft tasks between the GPU and CPU. This method helps enhance IBGDA adoption on non-coherent platforms and relaxes administrative-level configuration constraints in large-scale clusters.

Non-Interface Help and Minor Enhancements

NVSHMEM 3.0 contains minor enhancements and non-interface help, equivalent to:

Object-Oriented Programming Framework for Symmetric Heap

This model introduces an object-oriented programming (OOP) framework to handle totally different sorts of symmetric heaps, together with static and dynamic system reminiscence. The OOP framework simplifies the extension to superior options and improves information encapsulation.

Efficiency Enhancements and Bug Fixes

NVSHMEM 3.0 brings varied efficiency enhancements and bug fixes, together with enhancements in IBGDA setup, block-scoped on-device reductions, system-scoped atomic reminiscence operation (AMO), and group administration.

Abstract

The discharge of NVSHMEM 3.0 marks a big improve in NVIDIA’s parallel programming interface. Key options equivalent to multi-node multi-interconnect help, host-device ABI backward compatibility, and CPU-assisted IBGDA goal to reinforce GPU communication and software portability. Directors and builders can now replace to newer variations of NVSHMEM with out disrupting current functions, making certain smoother transitions and higher efficiency in large-scale GPU clusters.

Picture supply: Shutterstock

What's Hot

ANDREW TATES MOST ICONIC INTERVIEW ON MONEY, MASCULINITY & WOMEN ft LAYAH HEILPERN

Solana (SOL) Could Crash 40% If It Stays Below $140 – Top Analyst Shares Insights

Ethereum faces Fed liquidity suck, ETFs affected too: What now?

NVIDIA Introduces NVSHMEM 3.0 with Enhanced GPU Communication Features

Runway Partners with Lionsgate to Revolutionize AI in Film Production

SonicJobs Leverages AI to Revolutionize Job Application Processes

Bitcoin (BTC) Market Faces Stagnation Amid Low Capital Flows

Tether (USDT) Strengthens Financial Transparency and Law Enforcement Collaboration

Binance Offers Up to 104 USDC in Exclusive Promotion for New and Existing Users

Binance Introduces New Loanable Assets on Flexible Rate and VIP Loan Services

ANDREW TATES MOST ICONIC INTERVIEW ON MONEY, MASCULINITY & WOMEN ft LAYAH HEILPERN

Solana (SOL) Could Crash 40% If It Stays Below $140 – Top Analyst Shares Insights

Ethereum faces Fed liquidity suck, ETFs affected too: What now?

PancakeSwap Launches First IFO on Arbitrum Featuring Eigenpie

Pomp Podcast #439: Brad Garlinghouse on Crypto Regulation

Content

Market Tools

COMPANY

Connect

What's Hot

NVIDIA Introduces NVSHMEM 3.0 with Enhanced GPU Communication Features

New Options and Interface Help

Multi-Node, Multi-Interconnect Help

Host-Machine ABI Backward Compatibility

CPU-Assisted InfiniBand GPU Direct Async

Non-Interface Help and Minor Enhancements

Object-Oriented Programming Framework for Symmetric Heap

Efficiency Enhancements and Bug Fixes

Abstract

Keep Reading

Content

Market Tools

COMPANY

Connect