Content by Valerie Cutts and Jithin Jose
Valerie Cutts and Jithin Jose explain how Azure’s Fairwater AI supercomputer network is designed to keep large synchronous training jobs running through routine faults, using Multipath Reliable Connection (MRC), a two-tier multi-plane topology, and static SRv6 source routing.