May 21, 2024


Microsoft is dedicated to the accountable development of AI to allow each particular person and group to attain extra. Over the previous couple of months, we have now talked about developments in our Azure infrastructure, Azure Cognitive Companies, and Azure Machine Studying to make Azure higher at supporting the AI wants of all our prospects, no matter their scale. In the meantime, we additionally work carefully with a number of the main analysis organizations world wide to empower them to construct nice AI.

At the moment, we’re thrilled to announce an growth of our ongoing collaboration with Meta: Meta has chosen Azure as a strategic cloud supplier to assist speed up AI analysis and growth. 

As a part of this deeper relationship, Meta will increase its use of Azure’s supercomputing energy to speed up AI analysis and growth for its Meta AI group. Meta will make the most of a devoted Azure cluster of 5400 GPUs utilizing the newest digital machine (VM) collection in Azure (NDm A100 v4 collection, that includes NVIDIA A100 Tensor Core 80GB GPUs) for a few of their large-scale AI analysis workloads. In 2021, Meta started utilizing Microsoft Azure Digital Machines (NVIDIA A100 80GB GPUs) for a few of its large-scale AI analysis after experiencing Azure’s spectacular efficiency and scale. With 4 occasions the GPU-to-GPU bandwidth between digital machines in comparison with different public cloud choices, the Azure platform allows quicker distributed AI coaching. Meta used this, for instance, to coach their latest OPT-175B language mannequin. The NDm A100 v4 VM collection on Azure additionally provides prospects the flexibleness to configure clusters of any measurement robotically and dynamically from a number of GPUs to hundreds, and the power to pause and resume throughout experimentation. Now, the Meta AI crew is increasing their utilization and bringing extra cutting-edge machine studying coaching workloads to Azure to assist additional advance their main AI analysis.

As well as, Meta and Microsoft will collaborate to scale PyTorch adoption on Azure and speed up builders’ journey from experimentation to manufacturing. Azure supplies a complete prime to backside stack for PyTorch customers with best-in-class (NDv4s and Infiniband). Within the coming months, Microsoft will construct new PyTorch growth accelerators to facilitate speedy implementation of PyTorch-based options on Azure. Microsoft may also proceed offering enterprise-grade help for PyTorch to allow prospects and companions to deploy PyTorch fashions in manufacturing on each cloud and edge.

We’re excited to deepen our collaboration with Azure to advance Meta’s AI analysis, innovation, and open-source efforts in a manner that advantages extra builders world wide,” Jerome Pesenti, Vice President of AI, Meta. “With Azure’s compute energy and 1.6 TB/s of interconnect bandwidth per VM we’re in a position to speed up our ever-growing coaching calls for to higher accommodate bigger and extra progressive AI fashions. Moreover, we’re blissful to work with Microsoft in extending our expertise to their prospects utilizing PyTorch of their journey from analysis to manufacturing.”

By scaling Azure’s supercomputing energy to coach massive AI fashions for the world’s main analysis organizations, and by increasing instruments and assets for open supply collaboration and experimentation, we might help unlock new alternatives for builders and the broader tech neighborhood, and additional our mission to empower each particular person and group world wide.


Source link