Monday, October 11, 2021

Megatron-Turing NLG 530B, the World’s Largest Generative Language Model

Article URL: https://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/

Comments URL: https://news.ycombinator.com/item?id=28831917

Points: 2

# Comments: 0



from Hacker News: Newest https://ift.tt/3DprzKa
via IFTTT

No comments:

Post a Comment

Acid attack victim was 'set up by his ex-wife'

A court hears Danny Cahalane, 38, faced "real threats" in the months before his death. from BBC News https://ift.tt/3cjZxC6 via...