【论文阅读】{MegaScale}:Scaling Large Language Model Training to More Than 10,000 {GPUs}
论文基础信息论文地址: {MegaScale}: Scaling Large Language Model Training to More Than 10,000 {GPUs} 收录会议: 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24)(CCF-A,计算机网络顶级会议) 作者机构: 字节