LLM
44
基础知识
17
阅读笔记:Agentic RL 时代的 Infra 重构(Forge、ROLL、Seer、Slime)
【论文阅读】SWE-bench Goes Live!
【论文阅读】Search-R1:Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
【论文阅读】GLM-5:from Vibe Coding to Agentic Engineering
【论文阅读】ByteScale:Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000
【论文阅读】ScheMoE:An Extensible Mixture-of-Experts Distributed Training System with Tasks Scheduling
【论文阅读】The Llama 3 Herd of Models(Section 3 Pre-Training)
【论文阅读】Rail-only:A Low-Cost High-Performance Network for Training LLMs with Trillion Parameters
【论文阅读】Reducing Activation Recomputation in Large Transformer Models
深度学习中反向传播及优化器使用详解
More...