Tag - LLM - 滑滑蛋's Personal Blog

01-12

【Nano-vLLM Source Code Analysis (2)】Key Class Implementations

01-10

【Nano-vLLM Source Code Analysis (1)】Environment Setup and Overall Workflow Overview

01-09

【Megatron-LM Source Code Analysis (6)】Pipeline Parallelism - 1F1B

01-09

【Paper Reading】Efficient Memory Management for Large Language Model Serving with PagedAttention (vLLM paper)

01-08

【Megatron-LM Source Code Analysis (5)】Tensor Parallelism

12-28

【Paper Reading】ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000

12-28

【Megatron-LM Source Code Analysis (4)】DDP Data Parallelism

12-26

【Megatron-LM Source Code Analysis (3)】Performance Analysis

12-22

【Paper Reading】ScheMoE: An Extensible Mixture-of-Experts Distributed Training System with Tasks Scheduling

12-22

【Megatron-LM Source Code Analysis (2)】GPT Model Pretraining Flow