LLM后训练基础

Post-Training lesson from DeepLearning.AI Intro Pre-training:喂数据,让模型学会 predict next token,输出Base Model Post-training:让模型学会 chat,或完成指...

February 18, 2026 · 3 min · 1462 words · KAI