包含标签 "nanobot" 的所有文章
共 6 篇文章
nanobot
rl
reinforcement learning
mid_train
pre-train
checkpoint_manager
gpt
sft
supervised fine-tuning