📚 模块9：资源管理和常见问题

即使使用QLoRA，在Colab中仍可能出现内存耗尽。策略：

从1或2开始。通过gradient_accumulation_steps进行补偿。

如果内容允许，从512降低到256或384。

model = torch.compile(model)

可能加速训练并减少内存，但并不总是稳定。

torch.cuda.empty_cache()

在加载模型后或实验之间很有用。

如果使用trust_remote_code=True加载或使用PEFT，这是正常的。不严重。

在TrainingArguments中使用optim="adamw_bnb_8bit"或optim="paged_adamw_8bit"。

忽略。Trainer会自动处理模式。

← Module8

Course: AI-course3

Language: ZH

Lesson: Module9