diff --git a/docs/guides/grpo.md b/docs/guides/grpo.md
index 8d93fb64f7..6ace84876d 100644
--- a/docs/guides/grpo.md
+++ b/docs/guides/grpo.md
@@ -10,6 +10,8 @@ uv run examples/run_grpo_math.py --config {overrides}
 ```
 If not specified, `config` will default to [examples/configs/grpo.yaml](../../examples/configs/grpo.yaml)
 
+**Reminder**: Don't forget to set your HF_HOME and WANDB_API_KEY (if needed). You'll need to do a `huggingface-cli login` as well for Llama models.
+
 ## Now, for the details:
 
 In this guide, we'll walk through we handle
diff --git a/docs/guides/sft.md b/docs/guides/sft.md
index d994967bd2..4d452b109d 100644
--- a/docs/guides/sft.md
+++ b/docs/guides/sft.md
@@ -21,6 +21,7 @@ uv run examples/run_sft.py \
     cluster.gpus_per_node=1 \
     logger.wandb.name="sft-dev-1-gpu"
 ```
+**Reminder**: Don't forget to set your HF_HOME and WANDB_API_KEY (if needed). You'll need to do a `huggingface-cli login` as well for Llama models.
 
 ## Datasets