SFT Training update tutorials #769

tengomucho · 2025-01-28T15:03:26Z

What does this PR do?

This PR revisits the SFT Training tutorial of Llama3-8B. Few highlights of the changes:

correct scripts, remove references to unused configurations and tools,
update TOC,
update wording, compile script and separate shell script to launch training,
add model merge after consolidation, to obtain a model that can be loaded for evaluation,
add dependencies on trl and peft.

Note that there is still an issue with the training: the loss does NOT decrease during fine-tune (cc @michaelbenayoun).

Before submitting

This PR fixes a typo or improves the docs

dacorvo

Thansk you for the pull-request. Looks good to me except for the dependencies.

dacorvo · 2025-01-28T15:15:14Z

setup.py

@@ -19,6 +19,8 @@
    "huggingface_hub >= 0.20.1",
    "numpy>=1.22.2, <=1.25.2",
    "protobuf>=3.20.3, <4",
+    "trl == 0.11.4",


These are the global, bare minimum dependencies: we should not add optional components here.

HuggingFaceDocBuilderDev · 2025-01-28T15:21:38Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

michaelbenayoun · 2025-01-29T09:53:18Z

docs/source/training_tutorials/sft_lora_finetune_llm.mdx

+5. Make sure you have the `training` extra installed, to get all the necessary dependencies:
+```bash
+python -m pip install .[training]
+```


michaelbenayoun · 2025-01-29T09:53:55Z

docs/source/training_tutorials/sft_lora_finetune_llm.mdx

@@ -63,7 +73,7 @@ Example:
  "context": "",
  "response": (
        "World of warcraft is a massive online multi player role playing game. "
-        "It was released in 2004 by blizarre entertainment"
+        "It was released in 2004 by bizarre entertainment"


Suggested change

"It was released in 2004 by bizarre entertainment"

"It was released in 2004 by Blizzard Entertainment"

Nope, that is what is actually in the Dolly dataset! See here 🤷

As a former big player of WoW I feel attacked.

michaelbenayoun · 2025-01-29T09:57:15Z

docs/source/training_tutorials/sft_lora_finetune_llm.sh

+  --gradient_accumulation_steps $GRADIENT_ACCUMULATION_STEPS \
+  --gradient_checkpointing true \
+  --bf16 \
+  --zero_1 false \


Can be removed

michaelbenayoun · 2025-01-29T09:57:48Z

setup.py

@@ -50,9 +50,15 @@
    "hf_doc_builder @ git+https://github.com/huggingface/doc-builder.git",
 ]

+TRAINING_REQUIRES = [
+    "trl == 0.11.4",
+    "peft == 0.14.0",


Can we add neuronx_distributed as well?

Slurm can be used for multi-node training, but not relevant in the tutorial, so it is removed.

The shell script is to launch precompilation and fine-tuning.

- reword inference hw suggestion - adapt code to use the merged model directory

Peft was already in the AMI, but hte version was not fixed, and TRL was not installed, but it is required for the SFT LoRA fine tune LLM tutorial.

Instead of adding peft and trl as package dependencies, these are isolated in the training extra.

michaelbenayoun

LGTM!
That's awesome, thanks for taking care of that!

michaelbenayoun · 2025-01-29T14:56:43Z

docs/source/training_tutorials/sft_lora_finetune_llm.mdx

@@ -63,7 +73,7 @@ Example:
  "context": "",
  "response": (
        "World of warcraft is a massive online multi player role playing game. "
-        "It was released in 2004 by blizarre entertainment"
+        "It was released in 2004 by bizarre entertainment"


As a former big player of WoW I feel attacked.

tengomucho requested review from michaelbenayoun, JingyaHuang and dacorvo January 28, 2025 15:04

tengomucho force-pushed the training-update-tutorials branch from 81dd858 to 64a9623 Compare January 28, 2025 15:08

dacorvo requested changes Jan 28, 2025

View reviewed changes

dacorvo approved these changes Jan 28, 2025

View reviewed changes

michaelbenayoun reviewed Jan 29, 2025

View reviewed changes

tengomucho requested a review from michaelbenayoun January 29, 2025 10:15

tengomucho added 14 commits January 29, 2025 12:45

fix(doc): remove reference to slurm

ddb8632

Slurm can be used for multi-node training, but not relevant in the tutorial, so it is removed.

fix(doc): correct typo on training tutorial

01ee4b5

fix(doc): correct dataset citation on training tutorial

94ecb4c

chore(doc): update TOC to reflect content

47320d7

fix(doc): correct link

19c1744

doc(training): update SFT tutorial's wording, separate shell script

5b95050

The shell script is to launch precompilation and fine-tuning.

feat(doc): add model merge after weights consolidation in SFT tutorial

c25fb7e

doc(tutorial): adapt evaluation section

d32de69

- reword inference hw suggestion - adapt code to use the merged model directory

chore(training): add TRL and PEFT as dependencies

3d483b6

Peft was already in the AMI, but hte version was not fixed, and TRL was not installed, but it is required for the SFT LoRA fine tune LLM tutorial.

fix(doc): remove tip tag

9d85e40

chore(dependencies): create "training" extra with dependencies

c11dda7

Instead of adding peft and trl as package dependencies, these are isolated in the training extra.

chore(doc): update sft training tutorial to install training extra

4ce9b9a

doc(tutorial): remove cmd option --zero_1 false

5c0bcee

chore(dependencies): add neuronx-distributed for training extra

15408fe

tengomucho force-pushed the training-update-tutorials branch from 544da8d to 15408fe Compare January 29, 2025 12:45

michaelbenayoun approved these changes Jan 29, 2025

View reviewed changes

tengomucho merged commit 43ad4be into main Jan 29, 2025
9 of 11 checks passed

tengomucho deleted the training-update-tutorials branch January 29, 2025 15:51

dacorvo mentioned this pull request Jan 30, 2025

Update sft_lora_finetune_llm.mdx #765

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SFT Training update tutorials #769

SFT Training update tutorials #769

tengomucho commented Jan 28, 2025 •

edited

Loading

dacorvo left a comment

dacorvo Jan 28, 2025

HuggingFaceDocBuilderDev commented Jan 28, 2025

michaelbenayoun Jan 29, 2025

michaelbenayoun Jan 29, 2025

tengomucho Jan 29, 2025

michaelbenayoun Jan 29, 2025

michaelbenayoun Jan 29, 2025

michaelbenayoun Jan 29, 2025

michaelbenayoun left a comment

michaelbenayoun Jan 29, 2025

	"It was released in 2004 by bizarre entertainment"
	"It was released in 2004 by Blizzard Entertainment"

SFT Training update tutorials #769

SFT Training update tutorials #769

Conversation

tengomucho commented Jan 28, 2025 • edited Loading

What does this PR do?

Before submitting

dacorvo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Jan 28, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

michaelbenayoun left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tengomucho commented Jan 28, 2025 •

edited

Loading