December 2024_xiangyu474_module1commit #158
Open
+13,692
−0
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
December 2024 Student Submission
Module Completed
Changes Made
Describe what you've done in this PR:
1_instruction_tuning\student_examples\xiangyu474\chat_templates_example.ipynb
, I implemented 2process_dataset
functions.1_instruction_tuning\student_examples\xiangyu474\sft_finetuning_example.ipynb
, I fine tuned the base model using "everyday-conversations" dataset.Notebooks Added/Modified
List any notebooks you've added or modified:
1_instruction_tuning\student_examples\xiangyu474\chat_templates_example.ipynb
1_instruction_tuning\student_examples\xiangyu474\sft_finetuning_example.ipynb
Checklist
december-2024
branchQuestions or Discussion Points
Add any questions you have or points you'd like to discuss:
sft_finetuning_example.ipynb
, the training section was taking a long time. To address this, I modified theSFTConfig
parameters, reducingmax_steps
from 1000 to 750. However, the resulting model output quality declined—it couldn't generate a proper haiku as requested.Additional Notes
Any other information that might be helpful for reviewers: