December 2024_xiangyu474_module1commit #158

xiangyu474 · 2025-01-02T06:24:51Z

December 2024 Student Submission

Describe what you've done in this PR:

In 1_instruction_tuning\student_examples\xiangyu474\chat_templates_example.ipynb, I implemented 2 process_dataset functions.
In 1_instruction_tuning\student_examples\xiangyu474\sft_finetuning_example.ipynb, I fine tuned the base model using "everyday-conversations" dataset.

List any notebooks you've added or modified:

Added new example in 1_instruction_tuning\student_examples\xiangyu474\chat_templates_example.ipynb
Added new example in 1_instruction_tuning\student_examples\xiangyu474\sft_finetuning_example.ipynb
Modified existing notebook with additional examples
Added documentation or comments

Add any questions you have or points you'd like to discuss:

In sft_finetuning_example.ipynb, the training section was taking a long time. To address this, I modified the SFTConfig parameters, reducing max_steps from 1000 to 750. However, the resulting model output quality declined—it couldn't generate a proper haiku as requested.

Any other information that might be helpful for reviewers:

burtenshaw · 2025-01-08T03:49:55Z

Nice work @xiangyu474 !

Would you like to take part in peer review? If so, mention me on a PR from another student, review it, and I'll get a student to review yours.

nevernever69 · 2025-01-18T12:31:43Z

@burtenshaw this pr looks good read the files.

xiangyu474 added 3 commits December 27, 2024 16:18

xiangyu474_Module1.1

8efefdc

Create sft_finetuning_example.ipynb

40d9292

module_1_commit

03c7f39

nevernever69 approved these changes Jan 18, 2025

View reviewed changes