Feature/llama factory llama2 pipeline (#89)
* added llama-factory under llm_rl

* added sft training bash

* added datasets from llama-factory; will delete later

* finished llama-2-13b train and inference

* fixed minor errors

* changed config

* added deepspeed config

* added more training config to train bash

* adding fix for wandb tags and distributed ranks

* added fastchat data to replicate training for 2k
Jasonqi146 authored and lwaekfjlk committed Nov 16, 2023
1 parent 8bfe735 commit b8e58b0
Showing 32 changed files with 2,453,774 additions and 10 deletions.
7 changes: 6 additions & 1 deletion .gitignore
@@ -20,6 +20,7 @@ llm_ft/checkpoints/*
llm_ft/*_checkpoints/*
!**/dummy_conversation.json
!llm_ft/deepspeed_config_s2.json
!llm_rl/data/*.json

# Editor
.idea
@@ -193,4 +194,8 @@ cython_debug/
# be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
# and can be added to the global gitignore or merged into this file. For a more nuclear
# option (not recommended) you can uncomment the following to ignore the entire idea folder.
#.idea/
#.idea/

./llm_rl/preprocess/GPT4-4_Redis_Easy_No_Slide

llm_rl/*cache/
6 changes: 6 additions & 0 deletions llm_rl/cli_inference-llama-2-13b.sh
@@ -0,0 +1,6 @@
python src/cli_demo.py \
--model_name_or_path meta-llama/Llama-2-13b-hf \
--cache_dir ./model_cache \
--template llama2-sotopia \
--finetuning_type lora \
--checkpoint_dir /workspace/sotopia-llm/llm_rl/llama2-13b-sft_cache/checkpoint-35
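
The script above hard-codes an absolute checkpoint path, so it fails late (after model loading starts) if that directory is missing on another machine. As a minimal sketch, and not part of this commit, the same command could be wrapped in a function that checks the directory first. The function names `check_checkpoint_dir` and `run_inference` are hypothetical; the flags are copied unchanged from the script above.

```shell
# Sketch only: function names are hypothetical, not from the commit.
# The python flags are copied from cli_inference-llama-2-13b.sh.

# Return 0 if the LoRA checkpoint directory exists, 1 otherwise.
check_checkpoint_dir() {
  if [ ! -d "$1" ]; then
    echo "checkpoint directory not found: $1" >&2
    return 1
  fi
}

# Run the CLI demo against the checkpoint directory given as $1,
# refusing to start if the directory does not exist.
run_inference() {
  check_checkpoint_dir "$1" || return 1
  python src/cli_demo.py \
    --model_name_or_path meta-llama/Llama-2-13b-hf \
    --cache_dir ./model_cache \
    --template llama2-sotopia \
    --finetuning_type lora \
    --checkpoint_dir "$1"
}
```

Assuming the functions are sourced from `llm_rl/`, a call such as `run_inference ./llama2-13b-sft_cache/checkpoint-35` would behave like the original script when the checkpoint exists, and return early with an error message otherwise.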
1 change: 1 addition & 0 deletions llm_rl/data/GPT4-4_Redis_Easy_No_Slide.json

Large diffs are not rendered by default.

260,012 changes: 260,012 additions & 0 deletions llm_rl/data/alpaca_data_en_52k.json

Large diffs are not rendered by default.

257,308 changes: 257,308 additions & 0 deletions llm_rl/data/alpaca_data_zh_51k.json

Large diffs are not rendered by default.
