We introduce Nüwa, a comprehensive Traditional Chinese Medicine (TCM) LLM covering the entire training pipeline, from continuous pre-training and supervised instruction fine-tuning to reinforcement learning from AI feedback. Nüwa outperforms other open-source Chinese medical LLMs in the TCM domain, thanks in part to our construction of a large-scale TCM training corpus and a TCM dialogue dataset.
✅ [2024/08/15] Nüwa begins releasing the dataset, code, etc.
✅ [2024/08/01] The Nüwa TCM repo is created.
- `data/pretrain`: contains part of the TCM corpus for continuous pre-training.
- `data/finetune`: contains part of TCM-QR for supervised instruction fine-tuning.
- `data/reward`: contains samples for training the reward model.
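For illustration, a fine-tuning record in `data/finetune` can be expected to pair a TCM question with a reference response. The field names below (`instruction`, `output`) are assumptions for the sketch, not the repo's guaranteed schema; check the released files for the actual format. A minimal round-trip through one JSONL line:

```python
import json

# Hypothetical example record; the actual field names in data/finetune
# may differ -- this only illustrates the question/response pairing.
record = {
    "instruction": "What herbs are commonly used to dispel wind-cold?",
    "output": "Formulas for wind-cold commonly include ephedra (ma huang).",
}

# Serialize and parse one JSONL line, as a training data loader would.
line = json.dumps(record, ensure_ascii=False)
parsed = json.loads(line)
print(parsed["instruction"])
```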
Training Stage:
| Stage | Python script | Shell script |
|---|---|---|
Stage 1: Continue Pre-training | pretraining.py | run_pt.sh |
Stage 2: Supervised Instruction Fine-tuning | supervised_finetuning.py | run_sft.sh |
Stage 3: Reward Modeling | reward_modeling.py | run_rm.sh |
Stage 4: Reinforcement Learning | rl_training.py | run_rl.sh |
- To install the required packages, first create a conda environment:
conda create --name nvwa-tcm python=3.8
- Activate the conda environment:
conda activate nvwa-tcm
- Install the required packages with pip:
pip install -r requirements.txt
- Please download the LLaMA-Ziya-13B model from the Download Link.
- Continuous Pre-training
bash run_pt.sh
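Continuous pre-training is plain next-token language modeling on the TCM corpus; a common data-preparation step behind scripts like `run_pt.sh` is packing tokenized documents into fixed-length blocks. A toy sketch of that packing step (the token IDs and block size are made up for illustration):

```python
def pack_blocks(token_streams, block_size):
    """Concatenate tokenized documents and split the stream into
    equal-length training blocks, dropping the ragged tail."""
    flat = [tok for doc in token_streams for tok in doc]
    usable = (len(flat) // block_size) * block_size
    return [flat[i:i + block_size] for i in range(0, usable, block_size)]

# Three toy "documents" of token IDs, packed into blocks of 4.
docs = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
blocks = pack_blocks(docs, block_size=4)
print(blocks)  # [[1, 2, 3, 4], [5, 6, 7, 8]]
```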
- Supervised Instruction Fine-tuning
bash run_sft.sh
The LoRA method is used here, so the adapter parameters must be merged back into the base model:
python merge_peft_adapter.py
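Merging a LoRA adapter folds the low-rank update back into the frozen base weight, W' = W + (α/r)·B·A, so inference needs no separate adapter. The numpy sketch below is a toy illustration of that identity, not the `merge_peft_adapter.py` script itself (which operates on the PEFT checkpoint):

```python
import numpy as np

rng = np.random.default_rng(0)
d, r, alpha = 8, 2, 4                 # hidden size, LoRA rank, LoRA alpha

W = rng.standard_normal((d, d))       # frozen base weight
A = rng.standard_normal((r, d))       # LoRA down-projection
B = rng.standard_normal((d, r))       # LoRA up-projection
scaling = alpha / r

# Merging bakes the low-rank update into a single weight matrix.
W_merged = W + scaling * (B @ A)

x = rng.standard_normal(d)
y_adapter = x @ W.T + scaling * (x @ A.T) @ B.T   # base + adapter path
y_merged = x @ W_merged.T                          # merged path
print(np.allclose(y_adapter, y_merged))            # the two paths agree
```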
- Reward Modeling
bash run_rm.sh
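Reward models of this kind are typically trained on chosen/rejected response pairs (as in `data/reward`) with a pairwise ranking loss, -log σ(r_chosen − r_rejected). A pure-Python sketch of that objective; the repo's actual training details may differ:

```python
import math

def pairwise_reward_loss(r_chosen: float, r_rejected: float) -> float:
    """-log(sigmoid(r_chosen - r_rejected)): small when the reward
    model scores the chosen response above the rejected one."""
    margin = r_chosen - r_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Correct ordering yields a small loss; an inverted ordering a large one.
good = pairwise_reward_loss(2.0, -1.0)
bad = pairwise_reward_loss(-1.0, 2.0)
print(good < bad)
```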
- Reinforcement Learning
bash run_rl.sh
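RLHF-style training of this kind usually optimizes a PPO objective, clipping the policy ratio so each update stays close to the old policy. A minimal sketch of the standard clipped surrogate for a single token (an illustration of the textbook objective, not the repo's exact code):

```python
def ppo_clipped_objective(ratio: float, advantage: float, eps: float = 0.2) -> float:
    """PPO surrogate: min(ratio * A, clip(ratio, 1-eps, 1+eps) * A)."""
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped * advantage)

# With positive advantage, gains from ratios beyond 1+eps are clipped away.
print(ppo_clipped_objective(1.5, 1.0))  # capped at 1 + eps = 1.2
print(ppo_clipped_objective(0.9, 1.0))  # inside the clip range: 0.9
```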
- Inference
python inference.py