Hi authors,
Thanks for the wonderful initial work on harmful fine-tuning. We recently noticed a large number of papers coming out on harmful fine-tuning attacks for LLMs. We have released a preprint survey summarizing the existing follow-up papers on this issue.
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey https://arxiv.org/abs/2409.18169
Repo: https://github.com/git-disl/awesome_LLM-harmful-fine-tuning-papers
It would be nice if you could incorporate our survey into the README, as it could attract more people to work on this important topic. But no pressure if you feel it is inappropriate.
Thanks,
Tiansheng Huang