Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

126 Open 1,191 Closed

🐛 bug 🏋 DPO 👁️ VLM

#2563 opened Jan 12, 2025 by liuchaohu

5 of 9 tasks

❓ question 🏋 RLOO

#2562 opened Jan 11, 2025 by mnoukhov

5 of 9 tasks

⚡accelerate 🏋 PPO 🏋 RLOO

#2555 opened Jan 10, 2025 by dawidm

7 of 9 tasks

✨ enhancement 🏋 KTO

#2554 opened Jan 10, 2025 by starmpcc

🐛 bug 🏋 DPO

#2553 opened Jan 9, 2025 by solume

7 of 9 tasks

❓ question 🏋 SFT

#2545 opened Jan 6, 2025 by okhat

Is truncation_mode used in DPOTrainer? 🏋 DPO ❓ question

#2538 opened Jan 2, 2025 by anakin87

🏋 DPO 🙋 help from community wanted ⚡ PEFT

#2536 opened Jan 2, 2025 by maoulee

7 of 9 tasks

🙋 help from community wanted 🏋 PPO

#2534 opened Dec 31, 2024 by SachinVashisth

🐛 bug 🚀 deepspeed ⏳ needs more info 🏋 Online DPO

#2532 opened Dec 30, 2024 by yiyepiaoling0715

5 of 9 tasks

✨ enhancement 🏋 Online DPO 🏋 PPO 🏋 RLOO

#2529 opened Dec 28, 2024 by dawidm

✨ enhancement

#2525 opened Dec 28, 2024 by August-murr

3 tasks done

🏋 PPO ❓ question

#2518 opened Dec 24, 2024 by yananchen1989

✨ enhancement

#2517 opened Dec 23, 2024 by AMindToThink

3 tasks

🐛 bug 🏋 RLOO

#2515 opened Dec 23, 2024 by dawidm

7 of 9 tasks

🐛 bug ⚡ PEFT 🏋 SFT

#2514 opened Dec 21, 2024 by SwayamInSync

7 of 9 tasks

🐛 bug 🏋 DPPO 🙋 help from community wanted ⏳ needs more info

#2505 opened Dec 20, 2024 by nguyenhoa-uit

5 of 9 tasks

ProTip! What’s not been updated in a month: updated:<2024-12-14.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list