Take token count quantization of fused attention into consideration for CP results correction #1396

Merged
xrennvidia merged 3 commits into NVIDIA:main from xrennvidia:xren/cp_lse on Jan 10, 2025

Commits

Commits on Jan 9, 2025