
Use fused softmax kernel in llama attention layer #3356

Re-run triggered: October 23, 2024 17:43
Status: Skipped
Total duration: 4s
Artifacts: none

Workflow: ci_cuda.yaml
Triggered on: pull_request
Job: test-cuda (0s)
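
For context, the change this CI run gates swaps a generic softmax for a fused last-dim kernel inside the llama attention layer. A minimal sketch of that kind of swap, assuming the candle crates (candle_core, candle_nn); the tensor shapes and variable names are illustrative, not taken from the PR diff:

```rust
use candle_core::{D, Device, Result, Tensor};
use candle_nn::ops::{softmax, softmax_last_dim};

fn main() -> Result<()> {
    let device = Device::Cpu;
    // Illustrative attention scores: (batch, heads, seq_len, seq_len).
    let scores = Tensor::randn(0f32, 1f32, (1, 8, 16, 16), &device)?;

    // Before: generic softmax, parameterized over the reduction dim.
    let probs_ref = softmax(&scores, D::Minus1)?;

    // After: softmax_last_dim, which can dispatch to a fused kernel
    // for the common reduce-over-last-dim case used in attention.
    let probs = softmax_last_dim(&scores)?;

    // Both paths should agree numerically.
    let diff = (&probs - &probs_ref)?.abs()?.flatten_all()?.max(0)?;
    println!("max abs diff: {}", diff.to_scalar::<f32>()?);
    Ok(())
}
```

Fusing the max/exp/normalize steps into one kernel avoids materializing intermediates and extra passes over the attention matrix, which is presumably why the change is gated behind a CUDA-specific CI job.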