Skip to content

Commit

Permalink
move k8s deployment file
Browse files Browse the repository at this point in the history
  • Loading branch information
saienduri committed Dec 20, 2024
1 parent b320fcb commit 78dcc9b
Show file tree
Hide file tree
Showing 2 changed files with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/shortfin/llm/user/e2e_llama8b_k8s.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@ behind a load balancer on MI300X GPU.

### Deploy shortfin llama app service

Save [llama-app-deployment.yaml](https://github.com/nod-ai/shark-ai/tree/main/shortfin/python/shortfin_apps/llm/k8s/llama-app-deployment.yaml) locally and edit it to include your artifacts and intended configuration.
Save [llama-app-deployment.yaml](../../../../shortfin/deployment/shortfin_apps/llm/k8s/llama-app-deployment.yaml) locally and edit it to include your artifacts and intended configuration.

To deploy llama app:

Expand Down

0 comments on commit 78dcc9b

Please sign in to comment.