Skip to content

Commit

Permalink
docs: Update to use Standard_NC24ads_A100_v4 as default SKU in docs
Browse files Browse the repository at this point in the history
  • Loading branch information
Fei-Guo committed Jan 10, 2025
1 parent b4d9a85 commit ae4e1a0
Show file tree
Hide file tree
Showing 18 changed files with 25 additions and 25 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -50,7 +50,7 @@ kind: Workspace
metadata:
name: workspace-phi-3-5-mini
resource:
instanceType: "Standard_NC6s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: phi-3-5
Expand All @@ -65,8 +65,8 @@ The workspace status can be tracked by running the following command. When the W

```sh
$ kubectl get workspace workspace-phi-3-5-mini
NAME INSTANCE RESOURCEREADY INFERENCEREADY JOBSTARTED WORKSPACESUCCEEDED AGE
workspace-phi-3-5-mini Standard_NC6s_v3 True True True 4h15m
NAME INSTANCE RESOURCEREADY INFERENCEREADY JOBSTARTED WORKSPACESUCCEEDED AGE
workspace-phi-3-5-mini Standard_NC24ads_A100_v4 True True True 4h15m
```

Next, one can find the inference service's cluster ip and use a temporal `curl` pod to test the service endpoint in the cluster.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-custom-llm
resource:
instanceType: "Standard_NC12s_v3" # Replace with the required VM SKU based on model requirements
instanceType: "Standard_NC24ads_A100_v4" # Replace with the required VM SKU based on model requirements
labelSelector:
matchLabels:
apps: custom-llm
Expand Down
4 changes: 2 additions & 2 deletions docs/custom-model-integration/reference-image-deployment.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-custom-llm
resource:
instanceType: "Standard_NC12s_v3" # Replace with the required VM SKU based on model requirements
instanceType: "Standard_NC24ads_A100_v4" # Replace with the required VM SKU based on model requirements
labelSelector:
matchLabels:
apps: custom-llm
Expand Down Expand Up @@ -37,4 +37,4 @@ inference:
volumes:
- name: dshm
emptyDir:
medium: Memory
medium: Memory
6 changes: 3 additions & 3 deletions docs/inference/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ kind: Workspace
metadata:
name: workspace-falcon-7b
resource:
instanceType: "Standard_NC6s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: falcon-7b
Expand Down Expand Up @@ -54,7 +54,7 @@ metadata:
annotations:
kaito.sh/runtime: "transformers"
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: falcon-7b
Expand All @@ -73,7 +73,7 @@ kind: Workspace
metadata:
name: workspace-falcon-7b
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: falcon-7b
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_falcon_7b-instruct.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-falcon-7b-instruct
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: falcon-7b-instruct
Expand Down
4 changes: 2 additions & 2 deletions examples/inference/kaito_workspace_falcon_7b.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,11 +3,11 @@ kind: Workspace
metadata:
name: workspace-falcon-7b
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: falcon-7b
inference:
preset:
name: "falcon-7b"


Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-falcon-7b
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: falcon-7b
Expand All @@ -15,4 +15,4 @@ inference:
name: "falcon-7b-adapter"
image: "<YOUR_IMAGE>"
strength: "0.2"


2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_llama2_13b-chat.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-llama-2-13b-chat
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: llama-2-13b-chat
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_llama2_13b.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-llama-2-13b
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: llama-2-13b
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_llama2_7b-chat.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-llama-2-7b-chat
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: llama-2-7b-chat
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_llama2_7b.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-llama-2-7b
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: llama-2-7b
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-mistral-7b-instruct
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: mistral-7b-instruct
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_mistral_7b.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-mistral-7b
resource:
instanceType: "Standard_NC12s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: mistral-7b
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_phi_2.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-phi-2
resource:
instanceType: "Standard_NC6s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: phi-2
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_phi_3.5-instruct.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-phi-3-5-mini
resource:
instanceType: "Standard_NC6s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: phi-3-5
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_phi_3_mini_128k.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-phi-3-mini
resource:
instanceType: "Standard_NC6s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: phi-3
Expand Down
2 changes: 1 addition & 1 deletion examples/inference/kaito_workspace_phi_3_mini_4k.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-phi-3-mini
resource:
instanceType: "Standard_NC6s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: phi-3
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ kind: Workspace
metadata:
name: workspace-phi-3-mini-adapter
resource:
instanceType: "Standard_NC6s_v3"
instanceType: "Standard_NC24ads_A100_v4"
labelSelector:
matchLabels:
apps: phi-3-adapter
Expand Down

0 comments on commit ae4e1a0

Please sign in to comment.