You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There are multiple backends to choose for running the model that the Home Assistant integration uses. Here is a description of all the options for each backend
Limits the number of tokens that can be produced by each model response
512
Additional attribute to expose in the context
Extra attributes that will be exposed to the model via the {{ devices }} template variable
Service Call Regex
The regular expression used to extract service calls from the model response; should contain 1 repeated capture group
Refresh System Prompt Every Turn
Flag to update the system prompt with updated device states on every chat turn. Disabling can significantly improve agent response times when using a backend that supports prefix caching (Llama.cpp)