-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EvalStatus on placement failure #24824
Comments
This is really confusing! Thanks for filing an issue. I think this is probably best fixed with documentation (I can't find any!), but it's possible there's some CLI improvements we could make as well. What's going on?When you schedule a job it creates an evaluation as you have observed. When that job cannot be placed (due to lack of cluster capacity), a blocked evaluation is created and the original evaluation is marked as complete. The CLI makes this very confusing:
And the deployment spins until it hits its progress deadline or capacity becomes available. So you did the natural thing and peeked at the evaluation to see what happened:
That very last line is the missing piece of your puzzle! Let's look at
That's the blocked evaluation that needs to be monitored! So the evaluations form a chain that's a bit tricky to follow:
The UI makes this much clearer by going to the Evaluations tab on the job: The SolutionI'm open to ideas! I think the Scheduling > Placement doc is the most obvious place to explain blocked evaluations. I'm shocked it doesn't mention "blocked" or even "eval" once! |
Nomad version
Output from
nomad version
1.9.4
Operating system and Environment details
Clang: 16.0.0 build 1600
Git: 2.47.1 => /opt/homebrew/bin/git
Curl: 8.7.1 => /usr/bin/curl
macOS: 15.2-arm64
CLT: 16.2.0.0.1.1733547573
Xcode: 16.0
Rosetta 2: false
Issue
On a placement failure status returned is "complete" in evaluation structure.
There's no error the only thing one can use to detect this is 'trigger-by' field where i have will have to compare to string "job-deregister"
i understand since evaluation is complete(concluded job cannot be placed) so status is complete , but there i don't see a simple way to find out if job got placed
Reproduction steps
added a sample job file
Expected Result
expecting EvalStatus to be blocked or cancelled
Actual Result
Job file (if appropriate)
Nomad Server logs (if appropriate)
Nomad Client logs (if appropriate)
The text was updated successfully, but these errors were encountered: