Capabilities
Results classification can use run artifacts such as screenshots, page state, URLs, step descriptions, element-targeting details, error messages, and stack traces, plus evidence from past runs. It is designed to:
- Distinguish product changes from brittle or outdated test authoring.
- Separate application bugs from infrastructure, gateway, browser, or external service failures.
- Identify missing setup such as test data, files, credentials, or shared fixture state.
- Detect timing races, missing waits, and async producer/consumer mismatches.
- Notice when an earlier passing step looks like a false positive in light of later evidence.
- Flag cases where Momentic execution, AI, cache, or recovery behavior appears to be the failure source.
- Estimate whether the failure is recoverable by an automated follow-up.
Actions
Each category has a configured action:
- Heal writes the classification data needed for the healing agent to pick up the run and attempt a repair. It exits zero when waiting from the CLI.
- Warn notifies the user through configured notification channels and exits zero when waiting from the CLI.
- Fail keeps the run blocking and exits with code 1.
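The exit-code behavior of the three actions can be summarized in a few lines. This is a minimal sketch for illustration only, not Momentic's implementation: the `Action` enum and `exit_code_for` function are hypothetical names, and the mapping assumes the CLI is waiting on the run, as described above.

```python
from enum import Enum


class Action(Enum):
    """Hypothetical names for the three configured actions."""
    HEAL = "heal"
    WARN = "warn"
    FAIL = "fail"


def exit_code_for(action: Action) -> int:
    """Return the exit code the text above describes for a waiting CLI."""
    # Only Fail keeps the run blocking; Heal and Warn both exit zero.
    return 1 if action is Action.FAIL else 0
```

In a CI pipeline this means only categories configured as Fail stop the job. Heal and Warn let it continue while the repair attempt or notification happens separately.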
Where you see it
The run viewer surfaces results classification in the failure details for a run:
- Category and reasoning
- Confidence and recoverability when available
- Selected post-classification action
- Error summary and supporting details
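The fields listed above can be pictured as a small record in which confidence and recoverability are optional. The sketch below is an assumption made for illustration: the class name, field names, and field types (for example, confidence as a number between 0 and 1) are hypothetical and do not reflect Momentic's actual data model.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ClassificationView:
    """Hypothetical shape of the classification shown in failure details."""
    category: str
    reasoning: str
    action: str
    error_summary: str
    confidence: Optional[float] = None   # assumed 0..1; may be absent
    recoverable: Optional[bool] = None   # may be absent
    supporting_details: List[str] = field(default_factory=list)


def summary_lines(view: ClassificationView) -> List[str]:
    """Build display lines, skipping optional fields that are not available."""
    lines = [f"Category: {view.category}", f"Reasoning: {view.reasoning}"]
    if view.confidence is not None:
        lines.append(f"Confidence: {view.confidence:.0%}")
    if view.recoverable is not None:
        lines.append(f"Recoverable: {'yes' if view.recoverable else 'no'}")
    lines.append(f"Action: {view.action}")
    lines.append(f"Error: {view.error_summary}")
    lines.extend(f"Detail: {d}" for d in view.supporting_details)
    return lines
```

Treating confidence and recoverability as optional mirrors the "when available" wording: a consumer of this data should not assume either field is present.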
