Add atvm144-suse15.0 to the ATVM automation blacklist because it crashes when creating a migration session. Update maintained exclude examples to include the new blacklist entry. Tighten status reporting guidance to require one machine per line. Add a Notes section for failure reasons and operator-facing context. Record the new run learnings in atvm-automation-runs.md.
6.0 KiB
6.0 KiB
Run ATVM Automation Runs
This file stores run-specific examples only when a run produced a new learning relevant to future automation tasks.
Entry Rule
- Add an entry only when a run changed workflow behavior, exposed a failure mode, or confirmed a required new check.
- Do not add routine runs with no new learning.
Current State
- No run-learning entries recorded yet from
atvm-automation-guide.mdsource material.
Run Learning: 2026-03-08 (E2E redhat9.7, pure/fc)
- Request:
- template:
cmc-e2e - filter:
--containsVm redhat9.7 - integration:
--integration_type pure - plugin:
--use_specified_plugin fc
- template:
- Observed result:
- Cypress spec execution passed (
1test,1passing,0failing). - Cloud run URL was produced and marked uploaded.
run-sorry-cypress.pyremained running afterward with a defunctnpm exec cypress-cloudchild process and did not exit cleanly on its own.
- Cypress spec execution passed (
- Action for future runs:
- If pass/upload is confirmed but
run-sorry-cypress.pydoes not exit, treat it as a runner hang condition. - Capture run URL and pass/fail status first, then terminate the stuck runner process cleanly.
- If pass/upload is confirmed but
Run Learning: 2026-03-09 (Blacklist handling and status format)
- Observed requirement:
- Some ATVM machines must be skipped even when a broad selector such as
--containsVmor--randomizewould otherwise include them.
- Some ATVM machines must be skipped even when a broad selector such as
- Machines to blacklist via
--exclude_partial_match:BLACKLISTED: CMC INSTALL - CAN'T COMPILE:atvm6-centos6.0atvm41-redhat6.0atvm73-oracle6.0
BLACKLISTED: SUPPORT REQUEST - WAITING:atvm113-debian9.0.0atvm115-debian9.1.0atvm116-debian9.2.0atvm156-debian9.3.0
- Needs re-creation:
atvm157-debian13.0.0
- Action for future runs:
- Add these machine names to
--exclude_partial_matchwhen building broad-scope automation commands. - When reporting run status, include skipped blacklisted machines separately with their reason, in addition to completed and remaining machines.
- Use the run
build_nameas the heading/title for status responses so the test type is obvious. - For failed machines in status responses, include the failure reason taken from the run log.
- Include timing details in status responses: start time, end time when complete, and total or elapsed runtime.
- Also include timing stats in status responses: quickest completed test runtime, longest completed test runtime, and average completed test runtime.
- Add these machine names to
Run Learning: 2026-03-11 (Machine-first status lines and whole-run ETA)
- Observed requirement:
- Status output must list each machine first and then its status, rather than leading with the status label.
- Estimated completion time must refer to the entire remaining automation run, not only the currently running machine.
- Action for future runs:
- Format machine entries as
machine-name - STATUS. - Keep failure reasons after the machine/status entry when a machine failed.
- When giving ETA, explicitly state it is the estimate for completion of the full remaining run.
- Format machine entries as
Run Learning: 2026-03-11 (Categorized run status must be reconstructed across batches)
- Observed failure mode:
run-sorry-cypress.py --categorizemutates the active config to the current category batch, so live files such asspecPattern,current_vm, and the newest/tmpCypress JSON only describe the current category, not the full automation run.- Answering from only the current live batch underreports the run and misses already-finished machines from earlier category batches.
- Action for future runs:
- Reconstruct whole-run status from the generated machine scope plus all machine result artifacts written since the run start time.
- Use the current batch only to identify the live
RUNNINGmachine and immediate next machine(s), not as the full run scope. - Do not answer status requests for categorized runs until earlier category results have been checked as part of the same run.
Run Learning: 2026-03-11 (Hash-named XML files still belong to machine runs)
- Observed failure mode:
- Same-run JUnit output is not consistently named
test-result-atvm...xml. - Many machine results for the same automation run were written as hash-named files such as
test-result-01fe412894862398d06d9cc4bc7e81a0.xml. - Limiting status reconstruction to machine-named XML files causes major undercounting of completed machines.
- Same-run JUnit output is not consistently named
- Action for future runs:
- Parse all
test-result-*.xmlfiles written since the run start time, not onlytest-result-atvm*.xml. - Extract the machine name from XML contents such as
testsuite file=,testsuite name=, ortestcase name=when the filename does not include the machine name. - Treat
check-xml-files.tsXML outputs as bookkeeping steps, not machine results. - Prefer the most recently written same-run XML per machine when multiple XML files exist for that machine.
- Parse all
Run Learning: 2026-03-12 (Status output must be one machine per line with notes separated)
- Observed requirement:
- Listing multiple completed machines on one line makes run status harder to scan and does not meet the expected reporting format.
- Failure reasons and extra context should be separated from the machine status list so the list stays clean.
- Action for future runs:
- Under completed, skipped, and remaining sections, put exactly one machine status on each line.
- Add a
Notessection after completed machines for failure reasons, anomalies, and other operator-relevant context. - Keep completed machine lines in the form
machine-name - STATUSand avoid appending long explanations inline.
Run Learning: 2026-03-12 (Add suse15.0 machine to blacklist)
- Observed requirement:
atvm144-suse15.0must be excluded from automation runs because it crashes while creating the migration session.
- Action for future runs:
- Add
atvm144-suse15.0to the maintained blacklist. - Record the reason as
CRASHES WHEN CREATING MIGRATION SESSION - BUG. - Include it in reusable
--exclude_partial_matchcommand examples.
- Add