driver_matrix: Multi-stage pipeline tracking #289

k0machi · 2023-10-24T02:05:54Z

Currently, driver matrix tests rely on filename parsing to determine which test they are running. In addition to that, the submission happens at the end of pipeline, losing some of the potentially useful information - like start time, overall pipeline health is guessed instead.

The replacement mechanism should function akin to SCT and provide following features:

Maintain a unique job id per matrix run
Create an empty entity when a run is started
Submit each test result separately as they complete
Use pipeline health indicator to provide general status - Failed, Passed, etc
Pass the test information (kind, driver) to the API.

fruch · 2024-01-18T08:07:56Z

@k0machi

can you expend a bit, this sound like a change more then Argus itself.
can you estimate how long it would take ?

k0machi · 2024-01-23T09:37:43Z

Better title and time estimation added.

k0machi · 2024-06-25T04:07:46Z

Current plans:

~~Switching from docker container to a standalone client - assuming the ec2-asg-x86 node pool we have has python 3.10~~
Making a cli interface for the driver results - skipping need for a custom python script that has to be maintained separately
Keeping legacy endpoint by checking schema version on the request
The new API will contain this flow:
- Create run - Requires UUID (pipeline manager) and build environment information (separate parameters, undecided which ones are needed). Run will be created with status "CREATED"
- Update run with build information if available
- Submit result - Requires raw XML encoded as base64 inside the request json, as well as job id. First submission will set test status to "RUNNING" and apply heartbeat.
- Fail result - Same as submit, probably under same command except different type, needs failure reason, run id. Won't change the status yet.
- Finalize - Indicate that we have finished the pipeline. Backend will determine the results from that. Will need just the id.

k0machi · 2024-06-27T11:10:54Z

As a note, we will need to update all driver matrix repositories to generate some kind of metadata file, as the current implementation runs every version on the same process and only outputs results at the end, so we will need some way to collect which file belongs to which version.

Additionally, adding a way to store logs from driver matrix results would be very useful, akin to how sct does it. Saving scylla nodes logs might prove tricky as that is more on the driver's test framework side than ours.

This commit reworks the way Driver Matrix Runs are submitted to Argus, replacing previously one-shot architecture with a multi-stage pipeline akin to SCT, where driver matrix run first submits itself, and then submits results sequentially (either a pass or failure), finalizing with a call that makes backend determine status based on resulting payload. Patch failures are now supported. Entire logic was moved from the client to the backend, removing the need to update the client in case of large changes to the API / submitted data. Fixes scylladb#289

This commit adds a new file emitted by the matrix runs, intended to provide metadata about the run for reporting tools that are executed later, such as Argus. Currently, the metadata file always contains the driver name (derived from result xml name), type (in this case it's always python), and either failure_reason key (which contains exception stack trace that happened during .run() or path to the resulting junit xml file relative to the metadata file. This commit is part of ongoing task to improve matrix pipeline reporting to Argus. Task: scylladb/argus#289

This commit reworks the way Driver Matrix Runs are submitted to Argus, replacing previously one-shot architecture with a multi-stage pipeline akin to SCT, where driver matrix run first submits itself, and then submits results sequentially (either a pass or failure), finalizing with a call that makes backend determine status based on resulting payload. Patch failures are now supported. Entire logic was moved from the client to the backend, removing the need to update the client in case of large changes to the API / submitted data. Fixes scylladb#289

This commit adds a new file emitted by the matrix runs, intended to provide metadata about the run for reporting tools that are executed later, such as Argus. Currently, the metadata file always contains the driver name (derived from result xml name), type (in this case it's always python), and either failure_reason key (which contains exception stack trace that happened during .run() or path to the resulting junit xml file relative to the metadata file. This commit is part of ongoing task to improve matrix pipeline reporting to Argus. Task: scylladb/argus#289

This commit reworks the way Driver Matrix Runs are submitted to Argus, replacing previously one-shot architecture with a multi-stage pipeline akin to SCT, where driver matrix run first submits itself, and then submits results sequentially (either a pass or failure), finalizing with a call that makes backend determine status based on resulting payload. Patch failures are now supported. Entire logic was moved from the client to the backend, removing the need to update the client in case of large changes to the API / submitted data. Fixes scylladb#289

This commit adds a new file emitted by the matrix runs, intended to provide metadata about the run for reporting tools that are executed later, such as Argus. Currently, the metadata file always contains the driver name (derived from result xml name), type (in this case it's always python), and either failure_reason key (which contains exception stack trace that happened during .run() or path to the resulting junit xml file relative to the metadata file. This commit is part of ongoing task to improve matrix pipeline reporting to Argus. Task: scylladb/argus#289

This commit reworks the way Driver Matrix Runs are submitted to Argus, replacing previously one-shot architecture with a multi-stage pipeline akin to SCT, where driver matrix run first submits itself, and then submits results sequentially (either a pass or failure), finalizing with a call that makes backend determine status based on resulting payload. Patch failures are now supported. Entire logic was moved from the client to the backend, removing the need to update the client in case of large changes to the API / submitted data. Fixes #289

fruch · 2024-09-12T06:40:35Z

@k0machi

reopen this one, since we didn't finish the work for all of the matrixes,
please point from here to the PRs or other related work.

This commit adds metadata files to java matrix reports, allowing argus to correctly collect each version separately. Task: scylladb/argus#289

This commit introduces results metadata files for collecting by argus scripts within scylla-pkg. Task: scylladb/argus#289

This commit adds metadata files to java matrix reports, allowing argus to correctly collect each version separately. Task: scylladb/argus#289

This commit adds metadata files support to make it possible for Argus to upload individual runs to itself using the client application. Task: scylladb/argus#289

This commit adds metadata files to java matrix reports, allowing argus to correctly collect each version separately. Task: scylladb/argus#289

This commit adds metadata files support to make it possible for Argus to upload individual runs to itself using the client application. Task: scylladb/argus#289

This commit introduces results metadata files for collecting by argus scripts within scylla-pkg. Task: scylladb/argus#289

fruch · 2024-09-23T13:27:37Z

Waiting on for rust
and then update all of the pipelines

k0machi added the refactor label Oct 24, 2023

k0machi mentioned this issue Oct 24, 2023

Driver Matrix usability improvements #288

Merged

k0machi mentioned this issue Dec 20, 2023

Need to add FAILED-INFRA vs. FAILED-BUG (and perhaps FAILED-TEST) failures #316

Closed

fruch assigned k0machi Dec 20, 2023

fruch added enhancement New feature or request Argus labels Dec 20, 2023

k0machi mentioned this issue Dec 21, 2023

siren-tests: "sirenada-stg-t3-micro" job appears in Argus as 'Not Planned' although it runs on a daily basis. #269

Open

k0machi changed the title ~~driver_matrix: Replace filename parsing with job API similar to SCT~~ driver_matrix: Adding Job API similar to SCT Jan 23, 2024

k0machi changed the title ~~driver_matrix: Adding Job API similar to SCT~~ driver_matrix: Multi-stage pipeline tracking Jun 5, 2024

k0machi mentioned this issue Jul 15, 2024

refactor(plugins/driver_matrix_tests): Multi-stage pipeline for Drivers #387

Merged

k0machi mentioned this issue Jul 22, 2024

run.py: Introduce metadata files for runs scylladb/python-driver-matrix#85

Merged

k0machi closed this as completed in #387 Aug 27, 2024

fruch reopened this Sep 12, 2024

k0machi mentioned this issue Sep 13, 2024

run.py: Introduce metadata files for results scylladb/scylla-java-driver-matrix#54

Merged

k0machi added a commit to k0machi/scylla-cpp-driver-matrix that referenced this issue Sep 13, 2024

run.py: Introduce metadata files for results

695aef3

This commit introduces results metadata files for collecting by argus scripts within scylla-pkg. Task: scylladb/argus#289

k0machi mentioned this issue Sep 13, 2024

run.py: Introduce metadata files for results scylladb/scylla-cpp-driver-matrix#19

Merged

k0machi mentioned this issue Sep 17, 2024

run.py: Add metadata file support scylladb/gocql-driver-matrix#14

Merged

fruch pushed a commit to scylladb/scylla-cpp-driver-matrix that referenced this issue Sep 19, 2024

run.py: Introduce metadata files for results

488f3df

This commit introduces results metadata files for collecting by argus scripts within scylla-pkg. Task: scylladb/argus#289

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

driver_matrix: Multi-stage pipeline tracking #289

driver_matrix: Multi-stage pipeline tracking #289

k0machi commented Oct 24, 2023 •

edited

Loading

fruch commented Jan 18, 2024

k0machi commented Jan 23, 2024

k0machi commented Jun 25, 2024 •

edited

Loading

k0machi commented Jun 27, 2024 •

edited

Loading

fruch commented Sep 12, 2024

fruch commented Sep 23, 2024

driver_matrix: Multi-stage pipeline tracking #289

driver_matrix: Multi-stage pipeline tracking #289

Comments

k0machi commented Oct 24, 2023 • edited Loading

fruch commented Jan 18, 2024

k0machi commented Jan 23, 2024

k0machi commented Jun 25, 2024 • edited Loading

k0machi commented Jun 27, 2024 • edited Loading

fruch commented Sep 12, 2024

fruch commented Sep 23, 2024

k0machi commented Oct 24, 2023 •

edited

Loading

k0machi commented Jun 25, 2024 •

edited

Loading

k0machi commented Jun 27, 2024 •

edited

Loading