Reduce the impact of flaky tests to be able to make them more actionable #7055

joperezr · 2025-01-09T19:53:50Z

Objective: Improve the infrastructure to reduce the impact of flaky tests and make them more actionable.

Tasks:

Consider Isolating flaky tests into individual jobs to be able to re-run just those when they fail.
Implement mechanisms to quickly identify and disable failing tests.
Ensure that test failures are actionable with proper logs and diagnostics. (e.g. Once a test fails, we have enough information to investigate and fix them)

sebastienros · 2025-01-10T17:11:10Z

Candidate for quarantining: AzureServiceBusExtensionsTests.VerifyWaitForOnServiceBusEmulatorBlocksDependentResources

Examples:

eerhardt · 2025-01-10T21:54:16Z

I've disabled the following in Disable Azure ServiceBus emulator functional tests (dotnet/aspire#7067):

AzureServiceBusExtensionsTests.VerifyWaitForOnServiceBusEmulatorBlocksDependentResources
AzureServiceBusExtensionsTests.VerifyAzureServiceBusEmulatorResource
The ServiceBus portion of the Azure Functions Playground test

davidfowl · 2025-01-20T18:48:56Z

Why do we have this #7056

danmoseley · 2025-01-20T21:27:50Z

Why do we have this #7056

The assumption was there was test specific work to fix flaky tests, and infra work to reduce the impact when tests were actually flaky. But I guess everyone's using just this issue

davidfowl · 2025-02-08T18:38:44Z

I think we're done with this for 9.1.

We moved to GitHub actions for PRs
We disabled flaky tests https://github.com/dotnet/aspire/issues?q=is%3Aissue%20state%3Aopen%20label%3Aflaky-test, they should mostly be in the right area labels. We should mark these high priority and assign them like normal bugs. (@joperezr @danmoseley looking for your help here)
We fixed lots of flaky tests. There was a mixture of product issues and test issues.
We added LOTS of logs to the CI pipelines and product. We should be capturing everything required to isolate test problems.

We don't have a way to quarantine test run as yet. That would be the last thing I think we could do. Run all tests marked with ActiveIssue in a separate workflow that could be used to investigate issues. I was thinking about a manually triggerable workflow to run a specific test project.

davidfowl · 2025-02-11T08:19:59Z

Closing this issue out.

joperezr added the flaky-test label Jan 9, 2025

joperezr added this to the 9.1 milestone Jan 9, 2025

joperezr assigned JamesNK Jan 9, 2025

joperezr added the area-engineering-systems infrastructure helix infra engineering repo stuff label Jan 9, 2025

JamesNK assigned davidfowl and unassigned JamesNK Jan 20, 2025

davidfowl closed this as completed Feb 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce the impact of flaky tests to be able to make them more actionable #7055

Reduce the impact of flaky tests to be able to make them more actionable #7055

joperezr commented Jan 9, 2025 •

edited by davidfowl

Loading

sebastienros commented Jan 10, 2025

eerhardt commented Jan 10, 2025 •

edited

Loading

davidfowl commented Jan 20, 2025

danmoseley commented Jan 20, 2025 •

edited

Loading

davidfowl commented Feb 8, 2025

davidfowl commented Feb 11, 2025

Reduce the impact of flaky tests to be able to make them more actionable #7055

Reduce the impact of flaky tests to be able to make them more actionable #7055

Comments

joperezr commented Jan 9, 2025 • edited by davidfowl Loading

sebastienros commented Jan 10, 2025

eerhardt commented Jan 10, 2025 • edited Loading

davidfowl commented Jan 20, 2025

danmoseley commented Jan 20, 2025 • edited Loading

davidfowl commented Feb 8, 2025

davidfowl commented Feb 11, 2025

joperezr commented Jan 9, 2025 •

edited by davidfowl

Loading

eerhardt commented Jan 10, 2025 •

edited

Loading

danmoseley commented Jan 20, 2025 •

edited

Loading