Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Flaky test] End To End Suite: kindest/node:v1.32.0: 4 failed e2e tests #4505

Open
nasedil opened this issue Mar 5, 2025 · 1 comment
Open
Labels
kind/bug Categorizes issue or PR as related to a bug. kind/flake Categorizes issue or PR as related to a flaky test.

Comments

@nasedil
Copy link
Contributor

nasedil commented Mar 5, 2025

/kind flake

What happened:
Flaky e2e tests in #4475 : https://prow.k8s.io/view/gs/kubernetes-ci-logs/pr-logs/pull/kubernetes-sigs_kueue/4475/pull-kueue-test-e2e-main-1-32/1897218139884621824
What you expected to happen:
test passes
How to reproduce it (as minimally and precisely as possible):
run in CI
Anything else we need to know?:
End To End Suite: kindest/node:v1.32.0: [It] Pod groups when Single CQ Unscheduled Pod which is deleted can be replaced in group

{Timed out after 10.001s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/util/util.go:104 with:
Error matcher expects an error.  Got:
    <nil>: nil failed [FAILED] Timed out after 10.001s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/util/util.go:104 with:
Error matcher expects an error.  Got:
    <nil>: nil
In [AfterEach] at: /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/pod_test.go:79 @ 03/05/25 09:43:00.078
}

might be related to #2529

End To End Suite: kindest/node:v1.32.0: [It] Pod groups when Single CQ should allow to schedule a group of diverse pods

{Timed out after 45.014s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/util/util.go:343 with:
it's finished
Expected
    <[]v1.Condition | len:2, cap:2>: [
        {
            Type: "QuotaReserved",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:43:02Z,
            },
            Reason: "QuotaReserved",
            Message: "Quota reserved in ClusterQueue cq",
        },
        {
            Type: "Admitted",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:43:02Z,
            },
            Reason: "Admitted",
            Message: "The workload is admitted",
        },
    ]
to have condition type Finished and status True failed [FAILED] Timed out after 45.014s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/util/util.go:343 with:
it's finished
Expected
    <[]v1.Condition | len:2, cap:2>: [
        {
            Type: "QuotaReserved",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:43:02Z,
            },
            Reason: "QuotaReserved",
            Message: "Quota reserved in ClusterQueue cq",
        },
        {
            Type: "Admitted",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:43:02Z,
            },
            Reason: "Admitted",
            Message: "The workload is admitted",
        },
    ]
to have condition type Finished and status True
In [It] at: /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/pod_test.go:387 @ 03/05/25 09:43:47.731

There were additional failures detected after the initial failure. These are visible in the timeline
}

might be related to #1898

End To End Suite: kindest/node:v1.32.0: [It] Pod groups when Single CQ should allow to preempt the lower priority group

{Timed out after 45.000s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/pod_test.go:483 with:
Expected
    <v1.PodPhase>: Running
to equal
    <v1.PodPhase>: Failed failed [FAILED] Timed out after 45.000s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/pod_test.go:483 with:
Expected
    <v1.PodPhase>: Running
to equal
    <v1.PodPhase>: Failed
In [It] at: /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/pod_test.go:485 @ 03/05/25 09:44:48.892

There were additional failures detected after the initial failure. These are visible in the timeline
}

might be related to #4434

End To End Suite: kindest/node:v1.32.0: [It] TopologyAwareScheduling when Creating a Job requesting TAS should admit a Job via TAS

{Timed out after 45.001s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/tas_test.go:144 with:
Expected
    <[]v1.Condition | len:2, cap:2>: [
        {
            Type: "QuotaReserved",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:45:01Z,
            },
            Reason: "QuotaReserved",
            Message: "Quota reserved in ClusterQueue cluster-queue",
        },
        {
            Type: "Admitted",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:45:01Z,
            },
            Reason: "Admitted",
            Message: "The workload is admitted",
        },
    ]
to have condition type Finished and status True failed [FAILED] Timed out after 45.001s.
The function passed to Eventually failed at /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/tas_test.go:144 with:
Expected
    <[]v1.Condition | len:2, cap:2>: [
        {
            Type: "QuotaReserved",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:45:01Z,
            },
            Reason: "QuotaReserved",
            Message: "Quota reserved in ClusterQueue cluster-queue",
        },
        {
            Type: "Admitted",
            Status: "True",
            ObservedGeneration: 1,
            LastTransitionTime: {
                Time: 2025-03-05T09:45:01Z,
            },
            Reason: "Admitted",
            Message: "The workload is admitted",
        },
    ]
to have condition type Finished and status True
In [It] at: /home/prow/go/src/sigs.k8s.io/kueue/test/e2e/singlecluster/tas_test.go:145 @ 03/05/25 09:45:46.878
}

Environment:

  • Kubernetes version (use kubectl version):
  • Kueue version (use git describe --tags --dirty --always):
  • Cloud provider or hardware configuration:
  • OS (e.g: cat /etc/os-release):
  • Kernel (e.g. uname -a):
  • Install tools:
  • Others:
@nasedil nasedil added the kind/bug Categorizes issue or PR as related to a bug. label Mar 5, 2025
@k8s-ci-robot k8s-ci-robot added the kind/flake Categorizes issue or PR as related to a flaky test. label Mar 5, 2025
@nasedil nasedil changed the title [Flaky test] End To End Suite: kindest/node:v1.32.0: [It] Pod groups when Single CQ Unscheduled Pod which is deleted can be replaced in group [Flaky test] End To End Suite: kindest/node:v1.32.0: 4 failed tests Mar 5, 2025
@nasedil nasedil changed the title [Flaky test] End To End Suite: kindest/node:v1.32.0: 4 failed tests [Flaky test] End To End Suite: kindest/node:v1.32.0: 4 failed e2e tests Mar 5, 2025
@tenzen-y
Copy link
Member

tenzen-y commented Mar 7, 2025

I extracted End To End Suite: kindest/node:v1.32.0: [It] Pod groups when Single CQ should allow to preempt the lower priority group to #4525 since there are some of similar testing failures.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug. kind/flake Categorizes issue or PR as related to a flaky test.
Projects
None yet
Development

No branches or pull requests

3 participants