Fix race when creating secrets and Kubernetes jobs #4319

p-se · 2025-11-12T09:37:07Z

When creating secrets and a Kubernetes Job that uses those secrets in volumes, it is possible that the job is created before the secret has been synchronized. If the job required the content of the secret to be mounted, the file would be missing.

Also fixes usage of controllerutil.CreateOrUpdate.

Possibly related to kubernetes-sigs/secrets-store-csi-driver#1051.

Additional Information

The implementation of the annotations I didn't test. It only would be used in case of updates, which in turn would only happen if the reconciler was unable to complete the creation of the job after the secret was created, so that it would have to re-reconcile. That case should be fairly rare, but since it can happen, I have included this mechanism that would take care of that. In my tests I never hit the point where the secret could be retrieved but wasn't equal to the one that was created and/or updated.

On the other hand, hitting a case where the secret could not be fetched right after it's creation, I have encountered about every 10th attempt.

The tests that seem to be most affected by this change are the infra setup tests for git, helm and OCI. I noticed that because they all failed when I still used retry.RetryOnConflict which is not suitable for this case, it requires an error of type Conflict to trigger the back-off. Other errors end the attempts of retrying. Which I have replaced that functionality with retry.OnError.

Checklist

I have updated the documentation via a pull request in the
fleet-docs repository.

When creating secrets and a Kubernetes Job that uses those secrets in volumes, it is possible that the job is created before the secret has been synchronized. If the job required the content of the secret to be mounted, the file would be missing. Also fixes usage of `controllerutil.CreateOrUpdate`.

kkaempf added this to Fleet Nov 12, 2025

kkaempf added the kind/bug label Nov 12, 2025

kkaempf moved this to 👀 In review in Fleet Nov 12, 2025

p-se force-pushed the fleet-secret-volume-race branch from c007299 to bba4dd5 Compare November 12, 2025 11:09

p-se marked this pull request as ready for review November 12, 2025 11:51

p-se requested a review from a team as a code owner November 12, 2025 11:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix race when creating secrets and Kubernetes jobs #4319

Fix race when creating secrets and Kubernetes jobs #4319

Uh oh!

p-se commented Nov 12, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix race when creating secrets and Kubernetes jobs #4319

Are you sure you want to change the base?

Fix race when creating secrets and Kubernetes jobs #4319

Uh oh!

Conversation

p-se commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Additional Information

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

p-se commented Nov 12, 2025 •

edited

Loading