THREESCALE-14654 Fix status reconciler requeue logic by urbanikb · Pull Request #1177 · 3scale/3scale-operator

urbanikb · 2026-05-07T20:26:37Z

Fixes https://issues.redhat.com/browse/THREESCALE-14654

Summary

Fixes equalStatus && newAvailable guard: the unavailable-but-equal case fell through to a no-op status write on every reconcile. Changed to if equalStatus which short-circuits both available and unavailable steady states.
Switches both unavailable return paths from Requeue: true (immediate tight-loop) to RequeueAfter: 30s, giving components time to recover between checks and avoiding extending the backoff counter.

Test plan

Unit tests: go test ./controllers/apps/... -run TestAPIManagerStatusReconciler
TestAPIManagerStatusReconciler_Reconcile_requeueAfterWhenUnavailable verifies RequeueAfter is non-zero on True→False transition

Manual testing setup

Tested manually with following setup:

export NAMESPACE=3scale-test
make NAMESPACE=$NAMESPACE cluster/prepare/local

cat << EOF | oc create -f -
kind: Secret
apiVersion: v1
metadata:
  name: s3-credentials
  namespace: $NAMESPACE
data:
  AWS_ACCESS_KEY_ID: c29tZXRoaW5nCg==
  AWS_BUCKET: c29tZXRoaW5nCg==
  AWS_REGION: dXMtd2VzdC0xCg==
  AWS_SECRET_ACCESS_KEY: c29tZXRoaW5nCg==
type: Opaque
EOF

DOMAIN=$(oc get routes console -n openshift-console -o json | jq -r '.status.ingress[0].routerCanonicalHostname' | sed 's/router-default.//')
cat << EOF | oc create -f -
kind: APIManager
apiVersion: apps.3scale.net/v1alpha1
metadata:
  name: 3scale
  namespace: $NAMESPACE
spec:
  wildcardDomain: $DOMAIN
  system:
    fileStorage:
      simpleStorageService:
        configurationSecretRef:
          name: s3-credentials
  externalComponents:
    backend:
      redis: true
    system:
      database: true
      redis: true
EOF

sleep 120 && make run

After system provisions and stabilises, delete one of the route, to induce "Available: false", observe the logs for some time - the reconcile was triggered every 30s.

🤖 Co-authored with Claude Code

tkan145 · 2026-05-08T02:02:42Z

/retest

codecov-commenter · 2026-05-08T06:23:53Z

Codecov Report

❌ Patch coverage is 25.00000% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 43.98%. Comparing base (3e09410) to head (bf7312e).
⚠️ Report is 20 commits behind head on master.

Files with missing lines	Patch %	Lines
controllers/apps/apimanager_status_reconciler.go	25.00%	2 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1177      +/-   ##
==========================================
+ Coverage   43.16%   43.98%   +0.82%     
==========================================
  Files         203      204       +1     
  Lines       20885    20927      +42     
==========================================
+ Hits         9015     9205     +190     
+ Misses      11056    10920     -136     
+ Partials      814      802      -12

Flag	Coverage Δ
unit	`43.98% <25.00%> (+0.82%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
apis/apps/v1alpha1 (u)	`63.19% <ø> (+0.74%)`	⬆️
apis/capabilities/v1alpha1 (u)	`3.50% <ø> (ø)`
apis/capabilities/v1beta1 (u)	`20.21% <ø> (ø)`
controllers (i)	`12.08% <20.00%> (-0.01%)`	⬇️
pkg (u)	`63.69% <81.15%> (+1.30%)`	⬆️

Files with missing lines	Coverage Δ
controllers/apps/apimanager_status_reconciler.go	`73.43% <25.00%> (-0.58%)`	⬇️

... and 4 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

tkan145 · 2026-05-13T04:48:36Z

+	if equalStatus {
 		s.logger.V(1).Info("Status was not updated")
+		if !newAvailable {
+			return reconcile.Result{RequeueAfter: 30 * time.Second}, nil


As discussed, we now return reconcile.Result{RequeueAfter: 30 * time.Second}, nil, but inside apimanager_controller.go we only check Requeue, so the result is ignored.

if statusResult.Requeue { logger.Info("Reconciling not finished. Requeueing.") return statusResult, nil }

…us write Two issues in the previous requeue logic: 1. The guard `equalStatus && newAvailable` caused the unavailable-but-equal case to fall through to a status write even when nothing had changed, producing a no-op write on every reconcile while the instance remained unavailable. The new `if equalStatus` guard short-circuits both the available and unavailable steady states, avoiding the unnecessary write. 2. Both unavailable return paths used `Requeue: true` (immediate requeue), which causes tight-loop reconciliation against an instance that is still unavailable. Switching to `RequeueAfter: 30s` gives components time to recover between checks and avoids extending backoff counter. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

urbanikb · 2026-05-13T21:59:27Z

/retest

tkan145 · 2026-05-15T02:26:49Z

/retest

tkan145 · 2026-05-15T04:38:59Z

 		return specResult, nil
 	}

-	if statusResult.Requeue {


We only check the last one? reconcileAPIManagerStatus is called more than once

tkan145 · 2026-05-15T04:54:04Z

+		s.logger.V(1).Info("Status is different")
+		s.apimanagerResource.Status = *newStatus
+		if err := s.Client().Status().Update(s.Context(), s.apimanagerResource); err != nil {
+			return reconcile.Result{}, fmt.Errorf("failed to update status: %w", err)


We now ignore the Conflict error?

tkan145 · 2026-05-17T23:43:34Z

/retest

openshift-ci · 2026-05-18T01:04:08Z

@urbanikb: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/test-integration	`9a9dee8`	link	true	`/test test-integration`

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

urbanikb requested a review from a team as a code owner May 7, 2026 20:26

urbanikb changed the title ~~THREESCALE-14654 Fix status reconciler requeue logic~~ THREESCALE-14224 (THREESCALE-14654) Fix status reconciler requeue logic May 7, 2026

urbanikb changed the title ~~THREESCALE-14224 (THREESCALE-14654) Fix status reconciler requeue logic~~ THREESCALE-14654 Fix status reconciler requeue logic May 8, 2026

urbanikb force-pushed the THREESCALE-14654 branch from 5c2e21f to 5086f39 Compare May 8, 2026 05:15

urbanikb force-pushed the THREESCALE-14654 branch from 5086f39 to bf7312e Compare May 11, 2026 18:31

tkan145 requested changes May 13, 2026

View reviewed changes

urbanikb force-pushed the THREESCALE-14654 branch from bf7312e to 9a9dee8 Compare May 13, 2026 15:41

tkan145 requested changes May 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

THREESCALE-14654 Fix status reconciler requeue logic#1177

THREESCALE-14654 Fix status reconciler requeue logic#1177
urbanikb wants to merge 1 commit into
3scale:masterfrom
urbanikb:THREESCALE-14654

urbanikb commented May 7, 2026 •

edited

Loading

Uh oh!

tkan145 commented May 8, 2026

Uh oh!

codecov-commenter commented May 8, 2026 •

edited

Loading

Uh oh!

tkan145 May 13, 2026

Uh oh!

urbanikb commented May 13, 2026

Uh oh!

tkan145 commented May 15, 2026

Uh oh!

tkan145 May 15, 2026

Uh oh!

tkan145 May 15, 2026

Uh oh!

tkan145 commented May 17, 2026

Uh oh!

openshift-ci Bot commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

urbanikb commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Manual testing setup

Uh oh!

tkan145 commented May 8, 2026

Uh oh!

codecov-commenter commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

tkan145 May 13, 2026

Choose a reason for hiding this comment

Uh oh!

urbanikb commented May 13, 2026

Uh oh!

tkan145 commented May 15, 2026

Uh oh!

tkan145 May 15, 2026

Choose a reason for hiding this comment

Uh oh!

tkan145 May 15, 2026

Choose a reason for hiding this comment

Uh oh!

tkan145 commented May 17, 2026

Uh oh!

openshift-ci Bot commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

urbanikb commented May 7, 2026 •

edited

Loading

codecov-commenter commented May 8, 2026 •

edited

Loading