fix: properly rewrite errors for watch api, part 2 #2656

miparnisari · 2025-10-28T01:00:02Z

This is a continuation of #2640

codecov · 2025-10-28T01:03:16Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 79.41%. Comparing base (45508c2) to head (8cb6c91).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2656      +/-   ##
==========================================
- Coverage   79.47%   79.41%   -0.06%     
==========================================
  Files         455      455              
  Lines       47156    47158       +2     
==========================================
- Hits        37471    37444      -27     
- Misses       6934     6954      +20     
- Partials     2751     2760       +9

vroldanbet · 2025-10-28T10:10:18Z

internal/services/shared/errors.go

 		return status.Errorf(codes.ResourceExhausted, "watch disconnected: %s", err)
+	case errors.As(err, &datastore.WatchRetryableError{}):
+		// FailedPrecondition is safe to retry
+		return status.Error(codes.FailedPrecondition, err.Error())


It seems like Unavailable would be a better status code here.

From https://grpc.io/docs/guides/status-codes/

The service is currently unavailable. This is most likely a transient condition, which can be corrected by retrying with a backoff. Note that it is not always safe to retry non-idempotent operations.

Whereas FailedPrecondition says this:

The operation was rejected because the system is not in a state required for the operation’s execution. For example, the directory to be deleted is non-empty, an rmdir operation is applied to a non-directory, etc. Service implementors can use the following guidelines to decide between FAILED_PRECONDITION, ABORTED, and UNAVAILABLE: (a) Use UNAVAILABLE if the client can retry just the failing call. (b) Use ABORTED if the client should retry at a higher level (e.g., when a client-specified test-and-set fails, indicating the client should restart a read-modify-write sequence). (c) Use FAILED_PRECONDITION if the client should not retry until the system state has been explicitly fixed. E.g., if an “rmdir” fails because the directory is non-empty, FAILED_PRECONDITION should be returned since the client should not retry unless the files are deleted from the directory.

miparnisari · 2025-10-29

Yeah, I only had one reason for using FailedPrecondition, which was that we didn't alarm on it 😆 I've updated it to use Unavailable now.

tstirrat15

LGTM

github-actions bot added the area/tooling Affects the dev or user toolchain (e.g. tests, ci, build tools) label Oct 28, 2025

miparnisari force-pushed the fix-watch-retryable branch from aa150c8 to 552a3c9 Compare October 28, 2025 01:07

miparnisari marked this pull request as ready for review October 28, 2025 01:11

miparnisari requested a review from a team as a code owner October 28, 2025 01:11

miparnisari force-pushed the fix-watch-retryable branch from 552a3c9 to f6447fa Compare October 28, 2025 01:17

vroldanbet reviewed Oct 28, 2025

miparnisari marked this pull request as draft October 28, 2025 18:40

miparnisari force-pushed the fix-watch-retryable branch from f6447fa to 2e70602 Compare October 29, 2025 01:47

miparnisari requested a review from vroldanbet October 29, 2025 18:01

miparnisari marked this pull request as ready for review October 29, 2025 18:02

miparnisari force-pushed the fix-watch-retryable branch from 2e70602 to 722bd3d Compare October 29, 2025 18:45

miparnisari enabled auto-merge October 30, 2025 00:47

fix: properly rewrite errors for watch api, part 2

8cb6c91

miparnisari force-pushed the fix-watch-retryable branch from 722bd3d to 8cb6c91 Compare October 30, 2025 22:10

tstirrat15 approved these changes Oct 30, 2025

miparnisari added this pull request to the merge queue Oct 30, 2025

Merged via the queue into main with commit e55404b Oct 30, 2025
80 of 82 checks passed

miparnisari deleted the fix-watch-retryable branch October 30, 2025 22:51

github-actions bot locked and limited conversation to collaborators Oct 30, 2025
