Bug: dbCreated is false when db already exists #578

Closed
wants to merge 4 commits into from

@tanjinx tanjinx commented Jan 7, 2025

Description

dbCreated is set to false even when the db already exists, so every tablet start tries to create the db again. (Credit to @vmogilev.)

Related Issue(s)

Checklist

  • "Backport to:" labels have been added if this change should be back-ported
  • Tests were added or are not required
  • Did the new or modified tests pass consistently locally and on the CI
  • Documentation was added or is not required

Deployment Notes

@tanjinx tanjinx requested a review from a team as a code owner January 7, 2025 04:46
@github-actions github-actions bot added this to the v19.0.7 milestone Jan 7, 2025
@tanjinx tanjinx added bug Something isn't working upstream-me labels Jan 7, 2025
vmogilev previously approved these changes Jan 7, 2025
@tanjinx tanjinx force-pushed the fix-sidecardb-check branch from d11ee0a to 008a0eb Compare January 10, 2025 19:02

tanjinx commented Jan 11, 2025

Many test failures are related to InitShardPrimary:

0110 19:59:45.466702   20051 vtctlclient_process.go:213] Executing vtctlclient with command: vtctlclient --server localhost:16006 InitShardPrimary -- --force --wait_replicas_timeout 31s ks/0 zone1-4410 (attempt 1 of 10)
E0110 20:00:16.571859   20051 vtctlclient_process.go:58] error in InitShardPrimary output W0110 19:59:45.478667   22809 main.go:96] W0110 19:59:45.478483 server.go:2439] primary-elect tablet zone1-0000004410 is not the shard primary, proceeding anyway as -force was used
W0110 19:59:45.478703   22809 main.go:96] W0110 19:59:45.478525 server.go:2445] primary-elect tablet zone1-0000004410 is not a primary in the shard, proceeding anyway as -force was used
InitShardPrimary Error: rpc error: code = Unknown desc = tablet zone1-0000004411 InitReplica failed: rpc error: code = DeadlineExceeded desc = context deadline exceeded;tablet zone1-0000004412 InitReplica failed: rpc error: code = DeadlineExceeded desc = context deadline exceeded
E0110 20:00:16.570499   22809 main.go:105] remote error: rpc error: code = Unknown desc = tablet zone1-0000004411 InitReplica failed: rpc error: code = DeadlineExceeded desc = context deadline exceeded;tablet zone1-0000004412 InitReplica failed: rpc error: code = DeadlineExceeded desc = context deadline exceeded
, err exit status 1

So it looks like the fix has some side effect.
However, I deployed a build with this change and tried both InitShardPrimary and PRS, and neither of them failed.

tanjinx commented Jan 15, 2025

We found that the long reparenting time was not caused by this bug, and upstream has filed an issue for it: vitessio/vitess#17521. Going to close this one.

@tanjinx tanjinx closed this Jan 15, 2025