fixes multiple upgrade bugs found when testing #4717

Merged
keith-turner merged 3 commits into apache:elasticity on Jul 2, 2024

keith-turner
Contributor

Fixed a bug where upgrade code was attempting to read the manager's service lock before it was set.

Fixed a bug with detecting fate operations. The code to detect fate operations would hang trying to read the fate table, which did not exist yet.

Scans of the metadata table running as the !SYSTEM user would fail if no auths were specified. The !SYSTEM user has no auths set in ZooKeeper, and an exception would happen when trying to find them.

A new ZooKeeper node related to compaction changes needed to be created.

Simplified code for setting tablet availability by using DataLevel.
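
To illustrate the !SYSTEM auths problem described above, here is a minimal sketch of one way such a guard could look. It is not the change made in this PR: the class, the `authsForScan` method, the `SYSTEM_USER` constant, and the `storedAuths` map (standing in for per-user auths kept in ZooKeeper) are all hypothetical names used only for illustration.

```java
import java.util.Collections;
import java.util.Map;
import java.util.Set;

public class SystemUserAuthsSketch {
  // Hypothetical constant standing in for the internal system user name.
  static final String SYSTEM_USER = "!SYSTEM";

  /**
   * Resolves the auths to use for a scan. storedAuths is a hypothetical stand-in
   * for the per-user auths stored in ZooKeeper.
   */
  static Set<String> authsForScan(String user, Set<String> requested,
      Map<String, Set<String>> storedAuths) {
    // Auths were given explicitly, so no lookup is needed.
    if (requested != null && !requested.isEmpty()) {
      return requested;
    }
    // The system user has no stored auths; skip the lookup that would fail.
    if (SYSTEM_USER.equals(user)) {
      return Collections.emptySet();
    }
    Set<String> stored = storedAuths.get(user);
    if (stored == null) {
      throw new IllegalStateException("No auths stored for user " + user);
    }
    return stored;
  }
}
```

With this shape, a scan as the system user with no auths falls back to an empty set instead of hitting the failing lookup, while other users still go through the stored auths.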

@keith-turner keith-turner linked an issue Jul 1, 2024 that may be closed by this pull request
@keith-turner
Contributor Author

These bugs were found running #4718 and #4719

// as not tablets are assigned when this is called. The Fate code is not used to read from
// zookeeper below because the serialization format changed in zookeeper, that is why a direct
// read is performed.
if (!context.getZooReader().getChildren(context.getZooKeeperRoot() + Constants.ZFATE)
Member

Ah, I was briefly looking over this and was wondering why it was only looking at ZK... This makes sense, good catch. I haven't looked at the other classes, but the changes for this class LGTM.

Contributor Author

The upgrade from 4.0 to 4.1 will probably need to look at the new fate table, which will be tricky. Will open an issue for that now so it's not forgotten.

Contributor Author

Opened #4721
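
For readers following this thread, the direct ZooKeeper check under discussion can be sketched roughly as below. This is an illustration only, not the code in UpgradeCoordinator: the class and method names are hypothetical, and `childLister` is a stand-in for whatever lists the children of a ZooKeeper node. The root and fate paths are passed in rather than assumed.

```java
import java.util.List;
import java.util.function.Function;

public class FateZkCheckSketch {
  /**
   * Returns true if any child nodes exist under the fate path in ZooKeeper.
   * Only child names are listed; the serialized operation data is never read,
   * so neither the fate table nor the serialization format is involved.
   * childLister is a hypothetical stand-in for a ZooKeeper children lookup.
   */
  static boolean hasFateOperations(Function<String, List<String>> childLister,
      String zkRoot, String fatePath) {
    List<String> children = childLister.apply(zkRoot + fatePath);
    return children != null && !children.isEmpty();
  }
}
```

An upgrade step could refuse to proceed when this returns true. Because only the presence of children matters, the check works before any tablets are assigned.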

…de/UpgradeCoordinator.java

Co-authored-by: Dave Marion <[email protected]>
@keith-turner keith-turner merged commit f6909cd into apache:elasticity Jul 2, 2024
8 checks passed
@keith-turner keith-turner deleted the upgrade-fixes branch July 2, 2024 17:10
@keith-turner keith-turner added this to the 4.0.0 milestone Jul 12, 2024
Development

Successfully merging this pull request may close these issues.

Implement upgrade from 3.1 to 4.0
3 participants