HDDS-12064. Optimize bootstrap logic to reduce loop while checking file links #7676
Thanks @swamirishi for the patch.
hadoop-ozone/ozone-manager/src/main/java/org/apache/hadoop/ozone/om/OMDBCheckpointServlet.java
@adoroszlai @prashantpogde if you don't have any issues with the patch, can you give me a +1?
The changes look good to me. Thank you for making these changes @swamirishi
Thanks for the review on the patch @prashantpogde @adoroszlai
@swamirishi please trim extended details when merging:
+1
What changes were proposed in this pull request?
Currently, while checking file links, the exclude SST file list and the files already present in the current tarball are checked by sequentially iterating through their entries, once for every file in the OM data directory (snapshot directory, active om.db, compaction backup SST files). If the exclude list or the list of files in the tarball is long (on the order of thousands) and the total number of SST files is on the order of millions, the bootstrap will hit read timeouts and might take hours. We need to avoid this unnecessary iteration, which makes the check an n^2 operation, and do it in O
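The general idea can be illustrated with a minimal sketch. This is not the code from the patch: the class `LinkCheckSketch`, its method names, and matching files by plain name are all assumptions made for illustration. It contrasts scanning both lists for every candidate file with building a hash set once and doing a constant-time lookup per file.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; names and matching-by-file-name are assumptions.
class LinkCheckSketch {

  // Slow variant: for every candidate file, both lists are scanned sequentially.
  // Cost grows with candidates.size() * (excluded.size() + inTarball.size()).
  static List<String> filesToCopySlow(List<String> candidates,
                                      List<String> excluded,
                                      List<String> inTarball) {
    List<String> result = new ArrayList<>();
    for (String file : candidates) {
      if (!excluded.contains(file) && !inTarball.contains(file)) {
        result.add(file);
      }
    }
    return result;
  }

  // Fast variant: both lists are loaded into one hash set up front, so each
  // candidate needs a single expected constant-time lookup.
  static List<String> filesToCopyFast(List<String> candidates,
                                      List<String> excluded,
                                      List<String> inTarball) {
    Set<String> known = new HashSet<>(excluded);
    known.addAll(inTarball);
    List<String> result = new ArrayList<>();
    for (String file : candidates) {
      if (!known.contains(file)) {
        result.add(file);
      }
    }
    return result;
  }

  public static void main(String[] args) {
    List<String> candidates = Arrays.asList("a.sst", "b.sst", "c.sst", "d.sst");
    List<String> excluded = Arrays.asList("b.sst");
    List<String> inTarball = Arrays.asList("d.sst");
    System.out.println(filesToCopyFast(candidates, excluded, inTarball));
  }
}
```

Both variants return the same files in the same order. Only the cost of each membership check changes, which matters once the candidate count reaches the millions.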
What is the link to the Apache JIRA?
https://issues.apache.org/jira/browse/HDDS-12064
How was this patch tested?
Modified existing unit tests.