-
Notifications
You must be signed in to change notification settings - Fork 3.4k
HBASE-26441: Add metrics for BrokenStoreFileCleaner #3861
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
|
||
| /** | ||
| * Increment the deleted files counter | ||
| * @param deletes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nits: avoid empty javadoc element, it will lead to a checkstyle warning.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for pointing it out!
| String FAILED_DELETES_DESC = | ||
| "Number of files BrokenStoreFileCleaner tried but failed to delete"; | ||
| String RUNS = "BrokenStoreFileCleanerRuns"; | ||
| String RUNS_DESC = "Number of time the BrokenStoreFileCleaner chore run"; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Number of times?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice catch
| required uint64 runtime = 2; | ||
| optional uint64 deleted_files = 3; | ||
| optional uint64 failed_deletes = 4; | ||
| required uint32 runs = 5; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In the above metrics class we use long to runs, why here we change to use uint32 instead of uint64?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure how it ended up as uint32. Changing it to uint64.
| returns(GetLiveRegionServersResponse); | ||
|
|
||
| /** Report BrokenStoreFileCleaner chore metrics to master */ | ||
| rpc ReportBrokenStoreFileCleanerUsage(BrokenStoreFileCleanerUsageRequest) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it worth a separated method? How do we report the metrics for other background tasks on region server to master?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Usually we do not report metrics. I do this here for 2 reasons:
- Convenience. Aggregating the rs metrics is possible and usually done by queries from the db, storing aggregated metrics here is for ease of use.
- Preparing things for the next PR where I'll add logic to store the reported metrics in-memory and extend the REST service to allow checking these values without having to access the metrics db.
|
Any updates here? Thanks. |
…racking logic Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
…g from different store file tracker implementations (apache#3656) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
…iptor when creating table (apache#3666) Signed-off-by: Duo Zhang <zhangduo@apache.org>
… file tracker implementation (apache#3665) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
… tracker (apache#3681) Signed-off-by: Josh Elser <elserj@apache.org>
Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org> Reviewed-by: Josh Elser <elserj@apache.org>
apache#3721) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Josh Elser <elserj@apache.org>
…et method (apache#3774) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
… impl (apache#3749) Signed-off-by: Duo Zhang <zhangduo@apache.org>
…ations to TableDescriptor for existing tables (apache#3700) Signed-off-by: Duo Zhang <zhangduo@apache.org> Reviewed-by: Wellington Ramos Chevreuil <wchevreuil@apache.org>
…he#3786) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Josh Elser <elserj@apache.org> Signed-off-by: Wellington Ramos Chevreuil <wchevreuil@apache.org>
apache#3845) Signed-off-by: Duo Zhang <zhangduo@apache.org>
|
I've rebased HBASE-26067 to the newest master. For addressing the UT problem so we can run most of UTs in the nightly job. Please rebase the PR. Thanks. |
add rs metricsfor BrokenStoreFileCleaner aggregate the rs metrics on master for easier access
31a5852 to
e9a86ef
Compare
|
@Apache9 Sorry for the slow response I was sick |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
Oh, sorry... |
checkstlye fixes
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
…racking logic Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
…g from different store file tracker implementations (apache#3656) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
…iptor when creating table (apache#3666) Signed-off-by: Duo Zhang <zhangduo@apache.org>
… file tracker implementation (apache#3665) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
… tracker (apache#3681) Signed-off-by: Josh Elser <elserj@apache.org>
Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org> Reviewed-by: Josh Elser <elserj@apache.org>
apache#3721) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Josh Elser <elserj@apache.org>
…et method (apache#3774) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>
… impl (apache#3749) Signed-off-by: Duo Zhang <zhangduo@apache.org>
…ations to TableDescriptor for existing tables (apache#3700) Signed-off-by: Duo Zhang <zhangduo@apache.org> Reviewed-by: Wellington Ramos Chevreuil <wchevreuil@apache.org>
…he#3786) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Josh Elser <elserj@apache.org> Signed-off-by: Wellington Ramos Chevreuil <wchevreuil@apache.org>
apache#3845) Signed-off-by: Duo Zhang <zhangduo@apache.org>
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
@Apache9 The failing unit test is unrelated it works just fine for me. Do you have any other feedback or can we wrap this up? |
add rs metricsfor BrokenStoreFileCleaner
aggregate the rs metrics on master for easier access