[fix](cloud) fix routine load job progress cache incorrect in cloud mode #39313
Thank you for your contribution to Apache Doris. Since 2024-03-18, the documentation has been moved to doris-website.
run buildall
TPC-H: Total hot run time: 40003 ms
TPC-DS: Total hot run time: 186090 ms
ClickBench: Total hot run time: 31.16 s
LGTM
PR approved by at least one committer and no changes requested.
PR approved by anyone and no changes requested.
LGTM
…ode (apache#39313) The routine load job progress cache can become incorrect in cloud mode in the following scenario:
1. The schedule thread starts updating the cloud progress and reads the old transaction value from FDB.
2. A routine load task commits its transaction and updates the progress cache.
3. The update-cloud-progress RPC returns and sets the progress back to the old value, which is incorrect.
This PR fixes the problem, which can occur in storage-compute separation mode, by not allowing smaller values to overwrite larger ones.
…46149) In cloud mode, routine load loses data when the FE master node restarts. PR #39313 was introduced so that, when progress is updated, smaller values cannot overwrite larger ones. Routine load replays its progress metadata by first obtaining the configured default offset and then pulling metadata from the meta service to update the local value. With that PR in place, if the value pulled from the meta service is not larger than the default offset, the correct value is never assigned in memory. To solve this, pull the metadata from the meta service first on restart, and decide based on the pulled value whether to obtain the default offset from Kafka.
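The restart ordering described in the follow-up commit can be sketched roughly as below. This is only an illustration under stated assumptions, not the actual Doris code: `ReplayOffsetSketch`, `resolveStartOffset`, and the use of an empty `OptionalLong` to mean "the meta service has no recorded progress" are all hypothetical, and the real decision rule in the fix may differ.

```java
import java.util.OptionalLong;
import java.util.function.LongSupplier;

/** Hypothetical sketch of choosing the in-memory offset after an FE restart. */
class ReplayOffsetSketch {
    /**
     * @param pulledFromMetaService progress pulled from the meta service; empty is
     *        assumed here to mean that no progress has been recorded yet
     * @param defaultOffsetFromKafka lazily resolves the configured default offset,
     *        so Kafka is only consulted when the pulled value is absent
     */
    static long resolveStartOffset(OptionalLong pulledFromMetaService,
                                   LongSupplier defaultOffsetFromKafka) {
        if (pulledFromMetaService.isPresent()) {
            // Persisted progress is used directly, even if it is smaller than
            // the default offset would have been.
            return pulledFromMetaService.getAsLong();
        }
        return defaultOffsetFromKafka.getAsLong();
    }
}
```

Because the persisted value is consulted before any default is loaded into memory, the "larger value wins" rule from #39313 never gets a chance to discard it.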
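The routine load job progress cache can become incorrect in cloud mode in the following scenario:
1. The schedule thread starts updating the cloud progress and reads the old transaction value from FDB.
2. A routine load task commits its transaction and updates the progress cache.
3. The update-cloud-progress RPC returns and sets the progress back to the old value, which is incorrect.
This PR fixes the problem, which can occur in storage-compute separation mode, by not allowing smaller values to overwrite larger ones.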
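A minimal sketch of the "larger value wins" rule, replaying the race from the scenario above. The class and method names (`ProgressCacheSketch`, `update`, `get`) and the partition-to-offset map are assumptions made for illustration; they are not taken from the Doris code base.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical stand-in for a routine load job's cached per-partition progress. */
class ProgressCacheSketch {
    // Kafka partition id -> cached offset (illustrative layout).
    private final Map<Integer, Long> partitionToOffset = new ConcurrentHashMap<>();

    /** Applies an update but keeps the larger offset, so a stale value cannot move progress backwards. */
    void update(int partition, long offset) {
        partitionToOffset.merge(partition, offset, Long::max);
    }

    /** Returns the cached offset, or -1 if the partition has no entry. */
    long get(int partition) {
        return partitionToOffset.getOrDefault(partition, -1L);
    }

    public static void main(String[] args) {
        ProgressCacheSketch cache = new ProgressCacheSketch();
        cache.update(0, 100L);

        // Step 1: the schedule thread reads the old value while its RPC is in flight.
        long staleValueFromRpc = 100L;
        // Step 2: a task commits its transaction and advances the cache.
        cache.update(0, 200L);
        // Step 3: the RPC returns with the old value; the merge ignores it.
        cache.update(0, staleValueFromRpc);

        System.out.println(cache.get(0)); // stays at 200
    }
}
```

Doing the comparison inside `merge` keeps the check and the write atomic per key, which matters because the schedule thread and the task commit path update the cache concurrently.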