[bugfix](be_metrics) update scan bytes metric correctly#52232
Merged
yiguolei merged 1 commit intoapache:masterfrom Jul 1, 2025
Merged
[bugfix](be_metrics) update scan bytes metric correctly#52232yiguolei merged 1 commit intoapache:masterfrom
yiguolei merged 1 commit intoapache:masterfrom
Conversation
Contributor
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
Contributor
Author
|
run buildall |
Contributor
Author
|
run buildall |
Contributor
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
Contributor
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
Contributor
Author
|
run buildall |
TPC-H: Total hot run time: 33866 ms |
TPC-DS: Total hot run time: 185065 ms |
ClickBench: Total hot run time: 29.06 s |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
morningman
approved these changes
Jul 1, 2025
Contributor
|
PR approved by at least one committer and no changes requested. |
Contributor
|
PR approved by anyone and no changes requested. |
zclllyybb
approved these changes
Jul 1, 2025
yiguolei
added a commit
to yiguolei/incubator-doris
that referenced
this pull request
Jul 1, 2025
1. unify scanbytes and scan rows definition in audit log,doris metrics,profile。 scan bytes is the uncompressed data read from the file + from the page cache. scan rows is the raw rows read from the file. 2. add update realtime counter interface in scanner, so other scanner type could implement this interface to get these counters realtime. 3. the uncompressed data read in page io is wrong. 4. cputimer counter is not updated correctly. DO NOT merge this pr to 3.0 or 3.1, because there is some behavior change for scan bytes. If user has config workload group policy or some monitor on scan bytes metric, there will be something wrong.
morrySnow
pushed a commit
that referenced
this pull request
Jul 1, 2025
koarz
pushed a commit
to koarz/doris
that referenced
this pull request
Jul 4, 2025
### What problem does this PR solve? 1. unify scanbytes and scan rows definition in audit log,doris metrics,profile。 scan bytes is the uncompressed data read from the file + from the page cache. scan rows is the raw rows read from the file. 2. add update realtime counter interface in scanner, so other scanner type could implement this interface to get these counters realtime. 3. the uncompressed data read in page io is wrong. 4. cputimer counter is not updated correctly. DO NOT merge this pr to 3.0 or 3.1, because there is some behavior change for scan bytes. If user has config workload group policy or some monitor on scan bytes metric, there will be something wrong.
koarz
pushed a commit
to koarz/doris
that referenced
this pull request
Jul 4, 2025
### What problem does this PR solve? 1. unify scanbytes and scan rows definition in audit log,doris metrics,profile。 scan bytes is the uncompressed data read from the file + from the page cache. scan rows is the raw rows read from the file. 2. add update realtime counter interface in scanner, so other scanner type could implement this interface to get these counters realtime. 3. the uncompressed data read in page io is wrong. 4. cputimer counter is not updated correctly. DO NOT merge this pr to 3.0 or 3.1, because there is some behavior change for scan bytes. If user has config workload group policy or some monitor on scan bytes metric, there will be something wrong.
koarz
pushed a commit
to koarz/doris
that referenced
this pull request
Jul 4, 2025
### What problem does this PR solve? 1. unify scanbytes and scan rows definition in audit log,doris metrics,profile。 scan bytes is the uncompressed data read from the file + from the page cache. scan rows is the raw rows read from the file. 2. add update realtime counter interface in scanner, so other scanner type could implement this interface to get these counters realtime. 3. the uncompressed data read in page io is wrong. 4. cputimer counter is not updated correctly. DO NOT merge this pr to 3.0 or 3.1, because there is some behavior change for scan bytes. If user has config workload group policy or some monitor on scan bytes metric, there will be something wrong.
16 tasks
16 tasks
morningman
pushed a commit
that referenced
this pull request
Jul 28, 2025
…53729) ### What problem does this PR solve? Problem Summary: ### Release note The external part is implemented according to the framework defined by the unified audit log, Doris metrics, and scanbytes and scan rows in Profile in #52232. However, - Scan bytes in the external table currently represents the bytes counted by **the top-level File Reader** called by the scan reader layer. - Scan rows represents the number of scanned rows of the underlying storage. The number of scanned rows of parquet/orc does not include the number of rows of skipped page/rowgroup. **Note: However, there is still a problem that `jni_reader` has not yet implemented the number of rows that only contain the storage to be read.**
kaka11chen
added a commit
to kaka11chen/doris
that referenced
this pull request
Aug 1, 2025
…pache#53729) Problem Summary: The external part is implemented according to the framework defined by the unified audit log, Doris metrics, and scanbytes and scan rows in Profile in apache#52232. However, - Scan bytes in the external table currently represents the bytes counted by **the top-level File Reader** called by the scan reader layer. - Scan rows represents the number of scanned rows of the underlying storage. The number of scanned rows of parquet/orc does not include the number of rows of skipped page/rowgroup. **Note: However, there is still a problem that `jni_reader` has not yet implemented the number of rows that only contain the storage to be read.**
kaka11chen
added a commit
to kaka11chen/doris
that referenced
this pull request
Aug 1, 2025
…pache#53729) Problem Summary: The external part is implemented according to the framework defined by the unified audit log, Doris metrics, and scanbytes and scan rows in Profile in apache#52232. However, - Scan bytes in the external table currently represents the bytes counted by **the top-level File Reader** called by the scan reader layer. - Scan rows represents the number of scanned rows of the underlying storage. The number of scanned rows of parquet/orc does not include the number of rows of skipped page/rowgroup. **Note: However, there is still a problem that `jni_reader` has not yet implemented the number of rows that only contain the storage to be read.**
zhiqiang-hhhh
pushed a commit
to zhiqiang-hhhh/doris
that referenced
this pull request
Aug 4, 2025
…pache#53729) ### What problem does this PR solve? Problem Summary: ### Release note The external part is implemented according to the framework defined by the unified audit log, Doris metrics, and scanbytes and scan rows in Profile in apache#52232. However, - Scan bytes in the external table currently represents the bytes counted by **the top-level File Reader** called by the scan reader layer. - Scan rows represents the number of scanned rows of the underlying storage. The number of scanned rows of parquet/orc does not include the number of rows of skipped page/rowgroup. **Note: However, there is still a problem that `jni_reader` has not yet implemented the number of rows that only contain the storage to be read.**
Merged
16 tasks
freemandealer
added a commit
to freemandealer/doris
that referenced
this pull request
Sep 11, 2025
forget update profile counters in update_realtime_counters bug introduced in apache#52232 Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>
16 tasks
freemandealer
added a commit
to freemandealer/doris
that referenced
this pull request
Sep 11, 2025
forget update profile counters in update_realtime_counters bug introduced in apache#52232 Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>
dataroaring
pushed a commit
that referenced
this pull request
Sep 14, 2025
forget update profile counters in update_realtime_counters. bug introduced in #52232
freemandealer
added a commit
to freemandealer/doris
that referenced
this pull request
Oct 30, 2025
…55929) forget update profile counters in update_realtime_counters. bug introduced in apache#52232
freemandealer
added a commit
to freemandealer/doris
that referenced
this pull request
Oct 30, 2025
…55929) forget update profile counters in update_realtime_counters. bug introduced in apache#52232
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What problem does this PR solve?
DO NOT merge this pr to 3.0 , because there is some behavior change for scan bytes. If user has config workload group policy or some monitor on scan bytes metric, there will be something wrong.
Release note
None
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)