[feat](catalog) Support OneLake #57933
Merged
CalvinKirs merged 4 commits into apache:master on Nov 14, 2025
run buildall
TPC-H: Total hot run time: 34220 ms
TPC-DS: Total hot run time: 188940 ms
ClickBench: Total hot run time: 27.38 s
run buildall
TPC-H: Total hot run time: 35029 ms
TPC-DS: Total hot run time: 187980 ms
ClickBench: Total hot run time: 27.42 s
run buildall
morningman approved these changes Nov 14, 2025
PR approved by at least one committer and no changes requested.
PR approved by anyone and no changes requested.
TPC-H: Total hot run time: 34587 ms
TPC-DS: Total hot run time: 187671 ms
ClickBench: Total hot run time: 27.48 s
CalvinKirs added a commit to CalvinKirs/incubator-doris that referenced this pull request Dec 1, 2025
CalvinKirs added a commit to CalvinKirs/incubator-doris that referenced this pull request Dec 1, 2025
yiguolei pushed a commit that referenced this pull request Dec 2, 2025
nagisa-kunhah pushed a commit to nagisa-kunhah/doris that referenced this pull request Dec 14, 2025
Microsoft OneLake, part of the Fabric ecosystem, is a unified and open SaaS data lake that provides organizations with a centralized logical data storage layer. Data in OneLake is stored in Parquet format and can simultaneously maintain Delta Lake and Apache Iceberg metadata. This design allows multiple analytics engines to access shared datasets directly, without data duplication or migration, which simplifies data management and governance.

Through Apache Doris’s Iceberg REST Catalog, users can query and analyze data stored in OneLake in place, without copying or moving it.

This integration enables building end-to-end analytics pipelines on a single data lake, combining OneLake’s unified storage and governance capabilities with Doris’s high-performance analytical compute.

At the technical level, Doris reads Iceberg metadata from OneLake through the Iceberg REST interface and reads the underlying Parquet files through the OneLake storage endpoint. Governance, access control, and security policies remain centralized in OneLake.

```
CREATE CATALOG onelake_doris PROPERTIES (
    'type' = 'iceberg',
    'iceberg.catalog.type' = 'rest',
    'uri' = 'https://onelake.table.fabric.microsoft.com/iceberg',
    'warehouse' = '<workspace_id>/<data_item_id>',
    'iceberg.rest.security.type' = 'oauth2',
    'iceberg.rest.oauth2.server-uri' = 'https://login.microsoftonline.com/<tenant_id>/oauth2/v2.0/token',
    'iceberg.rest.oauth2.credential' = '<oauth2.client_id>:<oauth2.client_secret>',
    'iceberg.rest.oauth2.scope' = 'https://storage.azure.com/.default',
    'fs.azure.support' = 'true',
    'azure.endpoint' = 'https://onelake.dfs.fabric.microsoft.com',
    'azure.auth_type' = 'OAuth2',
    'azure.oauth2_account_host' = 'onelake.dfs.fabric.microsoft.com',
    'azure.oauth2_server_uri' = 'https://login.microsoftonline.com/<tenant_id>/oauth2/v2.0/token',
    'azure.oauth2_client_id' = '<oauth2.client_id>',
    'azure.oauth2_client_secret' = '<oauth2.client_secret>'
);
```
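Once the `onelake_doris` catalog has been created, it should be usable like any other external catalog in Doris. The following is a minimal sketch rather than part of this PR: the database name `sales_db` and table name `orders` are hypothetical placeholders for whatever Iceberg tables exist in the OneLake data item.

```sql
-- Make the OneLake catalog the current catalog for this session.
SWITCH onelake_doris;

-- List the databases (Iceberg namespaces) exposed by the REST catalog.
SHOW DATABASES;

-- Query a table with a fully qualified name.
-- sales_db and orders are hypothetical; substitute real names.
SELECT * FROM onelake_doris.sales_db.orders LIMIT 10;
```

Using the fully qualified `catalog.database.table` form lets a single query join OneLake tables with tables in Doris's internal catalog without switching catalogs.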