Abstract
Mapping air pollution at high spatial resolution is essential for understanding, managing, and mitigating the adverse impacts of air pollution. Current air pollution monitoring approaches suffer from limited spatial coverage and resolution. Artificial intelligence holds great promise for tackling these challenges, yet its application in air pollution monitoring remains nascent, facing limited transferability regarding low-quality labeled and non-uniform spread data. Here, we introduce Height-Field Signed Distance Function (HF-SDF), an innovative 3D implicit representation, to reconstruct air pollution concentration maps from coarse, incomplete data, which achieves both extensive spatial coverage and fine-scale results with powerful transferability. HF-SDF learns a continuous and transferable mapping model that integrates an auto-decoder network with a geometric constraint, offering flexible resolution. The evaluation uses reanalysis data and satellite observations, reaching accuracy rates of 96% and 91%, respectively. HF-SDF reveals immense promise in advancing air pollution monitoring by offering insights into the spatial heterogeneity of pollution distributions.
Similar content being viewed by others
Introduction
Deteriorating air quality is responsible for more than 7 million premature deaths per year globally, as identified by the World Health Organization1,2,3. Monitoring air pollutant distribution at high spatial resolution provides essential insights into understanding pollution patterns4, assessing public health risks5, and developing and implementing targeted policies to safeguard both environment and public health6,7,8,9,10.
Current air pollution measurement methods are subject to limitations and uncertainties, which hinder our understanding of air pollution distributions at high spatial resolutions4,11,12. Ground station-based measurements offer precise measurements. However, their implementation is constrained by substantial construction expenses, resulting in a restricted deployment of monitoring stations over extensive spatial coverage, especially in suburban areas13,14,15. On the other hand, satellite-based measurements provide extensive spatial coverage and cost-effective measurement, with substantial benefits in the areas where ground-based stations are impractical, thereby bridging the measurement gaps16,17. However, satellite-based measurement encounters limitations in accuracy and spatial resolution (e.g., several to tens of kilometers)11. Additionally, satellite observations are susceptible to interference from cloud cover and other environmental factors, leading to a lower-than-expected availability of data18. To conclude, the present measurements are limited by low-quality labeled data and uneven spatial coverage, posing significant challenges to high-resolution air pollution monitoring.
Currently, two kinds of approaches have been developed to monitor and reconstruct air pollution concentrations at fine scales. The physical knowledge-based methods19,20,21 assess the connection between geographic predictors (e.g., nightlight, road network, land use map) and air pollutants based on the available measurement data. The air pollution concentrations at locations lacking actual measurements can be predicted based on regression analysis22, enabling the derivation of a detailed spatial distribution of air pollution. The complex interplay among different predictors poses a challenge for physical knowledge-based models to map air pollution distributions accurately. On the other hand, AI-based methods are gaining popularity by offering a compelling avenue for capturing the nonlinear relationship between handcrafted predictors and regional air pollution distribution. Some approaches23,24 integrate satellite data with deep learning and physical laws to investigate the connection between air pollution concentration and regional predictors, generating detailed concentration maps. However, these solutions rely on high-quality labeled data and lack transferability—the ability to adapt learned information to unseen regions and related air pollutants—forcing researchers to focus on specific regions and tailor predictors to local conditions, as patterns may vary significantly across spatial contexts. These limitations restrict their applicability to a narrow range of scenarios.
Confronting the above challenges from data and transferability perspectives, to our knowledge for the first time, we introduce a powerful and transferable implicit representation, conceptualized as Height-Field Signed Distance Function (HF-SDF), to monitor air pollution with high spatial adaptability, fine-scale resolution, and comprehensive pollutant coverage even with noisy and low-resolution data (e.g., satellite observations). The key insight is that the continuous spatial distribution of air pollution can be viewed as an unfolded, irregular 3D surface by interpreting pollutant concentration as “height” in an additional dimension. The shape of this surface is then reconstructed by learning HF-SDF functions. Instead of struggling with intricate mapping of air pollution based on various hand-crafted geographic predictors, this new implicit representation, implemented through a self-supervised model, offers a powerful paradigm for integrating 3D geometric features to effectively reconstruct the continuous spatial distribution of air pollutants from low-quality labeled and heterogeneously scattered observations. By avoiding interference from redundancy and uncertainty in various localized predictors, HF-SDF achieves strong transferability and maximizes the network’s ability to capture potential correlations within available observations at minimal cost while producing reliable estimations for unmeasured areas. In this paper, we present the formulation of HF-SDF and further integrate it into an auto-decoder network to support the continuous modeling of air pollution. To validate our new implicit representation, we examined the proposed approach on the reanalysis dataset (TAP: Tracking Air Pollution in China) and raw satellite observations (TROPOMI: TROPOspheric Monitoring Instrument), focusing on its ability to generate high-resolution maps, robustness to sparse or regionally missing data, transferability to related pollutant species, and applicability to real-world observations.
Results
Reconstructing air pollution map continuously
To showcase the capability of high-resolution mapping, we reconstruct PM2.5 data across China based on TAP data at 10 km resolution. For comparison, both the TAP maps and our results are presented in Fig. 1. The experimental results demonstrate that our approach enables an extremely fine-scale mapping of PM2.5, zooming in from the regional scale to the detailed level of cities or towns. The experiment reports an R value of 0.992, an IOA of 0.996, a MAB of 1.181 μg/m3, and an RMSE of 2.117 μg/m3. Further, more details of the experimental setting and a visualization of the absolute bias are provided in Appendix B.1.
In (a) the TAP PM2.5 distribution within China is illustrated at a resolution of 10 km on November 3rd, 2023. For panels (b) (c) and (d) the upper images depict the TAP PM2.5 distribution in Central China, Hubei Province, and Wuhan City, respectively, at a 10 km resolution. The lower images provide detailed patterns of the estimated PM2.5 distribution in the corresponding areas of Central China, Hubei Province, and Wuhan City.
Reconstructing air pollution maps based on sparse observations
In this section, reconstructions are performed based on low-resolution inputs downsampled at various scales. Specifically, the tested input resolutions include 1 km, 5 km, 10 km, 20 km, 30 km, and 40 km PM2.5 data from TAP. Figure 2 shows that our reconstructions exhibit detailed spatial distribution patterns that consistently align with the diverse resolution inputs. The R and IOA values range from 0.97 to 0.99 and 0.98 to 0.99, respectively, while the MAB and RMSE remain lower than 2.63 μg/m3 and 3.93 μg/m3, respectively. Despite these promising qualitative results, an increasing disparity between our reconstructions and the ground truth (TAP 1 km) can be seen in the associated line chart and boxplot in Fig. 2. The assessment of reconstructed results provides insights into the robustness of HF-SDF for sparse data reconstruction. Additional details on the experimental settings and comparison results with Kriging interpolation25 are provided in Appendix B.2.
In (a) the PM2.5 distribution around the Sichuan Basin from the TAP 1 km dataset was downscaled at various scales, where DS 5 km, DS 10 km, DS 20 km, DS 30 km, DS 40 km illustrate downsampled inputs at resolutions of 5 km, 10 km, 20 km, 30 km, and 40 km, respectively. Panel (b) illustrates our reconstruction results at a resolution of 1 km, inferred from TAP 1 km, DS 5 km, DS 10 km, DS 20 km, DS 30 km, and DS 40 km, respectively. Panel (c) illustrates the difference between our 1 km resolution reconstructions and the data from TAP 1 km. Chart (d) shows two groups of R and IOA: Ours 1 km vs. TAP 1 km and Ours vs. Downsampled Input. Chart (e) illustrates two groups of distribution of reconstruction error: Ours 1 km vs. TAP 1 km and Ours vs. Downsampled Input. See Appendix B.2 for more details.
Reconstructing missing TAP data
To validate the robustness of our reconstruction method in handling unmeasured areas, we evaluate its performance in air pollution reconstruction with varying levels of missing data. In this experiment, we randomly mask five locations (circles in Fig. 3) on an air pollution map to simulate missing data. These masked maps are then used as inputs for high-resolution reconstruction. To assess the effectiveness of our method, we set five different levels of data loss, indicated by the size of the masked circles.
Panels (a–e) show reconstruction outcomes for maps with unmeasured areas of radii 0.05°, 0.15°, 0.25°, 0.35°, and 0.45°. GAP1 to GAP5 denote these unmeasured areas. Each panel displays (left to right): the input map, high-resolution output, complete map (GT), and bias visualization (CDF). See Supporting Information Appendix B.3 for more details.
Figure 3 summarizes the results of our method in addressing missing data, influenced by the scale of the unmeasured area and the connectivity of surrounding observations. Regarding the scale, for each unmeasured area, the CDF visualizations in Fig. 3 show that our reconstructions closely match the original distribution for smaller unmeasured areas (radius of 0.05°, 0.15°, and 0.25°), reporting bias lower than 7.5 μg/m3. However, the maximal bias of reconstruction increases to 15 μg/m3 as the scale of the unmeasured areas expands to 0.25° and grows further with larger areas. In terms of surrounding observations, analyzing the CDF line charts in Fig. 3 vertically reveals the following insights: 1. Similarity of Surrounding Concentration Patterns: If the surrounding concentration patterns share high similarity with the unmeasured area, the reconstructions are likely to have better quality. Specifically, GAP2, a moderately polluted area, and GAP4, a low-pollution area, both provide well-reconstructed results regardless of the scale of missing data. In contrast, GAP3, representing a heavily polluted spot, shows rapid degradation in reconstruction quality as the scale of the unmeasured area expands, with decreasing similarity. Essentially, the reconstruction quality depends on the specific relevance between the unmeasured distribution and accessible surrounding measurements. 2. Variability of Surrounding Observations: If the surrounding observations vary greatly in different directions, the reconstruction tends to yield more acceptable results. GAP1 and GAP5 consistently provide high-quality reconstructions, although the quality may slightly degrade with increasing unmeasured areas. Additionally, the reconstructions of regions without missing data consistently exhibit high quality. Further details of the experimental settings and comparison results are provided in Appendix B.3.
Reconstructing air pollution maps on unseen data
In this section, we utilize the coded map network to evaluate the generalization capability of HF-SDF within trained regions, as well as its transferability to entirely unseen locations and different air pollutants. Details regarding the dataset used for model training and testing, as well as model performance, are summarized in Table 1. The training set comprises PM2.5 data (TAP) collected from 2021 to 2022 in six locations: Chengdu (CD), Bazhou (BZ), Taiyuan (TY), Changsha (CS), Xining (XN), and Longnan (LN), each covering a geographic area of 10° × 10°. For additional details, refer to the Supporting Information (Table A7). To address concerns of homogeneity, the training dataset includes 1080 instances of concentration maps, randomly sampled from 180 days between 2021 and 2022 for each location. Subsequently, test sets, which include different regions, times, and air pollutants, were utilized to assess the model’s generalization capability and transferability. For more details of the experimental settings, see Appendix B.4.
As summarized in Table 1, the model’s performance on the training set demonstrates its ability to effectively learn and fit the data, exhibiting high consistency with an R value of 0.96 and an IOA of 0.98. Regarding generalization, the model achieved a remarkable level of agreement when tested across different periods (2023) and regions (Yinchuan (YC)), with R values of 0.95 and 0.93, and IOA values of 0.97 and 0.95, respectively. This strongly indicates that HF-SDF is capable of producing fine-scale reconstructions of PM2.5 in previously unseen scenarios. In terms of transferability, the validation conducted on test sets that encompass various air pollutants related to PM2.5, including \({{\rm{SO}}}_{4}^{2-}\), \({{\rm{NO}}}_{3}^{-}\), and \({{\rm{NH}}}_{4}^{+}\), confirms the model’s superior capability in transferring knowledge across different types of pollutants. The R and IOA values for these pollutants range from 0.62 to 0.72. In particular, this variation in performance likely arises from the model, which was trained on PM2.5, not fully capturing pollutant-specific patterns present in these unseen datasets. Nonetheless, our findings suggest that with appropriate fine-tuning tailored to the characteristics of different pollutants, the model’s reconstruction accuracy could be significantly enhanced.
Reconstructing air pollution maps from satellite data: take the TROPOMI NO2 column as an example
Satellites provide a global perspective on air quality monitoring; however, limitations such as low spatial resolution and unexpected data gaps reduce data availability. As discussed above, HF-SDF facilitates the continuous reconstruction of air pollution data despite these challenges, enabling effective reconstruction of satellite data. Here, our method is employed to estimate NO2 maps for both a local area (35° N to 45° N, 115° E to 125° E) and a regional area (18° N to 54° N, 72° E to 136° E) on daily and 10-day scales using TROPOMI data, addressing the challenges of sparse, noisy, and unexpectedly missing satellite data. For local reconstruction (Fig. 4a), our continuous reconstruction is performed at various resolutions (10 km, 2 km, and 1 km), achieving an R value of 0.983 and an IOA of 0.991. These results highlight the effectiveness of HF-SDF in overcoming challenges related to real-world air pollution mapping. For the daily regional reconstruction (Fig. 4b), the experimental results demonstrate that our method successfully captures detailed patterns of air pollutants, significantly improving resolution to a fine degree, with an R value of 0.978 and an IOA of 0.989. While HF-SDF shows strong promise for high-resolution satellite mapping, we acknowledge that extensive missing observations can lead to reconstruction challenges. For example, in experiments using single-day TROPOMI data (see Appendix B.5), the reliability of reconstructions in areas with extremely limited data is compromised. In such cases, additional information, such as temporal features or data from other sensors, would be necessary to enhance inference and support robust reconstructions. Additional details of the experimental settings are provided in Appendix B.5.
The top row illustrates the projection of satellite data alongside our continuous reconstruction at various resolutions: a.1 presents raw NO2 data from TROPOMI; a.2 showcases our reconstruction projected at a resolution of 10 km; a.3 displays our reconstruction projected at a resolution of 2 km; a.4 exhibits our reconstruction projected at a resolution of 1 km; and a.5 visualizes the reconstructed 3D concentration surface. The bottom row depicts NO2 concentration maps over China on the 10-day scale. b.1 Features averaged NO2 concentration maps over 10 continuous days from May 1st to 10th, 2018, while b.2 presents the corresponding reconstruction. Reconstruction on a single day can be found in Supporting Information Fig. A7.
Discussion
In summary, the proposed implicit representation frameworks thus hold several advantages regarding the high-resolution mapping of air pollution. First, HF-SDF provides an advanced paradigm for end-to-end high-resolution mapping of air pollution from low-quality labeled or heterogeneous-distributed data to maps at expected resolution, significantly improving the quality of those data that suffer from lower-than-expected availability. Furthermore. the ability to be independent of other geographic predictors, such as road networks and land use maps, allows the structure of the proposed framework to be tailored for a broad range of air pollution-related applications, substantially widening the applicable scenarios with HF-SDF’s strong transferability (easily to be applied to unseen locations or species). The feasibility of new implicit representation would further permit direct data recovery from satellite observations, driving progress in air pollution monitoring by enriching perceptual capabilities.
Nevertheless, the current implementation of HF-SDF has some limitations. For one, the use of the Marching Cubes algorithm can introduce irregular spatial gradients where sharp transitions or noise in the data may cause discontinuities or artifacts. For another, when measurements are overly sparse, our method tends to produce smoother estimations, potentially underestimating pollution levels at hotspots while slightly overestimating them in cleaner areas. In future studies, incorporating more air pollution data from different periods and regions will be necessary to further refine the model, enhancing its accuracy and robustness.
This continuous model not only significantly aids in facilitating specific downstream tasks, such as air-quality-related health research and the development of air pollutant disposal strategies, but also may offer a new approach for addressing broader data assimilation challenges in the field of Earth Science. This is made possible by the growing demand for a unified representation, which is likely to extend to Earth System Models (ESMs) that integrate diverse environmental factors beyond the scope of specific pollutants across varying resolutions. Additionally, we anticipate the potential to use the proposed representation to learn spatial patterns of relevant geophysical parameters with higher resolution or fewer gaps, further enhancing the richness of air pollution monitoring.
Methods
The satellite data for air pollution monitoring has been limited by its inadequate availability, typically characterized by low resolution or incomplete observations. To address these challenges, we propose a novel implicit representation, called the Height-Field Signed Distance Function (HF-SDF), which models air pollutant distribution as a stretched surface within the abstract 3D space defined by latitude, longitude, and air pollutant density. Built upon HF-SDF representation, our general framework for high-resolution mapping of air pollution includes measurement, data processing, model training, and map inference (Fig. 5a–d). Next, we first introduce the design of our implicit representation, followed by the implementation of the pipeline, and finally, the model configuration and validation. For further details on the self-supervised model settings, please refer to Appendix A.
a–d illustrate the stepwise modeling approach, including measurement, data processing, model training, and map inference. Specifically, (a.1), (b), (c.1), and (d) represent the high-resolution mapping pipeline for single-day measurements, while (a.2), (b), (c.2), and (d) correspond to the high-resolution mapping pipeline for multi-day measurements. Notably, in (d), the zero iso-surface of the HF-SDF field is shown in white, with points having positive HF-SDF values colored in red to indicate positive discrepancies and points with negative values in blue to indicate negative discrepancies.
Implicit representation: Height-Field Signed Distance Function (HF-SDF)
Implicit representation refers to a class of neural networks26,27,28,29 that can learn to represent complex, high-dimensional functions without explicit parameterization, where the signal can be parameterized into a continuous field. In this study, we propose a novel implicit representation, called Height-Field Signed Distance Function (HF-SDF), to reconstruct continuous concentration maps for air pollutants. Our approach models the distribution of air pollution as an unfolded surface within a customized 3D space constructed by latitude, longitude, and air pollutant density.
In the context of HF-SDF shown in Fig. 6c, we assume that there is a manifold traversing the customized 3D space, dividing it into two disjoint portions (the upper and the lower). This manifold is referred to as the reference surface (or concentration surface) S. For each 3D point, we compute the HF-SDF value that measures the distance between the current position and the reference surface S along the height axis, with the sign indicating whether the point is above or below the reference surface S. To clarify, for an arbitrary point in the space represented as p(x, y, z) ∈ R3, we define the HF-SDF as a mapping: R3 ↦ R, denoted as SDF*:
where S is the 3D reference surface, the height (or z-axis) indicates air pollutant density, and \({p}^{{\prime} }:= (x,y,{z}^{{\prime} })\in S\) is a point sampled from the reference surface S. Particularly, the point \({p}^{{\prime} }\) shares the same (x, y) coordinates with point p, as it is the projection of p onto the reference surface S with respect to the height. It is straightforward to show that HF-SDF has an inherent differential property:
a Decision boundary of the implicit reference surface (the curved line) represents the zero iso-surface in the HF-SDF formulation. Red points indicate samples with positive HF-SDF values (SDF* > 0), while blue points indicate samples with negative values (SDF* < 0). b Zero iso-surface from a vertical cross-sectional perspective. c HF-SDF field modeling for air pollutants, where the space is defined by latitude, longitude, and estimated concentration. The reference surface S is the target of interest, corresponding to the decision boundary in (a); the upper and lower regions represent the positive and negative HF-SDF subspaces, respectively.
HF-SDF constitutes a continuous representation that may be intuitively understood as a learnable shape-conditioned classifier, for which the decision boundary is the concentration surface S (Fig. 6). In the setting of HF-SDF, the coordinates of each 3D point hold tailored physical meaning: The height represents the estimated density value at the geographic location (x, y), where x and y indicate latitude and longitude, respectively. Since the concentration surface is a 3D representation of an air pollution map, the HF-SDF value at each point p(x, y, z) can be understood as the discrepancy between estimated density and actual density at location (x, y). Intuitively, points above the concentration surface have positive discrepancies, leading to positive HF-SDF values (Fig. 6a, red points). Conversely, points below the concentration surface have negative discrepancies, resulting in negative HF-SDF values (Fig. 6a, blue points). Points lying on the concentration surface have HF-SDF values of zero (Fig. 6b, the white boundary between red and blue points).
Three-staged modeling
In practice, we incorporate the proposed implicit representation (HF-SDF) into an MLP-based deep framework for high-resolution mapping. Given the measurement (Fig. 5a), the pipeline consists of three stages: data processing (Fig. 5b), model training (Fig. 5c), and map inference (Fig. 5d).
The first stage, called dimension lifting, transforms raw air pollution measurements by interpreting pollutant concentration as “height” in an additional dimension. This allows the data to be mapped into a 3D concentration space defined by {(latitude, longitude, air pollutant density)}. The transformed measurements correspond to those scattered 3D points sampled on the concentration surface. In doing so, we convert the goal of high-resolution mapping to the reconstruction of a continuous concentration surface within the 3D space. For the sake of training ease, we normalize the points to be within a regular cube. To train a neural network to fit the underlying surface, we further sample points with positive and negative HF-SDF values around the transformed measurements.
In the second stage, two networks are tailored for HF-SDF to model the geometric structure of the concentration surface, distinguishing between single-day and multi-day measurements. These networks share the same backbone, as shown in Fig. 5c. If only single-day measurement is available, then we employ an MLP-based network (Fig. 5c.1) to regress a continuous HF-SDF Field for its concentration surface with the pre-processed data from the last stage, which is regularized by a certain geometric constraint of HF-SDF. Specifically, it inputs the 3D points from single-day observations and outputs their corresponding HF-SDF values. We can further enhance our model given access to multi-day measurements through an auto-decoder network. In this case, we introduce a learnable low-dimensional vector for coded map HF-SDF, known as a latent code, which encapsulates most of the prior knowledge. As illustrated in Fig. 5c.2, the latent code, concatenated with the 3D point, serves as an additional input to predict HF-SDF values, further refining the captured patterns through iterative updates. With the well-trained network, a continuous HF-SDF field can be approximated for a set of observations.
The third stage is a procedure called high-resolution map inference, which involves identifying the concentration surface from an approximated continuous HF-SDF field and then projecting it onto a 2D map. Under the assumption of the HF-SDF field, the concentration surface is implicitly represented as the zero iso-surface of the HF-SDF field. To identify this concentration surface, we first sample n3 grid points at expected resolution in the unit cube (xi, yj, zk), i, j, k ∈ {1, 2, ⋯, n}. Here, n can be tuned so that the query resolution is sufficient for the user. With a trained neural network, we evaluate the HF-SDF value for each grid point, which indicates the distance from this point to the underlying concentration surface (e.g., corresponding to the white part of the HF-SDF field in Fig. 5d). The Marching Cubes algorithm30 is then applied to identify the concentration surface of interest with a triangle mesh based on the grid points and their HF-SDF values. Once the surface points have been identified, it is straightforward to project the result vertically onto a 2D map with respect to ‘height’—the dimension indicating air pollutant density.
Model configuration and validation
The self-supervised model, an MLP-based auto-decoder network defined by Eq. (3), is constructed to reconstruct the concentration surface from available 3D points. Regarding the single map network, the MLP with 8 layers is taken to estimate the HF-SDF field with a customized loss function (Eq. (4)).
Here, λ = 0.1 is a hyperparameter. The first term \({{\mathcal{L}}}_{{\mathcal{I}}}(\theta )\) is given by:
This term encourages the function f to vanish on the point cloud \({\mathcal{I}}\), where \({\mathcal{I}}\) represents the set of observations. The second term \({{\mathcal{L}}}_{{\mathcal{J}}}^{{\prime} }(\theta )\) is defined as:
This term encourages the partial gradient \(\frac{\partial {f}_{\theta }(x)}{\partial z}\) to be close to 1, as indicated by Eq. (2). Here, \({\mathcal{J}}\) represents a point cloud randomly sampled round about \({\mathcal{I}}\), xj denotes any simple in \({\mathcal{J}}\), and z indicates height.
Regarding the coded map network (Eq. (5)), it takes latent code \({\mathcal{Z}}\) as an additional input to the MLP-based model aside from 3D points, allowing the network to learn domain knowledge of air pollution. At training time, we use the loss function defined as Eq. (6).
where \({{\mathcal{Z}}}_{k}\in {R}^{256}\), initialized to 0, is a latent vector corresponding to each training example \(k\in {\mathcal{K}}\), \({\mathcal{B}}\subset {\mathcal{K}}\) is a batch, α = 0.01, \({\mathcal{L}}\) defined in Eq. (4), λ = 0.1 is a hyperparameter. Once the network is well-trained, for each reconstruction target, a latent code \({\mathcal{Z}}\) corresponding to its observations \({\mathcal{X}}\) can be estimated via Maximum-a-Posterior (MAP) estimation (Eq. (7)). Information regarding the specified hyperparameters can be found in Supporting Information.
For model validation, statistical metrics, including correlation coefficient (R), index of agreement (IOA), mean absolute bias (MAB), and root mean squared error (RMSE), were used for model evaluation, according to Eqs. (8)–(11), respectively. In these equations, Oi denotes observation, while Ei represents estimation.
Data availability
Reanalysis data: Tracking Air Pollution in China (TAP)31,32,33,34,35 is a multi-scale, near-real-time reanalysis dataset of aerosol and gaseous pollutant concentrations in the atmosphere over China. It integrates diverse data sources, including ground observations, satellite remote sensing, emission inventories, and model simulations, providing a holistic view of aerosol and gaseous pollutant concentrations over China at resolutions of 10 km and 1 km. In this study, daily air pollution data from TAP (http://tapdata.org.cn/), involving PM2.5 and related air pollutants, were employed to evaluate the models in the following four aspects: high-resolution mapping, reconstruction based on sparse observations, reconstruction in unmeasured areas, generalization and transferability. Satellite data: The TROPOspheric Monitoring Instrument (TROPOMI)36 is the satellite instrument on board the Copernicus Sentinel-5 Precursor satellite, operational since October 2017. TROPOMI measures key atmospheric pollutants such as nitrogen dioxide (NO2), ozone (O3), and carbon monoxide (CO) with a high spatial resolution of up to 3.5 km × 7 km. We derived its NO2 product from the official website of the EARTHDATA (https://disc.gsfc.nasa.gov/) for validating the performance of our method on real-world observational data.
Code availability
The Python manipulation codes in PyTorch are available at Github https://github.com/height-field-sdf/implicit-representation-for-high-resolution-mapping. Besides, we refer readers to https://height-field-sdf.github.io/ for illustrations.
References
Li, X., Jin, L. & Kan, H. Air pollution: a global problem needs local fixes. Nature 570, 437–439 (2019).
Lelieveld, J., Evans, J. S., Fnais, M., Giannadaki, D. & Pozzer, A. The contribution of outdoor air pollution sources to premature mortality on a global scale. Nature 525, 367–371 (2015).
Zhu, S., Mac Kinnon, M., Carlos-Carlos, A., Davis, S. J. & Samuelsen, S. Decarbonization will lead to more equitable air quality in California. Nat. Commun. 13, 5738 (2022).
Daellenbach, K. R. et al. Sources of particulate-matter air pollution and its oxidative potential in Europe. Nature 587, 414–419 (2020).
Tang, R. et al. Air quality and health co-benefits of China’s carbon dioxide emissions peaking before 2030. Nat. Commun. 13, 1008 (2022).
He, G., Pan, Y. & Tanaka, T. The short-term impacts of COVID-19 lockdown on urban air pollution in China. Nat. Sustain. 3, 1005–1011 (2020).
Kumar, R. P., Samuel, C. & Gautam, S. A bibliometric and scientometric: analysis towards global pattern and trends related to aerosol and precipitation studies from 2002 to 2022. Air Qual. Atmos. Health 16, 613–628 (2023).
Rami, Y. et al. Validation and spatial–temporal variability of particulate matter in urban area using WRF-chem with local and global emission inventories. Water, Air, Soil Pollut. 235, 734 (2024).
Wang, H. et al. Califormer: Leveraging unlabeled measurements to calibrate sensors with self-supervised learning. In Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing, 743–748 (2023).
Liu, Y. et al. Mobiair: Unleashing sensor mobility for city-scale and fine-grained air-quality monitoring with Airbert. In Proceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services, 223–236 (2024).
Xu, R. et al. Global population exposure to landscape fire air pollution from 2000 to 2019. Nature 621, 521–529 (2023).
Jbaily, A. et al. Air pollution exposure disparities across US population and income groups. Nature 601, 228–233 (2022).
Burke, M. et al. The contribution of wildfire to PM2.5 trends in the USA. Nature 622, 761–766 (2023).
Wei, J. et al. First close insight into global daily gapless 1 km PM2.5 pollution, variability, and health impact. Nat. Commun. 14, 8349 (2023).
Daepp, C. et al. Eclipse: an end-to-end platform for low-cost, hyperlocal environmental sensing in cities. In Proceedings of 21th ACM/IEEE IPSN, 28–40 (IEEE, 2022).
Burney, J. A. The downstream air pollution impacts of the transition from coal to natural gas in the United States. Nat. Sustain. 3, 152–160 (2020).
Li, C. et al. Reversal of trends in global fine particulate matter air pollution. Nat. Commun. 14, 5349 (2023).
Jia, H., Ma, X., Yu, F. & Quaas, J. Significant underestimation of radiative forcing by aerosol–cloud interactions derived from satellite-based methods. Nat. Commun. 12, 3649 (2021).
De Hoogh, K. et al. Predicting fine-scale daily NO2 for 2005–2016 incorporating OMI satellite data across Switzerland. Environ. Sci. Technol. 53, 10279–10287 (2019).
Pan, Y., Zhao, C. & Liu, Z. Estimating the daily NO2 concentration with high spatial resolution in the beijing–tianjin–hebei region using an ensemble learning model. Remote Sens. 13, 758 (2021).
Chan, K. L., Khorsandi, E., Liu, S., Baier, F. & Valks, P. Estimation of surface NO2 concentrations over Germany from TROPOMI satellite observations using a machine learning method. Remote Sens. 13, 969 (2021).
de Hoogh, K., Héritier, H., Stafoggia, M., Künzli, N. & Kloog, I. Modelling daily PM2.5 concentrations at high spatio-temporal resolution across Switzerland. Environ. Pollut. 233, 1147–1154 (2018).
Ghahremanloo, M., Lops, Y., Choi, Y. & Yeganeh, B. Deep learning estimation of daily ground-level NO2 concentrations from remote sensing data. J. Geophys. Res.: Atmos. 126, e2021JD034925 (2021).
Anand, B. et al. Dynamic change analysis of water spread region and its impact assessment using spectral indices of remotely sensed data. Environ. Dev. Sustain. 26, 17635–17652 (2024).
Oliver, M. A. & Webster, R. Kriging: a method of interpolation for geographical information systems. Int. J. Geogr. Inf. Syst. 4, 313–332 (1990).
Park, J. J., Florence, P., Straub, J., Newcombe, R. & Lovegrove, S. Deepsdf: Learning continuous signed distance functions for shape representation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 165–174 (2019).
Saito, S. et al. Pifu: Pixel-aligned implicit function for high-resolution clothed human digitization. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2304–2314 (2019).
Sitzmann, V., Martel, J., Bergman, A., Lindell, D. & Wetzstein, G. Implicit neural representations with periodic activation functions. Adv. Neural Inf. Process. Syst. 33, 7462–7473 (2020).
Gropp, A., Yariv, L., Haim, N., Atzmon, M. & Lipman, Y. Implicit geometric regularization for learning shapes. In Proceedings of International Conference on Machine Learning, 3747-3757 (2020).
Lorensen, W. E. & Cline, H. E. Marching cubes: a high resolution 3d surface construction algorithm. In Seminal Graphics: Pioneering Efforts that Shaped the Field, 347–353 (1998).
Geng, G. et al. Tracking air pollution in China: near real-time PM2.5 retrievals from multisource data fusion. Environ. Sci. Technol. 55, 12106–12115 (2021).
Xiao, Q. et al. Spatiotemporal continuous estimates of daily 1 km PM2.5 from 2000 to present under the tracking air pollution in China (Tap) framework. Atmos. Chem. Phys. 22, 13229–13242 (2022).
Xiao, Q. et al. Separating emission and meteorological contribution to long-term PM2.5 trends over eastern China during 2000–2018. Atmos. Chem. Phys. 21, 9475–9496 (2021).
Xiao, Q. et al. Evaluation of gap-filling approaches in satellite-based daily PM2.5 prediction models. Atmos. Environ. 244, 117921 (2021).
Liu, S. et al. Tracking daily concentrations of PM2.5 chemical composition in China since 2000. Environ. Sci. Technol. 56, 16517–16527 (2022).
Sentinel, C. 5P. TROPOMI level 2 nitrogen dioxide total column products. version 02 (2018).
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China under contract No. 62171256, 62331006. Further, We gratefully acknowledge the support of Haoyang Wang from Tsinghua Shenzhen International Graduate School and Dr. Fangxin Fang from Imperial College London for their constructive discussions.
Author information
Authors and Affiliations
Contributions
R.H. and B.Z. initiated and supervised the project. R.H., B.Z., and T.Z. conceived the idea and developed the methods. T.Z. developed the models and conducted the experiments. T.Z., R.H., and B.Z. analyzed the results and prepared the manuscript. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhang, T., Zheng, B. & Huang, R. Adaptive high-resolution mapping of air pollution with a novel implicit 3D representation approach. npj Clim Atmos Sci 8, 180 (2025). https://doi.org/10.1038/s41612-025-01044-6
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41612-025-01044-6








