
New Neural Architecture Search method for multi-image data fusion, including code and two image classification datasets for training and validation.
Every day, we collect more than 150TB of publicly accessible Earth Observation (EO) data. Researchers and practitioners develop and deploy Machine Learning (ML) systems to make sense of this wealth of data, for example, to monitor crops or to find rare phenomena like floods or large emissions. Increasingly, researchers adopt Automated Machine Learning (AutoML) techniques to further improve the performance of their machine learning pipelines and to overcome challenges like the sheer diversity of GeoAI tasks and datasets. AutoML research focuses on automatically designing and configuring high-performance ML systems. But how do we maintain the trustworthiness of these ML-based systems under increased automation?
In this blog, I’ll share how we developed an AutoML approach for detecting large emissions of greenhouse gases in satellite images. We’ll explore our method’s components, key findings, and the main challenges in designing AutoML systems for EO applications.
Need a recap on AutoML for Earth Observation? Read our previous blog series: Part 1, Part 2, Part 3.
Why do we want to detect plumes?
When we think of satellite data, many of us think of Google Maps’ high-resolution images of the Earth’s surface. Optical images help us track human activity like deforestation or the expansion of cities.
However, not all human activity is visible. Methane leaking from gas pipelines is colourless and odourless, but it still impacts global warming. Similarly, carbon monoxide, another colourless and odourless gas, is co-emitted with the better-known carbon dioxide in incomplete combustion. Satellites carrying specialised instruments such as TROPOMI help us estimate and eventually monitor anthropogenic emissions of these gases.
Large emissions from a point source, such as a leaking gas pipeline, show up as plume-like shapes in the data. These plumes are relatively simple visual features that could be detected using computer vision approaches.
However, there’s a catch. ML-based atmospheric plume detection pipelines are susceptible to false positives due to challenges arising from the data, such as missing pixels due to cloud coverage. If not sufficiently addressed, these false positives put the trust in operational ML-based plume detection systems at risk.
Schuit et al. have already addressed this problem in methane plume detection. But what about other atmospheric plumes, like carbon monoxide? We designed AutoMergeNet, a more generally applicable AutoML system for multi-image data fusion, to create deployment-ready atmospheric plume detection pipelines for gases beyond methane.

AutoML-based multi-image data fusion
The main contribution of our method is the design of AutoMergeNet’s search space, which has three components:
- Multi-branch networks: We include additional data layers from the satellite data product to help the model rule out potential false positives. For example, cloud masks tell us which pixels are missing due to clouds. Extracting meaningful features from this data is challenging, as the different fields have different value ranges and distributions. To address this, AutoMergeNet creates multi-branch neural networks with identical but independent input branches to extract features from each data layer.
- Trade-off feature extraction and fusion: Data fusion networks work in two stages: feature extraction from the input data sources, and fusion of the multi-source features. AutoMergeNet optimises the balance between these two stages by trading off the depth of each stage.
- Optimise fusion strategy: Finally, AutoMergeNet automatically selects the best-performing strategy for fusing the features from the input sources.
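The three search-space components above can be sketched in a few lines. This is a minimal, illustrative NumPy toy, not AutoMergeNet’s actual implementation: the names (`branch_depth`, `fusion_depth`, the fusion ops) and the use of dense layers in place of convolutional blocks are my assumptions for illustration.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def make_mlp(rng, in_dim, out_dim, depth):
    """A stack of `depth` dense layers, standing in for a conv branch or head."""
    dims = [in_dim] + [out_dim] * depth
    return [rng.standard_normal((dims[i], dims[i + 1])) * 0.1
            for i in range(depth)]

def run_mlp(weights, x):
    for w in weights:
        x = relu(x @ w)
    return x

# Component 3: the fusion strategy is itself a searchable choice.
FUSION_OPS = {
    "concat": lambda feats: np.concatenate(feats, axis=-1),
    "sum":    lambda feats: np.sum(feats, axis=0),
    "max":    lambda feats: np.max(feats, axis=0),
}

def multi_branch_forward(rng, layers, branch_depth, fusion_depth, fusion):
    """One candidate architecture: identical but independent branches
    (component 1), trading off branch depth against fusion-head depth
    (component 2), joined by a chosen fusion op (component 3)."""
    # Each data layer gets its own branch with its own weights.
    feats = [run_mlp(make_mlp(rng, layer.shape[-1], 16, branch_depth), layer)
             for layer in layers]
    fused = FUSION_OPS[fusion](feats)
    head = make_mlp(rng, fused.shape[-1], 16, fusion_depth)
    return run_mlp(head, fused)

rng = np.random.default_rng(0)
# e.g. a gas-concentration field and a cloud mask, flattened to vectors
gas = rng.standard_normal((4, 32))
cloud_mask = rng.standard_normal((4, 32))
out = multi_branch_forward(rng, [gas, cloud_mask],
                           branch_depth=2, fusion_depth=1, fusion="concat")
print(out.shape)  # one 16-dim feature vector per sample
```

A NAS system would search over `branch_depth`, `fusion_depth`, and the fusion op; here they are simply fixed by hand to show what one point in the search space looks like.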
With this search space, we automatically create plume detection pipelines for methane detection and carbon monoxide detection.
Multi-branch fusion outperforms more naive approaches
Our results show that our multi-branch fusion approach is significantly more effective than a naive data fusion approach, where different data layers are simply concatenated and fed to the model as a data cube. AutoMergeNet achieves an average accuracy of 94% in methane plume detection and 91% in carbon monoxide plume detection.
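To make the naive baseline concrete, here is a short sketch of what “concatenating layers into a data cube” looks like, and why it is problematic: one set of filters must then cope with layers on wildly different scales. The value ranges below are invented for illustration, not taken from the TROPOMI product.

```python
import numpy as np

rng = np.random.default_rng(1)

# Two hypothetical data layers on very different scales:
gas = rng.uniform(1800.0, 1900.0, (32, 32))          # e.g. a concentration field
cloud_mask = rng.integers(0, 2, (32, 32)).astype(float)  # binary mask in {0, 1}

# Naive fusion: stack everything into one cube of shape (channels, H, W)
# and feed it to a single network.
cube = np.stack([gas, cloud_mask], axis=0)
print(cube.shape)                       # (2, 32, 32)
print(gas.mean(), cloud_mask.mean())    # mismatched scales in one input
```

The multi-branch approach avoids this mismatch by giving each layer its own feature extractor before fusing.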
We applied the best methane plume detection model to a realistic use case to evaluate our approach’s potential for operational use. Our model detected a similar number of plumes to the model developed by Schuit et al. (which is currently in operational use). But something surprising happened: despite the 94% accuracy on the methane plume detection test set, almost 50% of our model’s detections in the operational scenario were false positives. In short, results on the test set are not fully representative of how a model may perform in real life. We are working on a solution for this problem and have already published follow-up work addressing some of the issues (see here: Mitigating representation bias caused by missing pixels in methane plume detection). We will share more about that project in a future blog.

A personal perspective
This research, carried out in collaboration with Earth Scientists from the SRON Space Research Organisation Netherlands, has taught me two main lessons:
- Collaboration between machine learners and domain experts is crucial. As machine learners, we have an internal database of different modelling techniques and experience with evaluating them. But without domain knowledge, the results can be hard to interpret. Domain experts can instantly spot geographic and other patterns, explain sources of noise, and flag many other details crucial to dataset construction and evaluation.
- Applications are messy projects compared to more fundamental machine learning projects with nice and clean benchmark datasets. First, you’re not only designing a machine learning model, but often also a dataset. Second, I had to learn about and apply many different ML techniques to address different aspects of the problem. However, it’s very rewarding to work towards solving a concrete, real-world problem.
Implications
While our models are not ready for real-world monitoring—we still have work to do in closing the generalisation gap—our work opens up new possibilities:
- Datasets: We collected and labelled a new dataset for carbon monoxide plume detection and made it available on Zenodo. Together with the methane plume data collected and labelled by SRON, we now have two datasets for further development of atmospheric plume detection models.
- Expansion of applicability domain: AutoMergeNet’s automated approach, though only evaluated in atmospheric plume detection, could also be applied to other multi-image problems. For example, oil spill detection faces similar challenges with false positives.
Automatically creating deployable pipelines?
Researchers and practitioners across Machine Learning for Earth Observation face the problem of the generalisation gap. It’s difficult to address because there are so many potential causes for performance gaps between our fully labelled test sets and data encountered in the wild. Data can suffer from biases, inherent (such as spatio-temporal autocorrelation) or self-inflicted (representation or sampling bias, e.g. having more data from the Global North than from the Global South).
For AutoML, the problem of the generalisation gap is even more pressing. When we manually develop models, we have many windows into a model’s performance: learning curves, summary statistics, XAI techniques such as attention maps, etc. Today’s AutoML systems have a much narrower view of a pipeline’s performance: often only a single summary statistic like the loss.
Our results have shown that such metrics are not always enough to predict how well a model will work in real life. Moving forward, we want to continue collaborating with domain experts to design better evaluation procedures and datasets that will help us close this generalisation gap.
Read the full study in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing or visit our project pages to access our code and datasets.
About me
Julia Wąsala is a PhD candidate studying Automated Machine Learning for Earth Observation at the ADA Research group at Leiden University and working with atmospheric scientists at the SRON Space Research Organisation Netherlands.

























