EPSRC Doctoral Prize Fellow at the University of Bristol

News

2025-08-06

Published Stratify: Unifying Multi-Step Forecasting Strategies in ECML / Springer Nature!

2025-05-23

Published Euclid Quick Data Release (Q1). Exploring galaxy properties with a multi-modal foundation model in Astronomy & Astrophysics!

2025-03-25

Presented Towards Foundational Models for Dynamical System Reconstruction: Hierarchical Meta-Learning via Mixture of Experts at ICLR 2025 - First Workshop on Scalable Optimization for Efficient and Adaptive Foundation Models!

2025-03-19

Published Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images on arXiv!

2024-10-03

I have successfully defended my thesis and have been awarded my PhD in Interactive AI!

2024-09-19

Published Euclid preparation - XLIII. Measuring detailed galaxy morphologies for Euclid with machine learning in Astronomy & Astrophysics!

2024-06-03

I have started my 2-year Fellowship at the University of Bristol working on Diffusion Models and Digital Twins!

2024-02-15

Published Time-Series Classification for Dynamic Strategies in Multi-Step Forecasting on arXiv!

2024-02-15

Presented a talk on 'Optimizing Data Efficiency: Using Active Learning Strategies and the QUEST Method for Efficient Classification and Labeling in Large Datasets' at 'Galaxies & AGN with the First Euclid Data and Beyond' in Bologna.

2023-03-28

Presented a pecha-kucha talk on my PhD research at the Interactive AI CDT Spring Research Conference.

Keyword: diffusion

Top: Noised images produced by the cosine-beta schedule at different timesteps. Each image is a sample from the signal-to-noise (S/N) bin directly below it. Because of the scale of the pixel values, the added noise has a larger impact on the typically fainter, low-S/N images, so these converge to Gaussian noise much earlier in the forward process. As a result of this relationship between S/N and rate of convergence, the entire top left of the image grid is pure noise, indicating inefficient training for lower-S/N images. This highlights the difficulty of applying off-the-shelf pipelines to real-world astronomical data, which feature high dynamic range and varying image quality. Bottom: Distribution of S/N of the galaxy images. Even though the sample is dominated by lower-S/N images, a non-negligible number of sources with S/N→1000 remains in the training set.
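The effect described in the caption can be illustrated with a small sketch. It assumes the widely used cosine schedule with offset s = 0.008 and 1000 timesteps; the toy Gaussian "galaxies", their peak values, and the helper names (`cosine_alpha_bar`, `forward_noise`, `first_noise_dominated_step`, `toy_galaxy`) are illustrative assumptions, not taken from the paper's pipeline.

```python
import numpy as np


def cosine_alpha_bar(num_steps: int, s: float = 0.008) -> np.ndarray:
    """Cumulative signal fraction alpha_bar_t for t = 0..num_steps (assumed cosine form)."""
    t = np.arange(num_steps + 1) / num_steps
    f = np.cos((t + s) / (1 + s) * np.pi / 2) ** 2
    return f / f[0]


def forward_noise(x0: np.ndarray, alpha_bar_t: float, rng: np.random.Generator) -> np.ndarray:
    """Draw x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps."""
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(alpha_bar_t) * x0 + np.sqrt(1.0 - alpha_bar_t) * eps


def first_noise_dominated_step(x0: np.ndarray, alpha_bar: np.ndarray) -> int:
    """First timestep at which injected noise variance exceeds the scaled signal power."""
    signal_power = np.mean(x0 ** 2)
    noise_wins = alpha_bar * signal_power < (1.0 - alpha_bar)
    return int(np.argmax(noise_wins))


def toy_galaxy(size: int = 32, sigma: float = 4.0, peak: float = 1.0) -> np.ndarray:
    """Hypothetical stand-in for a galaxy cutout: a centred 2D Gaussian."""
    yy, xx = np.indices((size, size))
    c = (size - 1) / 2
    return peak * np.exp(-((yy - c) ** 2 + (xx - c) ** 2) / (2 * sigma ** 2))


alpha_bar = cosine_alpha_bar(1000)
bright = toy_galaxy(peak=1.0)   # stands in for a high-S/N source
faint = toy_galaxy(peak=0.05)   # stands in for a low-S/N source

bright_step = first_noise_dominated_step(bright, alpha_bar)
faint_step = first_noise_dominated_step(faint, alpha_bar)

# One noised sample of the faint source, partway through the forward process.
rng = np.random.default_rng(0)
noised_faint = forward_noise(faint, alpha_bar[200], rng)

print(f"noise dominates the faint image at t={faint_step}, the bright image at t={bright_step}")
```

Under these assumptions the faint image is noise-dominated far earlier than the bright one, so most of its timesteps carry little usable signal during training.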
Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images
G. Stevens, S. Fotopoulou, M.N. Bremer, T. Matamoro Zatarain, K. Jahnke, B. Margalef-Bentabol, M. Huertas-Company, M.J. Smith, M. Walmsley, M. Salvato, M. Mezcua, A. Paulino-Afonso, M. Siudek, M. Talia, F. Ricci, W. Roster, Euclid Consortium
arXiv preprint
Keywords: diffusion, computer-vision, euclid-consortium, classification
Light emission from galaxies exhibits diverse brightness profiles, influenced by factors such as galaxy type, structural features and interactions with other galaxies. Elliptical galaxies feature more uniform light distributions, while spiral and irregular galaxies have complex, varied light profiles due to their structural heterogeneity and star-forming activity. In addition, galaxies with an active galactic nucleus (AGN) feature intense, concentrated emission from gas accretion around supermassive black holes, superimposed on regular galactic light, while quasi-stellar objects (QSO) are the extreme case of the AGN emission dominating the galaxy. The challenge of identifying AGN and QSO has been discussed many times in the literature, often requiring multi-wavelength observations. This paper introduces a novel approach to identify AGN and QSO from a single image. Diffusion models have recently been developed in the machine-learning literature to generate realistic-looking images of everyday objects. Utilising the spatial resolving power of the Euclid VIS images, we created a diffusion model trained on one million sources, without using any source pre-selection or labels. The model learns to reconstruct light distributions of normal galaxies, since the population is dominated by them. We condition the prediction of the central light distribution by masking the central few pixels of each source and reconstructing the light according to the diffusion model. We further use this prediction to identify sources that deviate from this profile by examining the reconstruction error of the few central pixels regenerated in each source's core. Our approach, solely using VIS imaging, features high completeness compared to traditional methods of AGN and QSO selection, including optical, near-infrared, mid-infrared, and X-rays.
@misc{stevens2025EuclidInpaintingAGN,
  author = {{Stevens}, G. and {Fotopoulou}, S. and {Bremer}, M.~N. and {Matamoro Zatarain}, T. and {Jahnke}, K. and {Margalef-Bentabol}, B. and {Huertas-Company}, M. and {Smith}, M.~J. and {Walmsley}, M. and {Salvato}, M. and {Mezcua}, M. and {Paulino-Afonso}, A. and {Siudek}, M. and {Talia}, M. and {Ricci}, F. and {Roster}, W. and the {Euclid Collaboration}.},
  title = "{Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images}",
  year = {2025},
  eprint = {2503.15321}
}
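The masking-and-scoring idea in the abstract (mask the central pixels, regenerate them, and measure the reconstruction error in the core) can be sketched as follows. This is only an illustration: `toy_inpaint` is a hypothetical stand-in that fills the core from the surrounding ring and is not a diffusion model, the mean squared error is an assumed choice of score, and the toy images and radii are made up.

```python
import numpy as np


def central_mask(shape: tuple, radius: float) -> np.ndarray:
    """Boolean mask that is True for pixels within `radius` of the image centre."""
    yy, xx = np.indices(shape)
    cy, cx = (shape[0] - 1) / 2, (shape[1] - 1) / 2
    return (yy - cy) ** 2 + (xx - cx) ** 2 <= radius ** 2


def toy_inpaint(image: np.ndarray, radius: float) -> np.ndarray:
    """Hypothetical stand-in for the generative model (NOT a diffusion model):
    fill the masked core with the median of the ring just outside it."""
    mask = central_mask(image.shape, radius)
    ring = central_mask(image.shape, radius + 2) & ~mask
    filled = image.copy()
    filled[mask] = np.median(image[ring])
    return filled


def core_reconstruction_error(image: np.ndarray, reconstruction: np.ndarray,
                              mask: np.ndarray) -> float:
    """Mean squared error over the masked core pixels (assumed score definition)."""
    return float(np.mean((image[mask] - reconstruction[mask]) ** 2))


def gaussian_blob(size: int, sigma: float, peak: float) -> np.ndarray:
    """Centred 2D Gaussian used to build toy sources."""
    yy, xx = np.indices((size, size))
    c = (size - 1) / 2
    return peak * np.exp(-((yy - c) ** 2 + (xx - c) ** 2) / (2 * sigma ** 2))


radius = 2.0
galaxy = gaussian_blob(33, sigma=6.0, peak=1.0)            # smooth "normal" galaxy
agn = galaxy + gaussian_blob(33, sigma=1.0, peak=5.0)      # same galaxy plus a bright point-like core

mask = central_mask(galaxy.shape, radius)
galaxy_score = core_reconstruction_error(galaxy, toy_inpaint(galaxy, radius), mask)
agn_score = core_reconstruction_error(agn, toy_inpaint(agn, radius), mask)

print(f"core error: normal galaxy {galaxy_score:.4f}, point-like core {agn_score:.4f}")
```

In this toy setting the source with an extra central component scores much higher, because a reconstruction informed only by the surrounding light underestimates its core. A model trained mostly on normal galaxies would be expected to behave similarly, which is the property the abstract relies on.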