Identifying Anomalous DESI Galaxy Spectra with a Variational Autoencoder

dc.contributor.authorC. Nicolaou
dc.contributor.authorRowina S Nathan
dc.contributor.authorO Lahav
dc.contributor.authorA. Palmese
dc.contributor.authorA. Saintonge
dc.contributor.authorJ. Aguilar
dc.contributor.authorS. P. Ahlen
dc.contributor.authorC. Allende Prieto
dc.contributor.authorS. Bailey
dc.contributor.authorS. BenZvi
dc.coverage.spatialBolivia
dc.date.accessioned2026-03-22T20:52:19Z
dc.date.available2026-03-22T20:52:19Z
dc.date.issued2026
dc.description.abstractABSTRACT The tens of millions of spectra being captured by the Dark Energy Spectroscopic Instrument (DESI) provide tremendous discovery potential. In this work we show how Machine Learning, in particular Variational Autoencoders (VAE), can detect anomalies in a sample of approximately 200 000 DESI spectra comprising galaxies, quasars and stars. We demonstrate that the VAE can compress the dimensionality of a spectrum by a factor of 100, while still retaining enough information to accurately reconstruct spectral features. We detect anomalous spectra as those with high reconstruction error and those which are isolated in the VAE latent representation. The anomalies identified fall into two categories: spectra with artefacts and spectra with unique physical features. Awareness of the former could improve the DESI spectroscopic pipeline; whilst the latter could help us discover new and unusual objects. To further curate the list of outliers identified, we use the Astronomaly package which employs Active Learning to provide personalized outlier recommendations for visual inspection. In this work we also explore the VAE latent space, finding that different object classes and subclasses are separated despite being unlabelled. We inject controlled synthetic anomalies and analyse their locations in the latent space to illustrate how the VAE responds to atypical spectral features; and we demonstrate the interpretability of this latent space by identifying tracks within it that correspond to various spectral characteristics. In upcoming work we hope to apply the methods presented here to search for both systematics and astrophysically interesting objects in much larger datasets of DESI spectra.
dc.identifier.doi10.1093/mnras/stag010
dc.identifier.urihttps://doi.org/10.1093/mnras/stag010
dc.identifier.urihttps://andeanlibrary.org/handle/123456789/84566
dc.language.isoen
dc.publisherOxford University Press
dc.relation.ispartofMonthly Notices of the Royal Astronomical Society
dc.sourceUniversity College London
dc.subjectAutoencoder
dc.subjectInterpretability
dc.subjectSpectral line
dc.subjectPhysics
dc.subjectPattern recognition (psychology)
dc.subjectArtificial intelligence
dc.subjectGalaxy
dc.subjectComputer science
dc.subjectIdentification (biology)
dc.subjectOutlier
dc.titleIdentifying Anomalous DESI Galaxy Spectra with a Variational Autoencoder
dc.typepreprint

Files