Abstract
Advances in machine learning (ML) over the past decade have resulted in a proliferation of algorithmic applications for encoding, characterizing, and acting on complex data that may contain numerous multidimensional features. Recently, the emergence of deep-learning models trained across large datasets has created a new paradigm for ML in the form of Foundation Models (FMs). FMs are models trained on large and broad datasets with an extensive number of parameters. Once built, these extremely powerful, flexible models can be utilized in less resource-intensive ways to build a variety of downstream applications that can integrate previously disparate, multimodal data. These applications can be developed rapidly and with a much lower demand for ML expertise. Additionally, the necessary infrastructure and models themselves are already established within agencies such as NASA and ESA. At NASA, this work extends across several divisions of the Science Mission Directorate. Examples include the NASA Goddard and INDUS Large Language Models and the Prithvi Geospatial Foundation Model. Furthermore, ESA initiatives to bring FMs to Earth observation have led to the development of TerraMind. In February 2025, a workshop was held by NASA Ames Research Center and the SETI Institute to explore the potential of FMs in astrobiological research and identify the steps necessary to build and utilize such a model or models. Here, we share the findings and recommendations of that workshop and describe clear near-term and future opportunities in the development of an FM (or FMs) for astrobiology applications. These applications would include a biosignature or life characterization task, a mission development and operations task, and a natural language task for integrating and supporting astrobiology research needs.