Pulse Brain · Growing Health Evidence Index
Tier 3 — Observational / field trialPeer-reviewed

DNA language models are powerful predictors of genome-wide variant effects

Gonzalo Benegas, Sanjit Singh Batra, Yun S. Song

Proceedings of the National Academy of Sciences · 2023

Read source ↗ All evidence

Summary

This peer-reviewed study presents genome-wide predictive networks (GPN), a machine learning approach using DNA language models to predict the phenotypic effects of genetic variants across entire plant genomes without requiring experimental training data. The authors demonstrate the approach on Arabidopsis and provide open-source code enabling application to any plant species, with results visualised through the UCSC Genome Browser. This represents a computational advance in genomic prediction methodology with potential applications for crop improvement and variant interpretation.

Regional applicability

The computational methodology is geographically neutral and directly transferable to United Kingdom crop and plant research. The open-source framework could support UK plant breeding programmes, genetics research, and crop improvement initiatives using genomic data from local germplasm or breeding populations.

Key measures

Predictive accuracy of DNA language models for variant effect prediction; genome-wide variant effect scores visualised in the UCSC Genome Browser

Outcomes reported

The study developed and validated genome-wide predictive models (GPN) using DNA language models to forecast the effects of genetic variants across entire plant genomes. The models were demonstrated on Arabidopsis thaliana and made available as open-source code for application to any species.

Theme
Measurement & metrics
Subject
Measurement methods & nutrient profiling
Study type
Research
Study design
Laboratory / computational study
Source type
Peer-reviewed study
Status
Published
Geography
United States
System type
Laboratory / in vitro
DOI
10.1073/pnas.2311219120
Catalogue ID
SNmp6e6pqt-tclfor

Topic tags

Pulse AI · ask about this record

Dig deeper with Pulse AI.

Pulse AI has read the whole catalogue. Ask about this record, its theme, or how the findings apply to UK farming and policy — every answer cites the underlying studies.