ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.10271
112
0

Predicting function of evolutionarily implausible DNA sequences

12 June 2025
Shiyu Jiang
Xuyin Liu
Zitong Jerry Wang
ArXiv (abs)PDFHTML
Main:4 Pages
6 Figures
Bibliography:2 Pages
4 Tables
Appendix:7 Pages
Abstract

Genomic language models (gLMs) show potential for generating novel, functional DNA sequences for synthetic biology, but doing so requires them to learn not just evolutionary plausibility, but also sequence-to-function relationships. We introduce a set of prediction tasks called Nullsettes, which assesses a model's ability to predict loss-of-function mutations created by translocating key control elements in synthetic expression cassettes. Across 12 state-of-the-art models, we find that mutation effect prediction performance strongly correlates with the predicted likelihood of the nonmutant. Furthermore, the range of likelihood values predictive of strong model performance is highly dependent on sequence length. Our work highlights the importance of considering both sequence likelihood and sequence length when using gLMs for mutation effect prediction.

View on arXiv
@article{jiang2025_2506.10271,
  title={ Predicting function of evolutionarily implausible DNA sequences },
  author={ Shiyu Jiang and Xuyin Liu and Zitong Jerry Wang },
  journal={arXiv preprint arXiv:2506.10271},
  year={ 2025 }
}
Comments on this paper