Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.15006
Cited By
DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome
26 June 2023
Zhihan Zhou
Yanrong Ji
Weijian Li
Pratik Dutta
R. Davuluri
Han Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome"
50 / 53 papers shown
Title
DNAZEN: Enhanced Gene Sequence Representations via Mixed Granularities of Coding Units
Lei Mao
Yuanhe Tian
Yan Song
23
0
0
04 May 2025
Multimodal Large Language Models for Medicine: A Comprehensive Survey
Jiarui Ye
Hao Tang
LM&MA
91
0
0
29 Apr 2025
In-Context Learning can distort the relationship between sequence likelihoods and biological fitness
Pranav Kantroo
Günter P. Wagner
Benjamin B. Machta
47
0
0
23 Apr 2025
NdLinear Is All You Need for Representation Learning
Alex Reneau
Jerry Yao-Chieh Hu
Zhongfang Zhuang
Ting-Chun Liu
HAI
44
0
0
21 Mar 2025
Gene42: Long-Range Genomic Foundation Model With Dense Attention
Kirill Vishniakov
Boulbaba Ben Amor
Engin Tekin
Nancy A. ElNaker
Karthik Viswanathan
...
Tiago Magalhaes
Natalia Vassilieva
Dwarikanath Mahapatra
Marco Pimentel
and Shadab Khan
3DV
44
0
0
20 Mar 2025
HELM: Hierarchical Encoding for mRNA Language Modeling
Mehdi Yazdani-Jahromi
Mangal Prakash
Tommaso Mansi
Artem Moskalev
Rui Liao
97
2
0
13 Mar 2025
Can Large Language Models Predict Antimicrobial Resistance Gene?
Hyunwoo Yoo
AI4CE
LM&MA
50
0
0
06 Mar 2025
Enhancing DNA Foundation Models to Address Masking Inefficiencies
Monireh Safari
Pablo Millán Arias
Scott C. Lowe
Lila Kari
Angel X. Chang
Graham W. Taylor
72
0
0
25 Feb 2025
Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification
Zicheng Liu
Siyuan Li
Zhiyuan Chen
Lei Xin
Fang Wu
Chang Yu
Qirong Yang
Yucheng Guo
Yifan Yang
Stan Z. Li
SyDa
AI4CE
92
0
0
11 Feb 2025
TFBS-Finder: Deep Learning-based Model with DNABERT and Convolutional Networks to Predict Transcription Factor Binding Sites
Nimisha Ghosh
Pratik Dutta
Daniele Santoni
60
0
0
03 Feb 2025
BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity
Zahra Gharaee
Scott C. Lowe
ZeMing Gong
Pablo Millán Arias
Nicholas Pellegrino
...
Lila Kari
Dirk Steinke
Graham W. Taylor
Paul Fieguth
Angel X. Chang
56
8
0
28 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
18
0
17 Jan 2025
RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting
Yilei Jiang
Yingshui Tan
Xiangyu Yue
KELM
LRM
46
4
0
25 Dec 2024
Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
Lifeng Qiao
Peng Ye
Yuchen Ren
Weiqiang Bai
Chaoqi Liang
Xinzhu Ma
Nanqing Dong
W. Ouyang
86
2
0
18 Dec 2024
BarcodeMamba: State Space Models for Biodiversity Analysis
Tiancheng Gao
Graham W. Taylor
Mamba
82
2
0
15 Dec 2024
Does your model understand genes? A benchmark of gene properties for biological and text models
Yoav Kan-Tor
Michael M. Danziger
Eden Zohar
Matan Ninio
Yishai Shimoni
73
1
0
05 Dec 2024
Specialized Foundation Models Struggle to Beat Supervised Baselines
Zongzhe Xu
Ritvik Gupta
Wenduo Cheng
Alexander Shen
Junhong Shen
Ameet Talwalkar
M. Khodak
AI4CE
54
6
0
05 Nov 2024
Revisiting K-mer Profile for Effective and Scalable Genome Representation Learning
Abdulkadir Celikkanat
A. Masegosa
Thomas D. Nielsen
21
1
0
04 Nov 2024
BSM: Small but Powerful Biological Sequence Model for Genes and Proteins
Weixi Xiang
Xueting Han
Xiujuan Chai
Jing Bai
26
1
0
15 Oct 2024
Plug-and-Play Controllable Generation for Discrete Masked Models
Wei Guo
Yuchen Zhu
Molei Tao
Yongxin Chen
40
1
0
03 Oct 2024
OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models
Heng Yang
Jack Cole
Ke Li
28
0
0
02 Oct 2024
Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials
Yizhen Zheng
Huan Yee Koh
M. Yang
Li Li
Lauren T. May
Geoffrey I. Webb
Shirui Pan
George Church
LM&MA
44
9
0
06 Sep 2024
Differentially Private Kernel Density Estimation
Erzhi Liu
Jerry Yao-Chieh Hu
Alex Reneau
Zhao Song
Han Liu
66
3
0
03 Sep 2024
Genomic Language Models: Opportunities and Challenges
Gonzalo Benegas
Chengzhong Ye
C. Albors
Jianan Canal Li
Yun S. Song
AI4CE
LM&MA
ELM
50
18
0
16 Jul 2024
Multi-modal Transfer Learning between Biological Foundation Models
Juan Jose Garau-Luis
Patrick Bordes
Liam Gonzalez
Masa Roller
Bernardo P. de Almeida
...
Stefan Laurent
Jan Grzegorzewski
Maren Lang
Thomas Pierrot
Guillaume Richard
AI4CE
41
3
0
20 Jun 2024
PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model
Sajib Acharjee Dip
Uddip Acharjee Shuvo
Tran Chau
Haoqiu Song
Petra Choi
Xuan Wang
Liqing Zhang
LM&MA
23
2
0
19 Jun 2024
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
Yu Zhang
Xiusi Chen
Bowen Jin
Sheng Wang
Shuiwang Ji
Wei Wang
Jiawei Han
44
27
0
16 Jun 2024
BEACON: Benchmark for Comprehensive RNA Tasks and Language Models
Yuchen Ren
Zhiyuan Chen
Lifeng Qiao
Hongtai Jing
Yuchen Cai
...
Siqi Sun
Hongliang Yan
Dong Yuan
Wanli Ouyang
Xihui Liu
44
9
0
14 Jun 2024
GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models
Zicheng Liu
Jiahui Li
Siyuan Li
Z. Zang
Cheng Tan
Yufei Huang
Yajing Bai
Stan Z. Li
ELM
29
8
0
01 Jun 2024
DYNA: Disease-Specific Language Model for Variant Pathogenicity
Huixin Zhan
Zijun Zhang
19
0
0
31 May 2024
DocReLM: Mastering Document Retrieval with Language Model
Gengchen Wei
Xinle Pang
Tianning Zhang
Yu Sun
Xun Qian
Chen Lin
Han-Sen Zhong
Wanli Ouyang
RALM
36
0
0
19 May 2024
An Embarrassingly Simple Approach to Enhance Transformer Performance in Genomic Selection for Crop Breeding
Renqi Chen
Wenwei Han
Haohao Zhang
Haoyang Su
Zhefan Wang
Xiaolei Liu
Hao Jiang
Wanli Ouyang
Nanqing Dong
14
0
0
15 May 2024
Self-Distillation Improves DNA Sequence Inference
Tong Yu
Lei Cheng
Ruslan Khalitov
Erland Brandser Olsson
Zhirong Yang
SyDa
40
0
0
14 May 2024
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling
Siyuan Li
Zedong Wang
Zicheng Liu
Di Wu
Cheng Tan
Jiangbin Zheng
Yufei Huang
Stan Z. Li
40
7
0
13 May 2024
Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity
Zhufeng Li
S. S. Cranganore
Nicholas D. Youngblut
Niki Kilbertus
47
2
0
09 May 2024
BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model
Chenwei Xu
Yu-Chao Huang
Jerry Yao-Chieh Hu
Weijian Li
Ammar Gilani
H. Goan
Han Liu
52
19
0
04 Apr 2024
Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
Jerry Yao-Chieh Hu
Pei-Hsuan Chang
Haozheng Luo
Hong-Yu Chen
Weijian Li
Wei-Po Wang
Han Liu
39
26
0
04 Apr 2024
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling
Yair Schiff
Chia-Hsiang Kao
Aaron Gokaslan
Tri Dao
Albert Gu
Volodymyr Kuleshov
Mamba
27
81
0
05 Mar 2024
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks
Rafael Josip Penić
Tin Vlasic
Roland G. Huber
Yue Wan
M. Šikić
AI4CE
22
27
0
29 Feb 2024
FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics
Chenrui Duan
Z. Zang
Yongjie Xu
Hang He
Zihan Liu
Zijia Song
Ju-Sheng Zheng
Stan Z. Li
13
2
0
24 Feb 2024
Efficient and Scalable Fine-Tune of Language Models for Genome Understanding
Huixin Zhan
Ying Nian Wu
Zijun Zhang
ALM
29
1
0
12 Feb 2024
Progress and Opportunities of Foundation Models in Bioinformatics
Qing Li
Zhihang Hu
Yixuan Wang
Lei Li
Yimin Fan
Irwin King
Le Song
Yu Li
AI4CE
43
9
0
06 Feb 2024
Large Language Models in Plant Biology
H. Lam
Xing Er Ong
Marek Mutwil
11
16
0
05 Jan 2024
Predicting Anti-microbial Resistance using Large Language Models
Hyunwoo Yoo
B. Sokhansanj
James R. Brown
G. Rosen
LM&MA
19
2
0
01 Jan 2024
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e
Chenwei Xu
Jerry Yao-Chieh Hu
A. Narayanan
M. Thieme
V. Nagaslaev
...
Rui Shi
S. Memik
A. Shuping
Kyle Hazelwood
Han Liu
24
2
0
28 Dec 2023
BEND: Benchmarking DNA Language Models on biologically meaningful tasks
Frederikke Isa Marin
Felix Teufel
Marc Horlacher
Dennis Madsen
Dennis Pultz
Ole Winther
Wouter Boomsma
22
34
0
21 Nov 2023
To Transformers and Beyond: Large Language Models for the Genome
Micaela Elisa Consens
Cameron Dufault
Michael Wainberg
Duncan Forster
Mehran Karimzadeh
Hani Goodarzi
Fabian J. Theis
Alan Moses
Bo Wang
LM&MA
MedIm
26
26
0
13 Nov 2023
BarcodeBERT: Transformers for Biodiversity Analysis
Pablo Millán Arias
Niousha Sadjadi
Monireh Safari
ZeMing Gong
Austin T. Wang
...
Iuliia Zarubiieva
Dirk Steinke
Lila Kari
Angel X. Chang
Graham W. Taylor
50
7
0
04 Nov 2023
Splicing Up Your Predictions with RNA Contrastive Learning
Phil Fradkin
Ruian Shi
Bo Wang
Brendan Frey
Leo J. Lee
SSL
29
0
0
12 Oct 2023
Embed-Search-Align: DNA Sequence Alignment using Transformer Models
Pavan Holur
Kenneth C. Enevoldsen
Shreyas Rajesh
L. Mboning
Thalia Georgiou
Louis-S. Bouchard
Matteo Pellegrini
V. Roychowdhury
25
0
0
20 Sep 2023
1
2
Next