Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 1,211 papers shown
Title
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
Ming Cheng
Jiaying Gong
Chenhan Yuan
William A. Ingram
Edward A. Fox
Hoda Eldardiry
141
1
0
07 Nov 2024
DELIFT: Data Efficient Language model Instruction Fine Tuning
Ishika Agarwal
Krishnateja Killamsetty
Lucian Popa
Marina Danilevksy
ALM
VLM
97
3
0
07 Nov 2024
ATM: Improving Model Merging by Alternating Tuning and Merging
Luca Zhou
Daniele Solombrino
Donato Crisostomi
Maria Sofia Bucarelli
Fabrizio Silvestri
Emanuele Rodolà
MoMe
84
5
0
05 Nov 2024
Navigating Extremes: Dynamic Sparsity in Large Output Spaces
Nasib Ullah
Erik Schultheis
Mike Lasby
Yani Andrew Ioannou
Rohit Babbar
52
0
0
05 Nov 2024
Specialized Foundation Models Struggle to Beat Supervised Baselines
Zongzhe Xu
Ritvik Gupta
Wenduo Cheng
Alexander Shen
Junhong Shen
Ameet Talwalkar
M. Khodak
AI4CE
77
7
0
05 Nov 2024
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Md Rifat Arefin
G. Subbaraj
Nicolas Angelard-Gontier
Yann LeCun
Irina Rish
Ravid Shwartz-Ziv
C. Pal
LRM
388
1
0
04 Nov 2024
MdEval: Massively Multilingual Code Debugging
Shukai Liu
Linzheng Chai
Jian Yang
Jiajun Shi
He Zhu
...
Yu Hao
Liqun Yang
Guanglin Niu
Ge Zhang
Zheng Li
LRM
ELM
103
6
0
04 Nov 2024
TriG-NER: Triplet-Grid Framework for Discontinuous Named Entity Recognition
R. Cabral
Soyeon Caren Han
Areej Alhassan
Riza Batista-Navarro
Goran Nenadic
Josiah Poon
59
0
0
04 Nov 2024
A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why?
QiHong Chen
Jiawei Li
Jiecheng Deng
Jiachen Yu
Justin Tian Jin Chen
Iftekhar Ahmed
86
1
0
03 Nov 2024
Music Foundation Model as Generic Booster for Music Downstream Tasks
Weihsiang Liao
Yuhta Takida
Yukara Ikemiya
Zhi-Wei Zhong
Chieh-Hsin Lai
...
Stefan Uhlich
Taketo Akama
Woosung Choi
Yuichiro Koyama
Yuki Mitsufuji
138
1
0
02 Nov 2024
Human-inspired Perspectives: A Survey on AI Long-term Memory
Zihong He
Weizhe Lin
Hao Zheng
Fan Zhang
Matt Jones
Laurence Aitchison
X. Xu
Miao Liu
Per Ola Kristensson
Junxiao Shen
121
2
0
01 Nov 2024
Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification
Shengxun Wei
Zan Gao
Yibo Zhao
Weili Guan
Weili Guan
Shengyong Chen
90
2
0
01 Nov 2024
FRoundation: Are Foundation Models Ready for Face Recognition?
Tahar Chettaoui
Naser Damer
Fadi Boutros
CVBM
55
7
0
31 Oct 2024
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map
Xinyuan Chang
Maixuan Xue
Xinran Liu
Zheng Pan
Xing Wei
118
2
0
31 Oct 2024
ViT-LCA: A Neuromorphic Approach for Vision Transformers
Sanaz Mahmoodi Takaghaj
ViT
97
1
0
31 Oct 2024
Dynamic Uncertainty Ranking: Enhancing Retrieval-Augmented In-Context Learning for Long-Tail Knowledge in LLMs
Shuyang Yu
Runxue Bao
Parminder Bhatia
Taha A. Kass-Hout
Jiayu Zhou
Cao Xiao
107
1
0
31 Oct 2024
Generative AI-Powered Plugin for Robust Federated Learning in Heterogeneous IoT Networks
Youngjoon Lee
J. Gong
Joonhyuk Kang
70
0
0
31 Oct 2024
EchoFM: Foundation Model for Generalizable Echocardiogram Analysis
Sekeun Kim
Pengfei Jin
S. Song
Cheng Chen
Yiwei Li
Hui Ren
Xiang Li
Tianming Liu
Quanzheng Li
62
0
0
30 Oct 2024
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
Lam Nguyen Tung
Steven Cho
Xiaoning Du
Neelofar Neelofar
Valerio Terragni
Stefano Ruberto
Aldeida Aleti
416
2
0
30 Oct 2024
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
Günter Klambauer
Razvan Pascanu
Sepp Hochreiter
172
5
0
29 Oct 2024
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices
Ming Kang
F. F. Ting
Raphaël C.-W. Phan
C. Ting
ViT
MedIm
133
1
0
29 Oct 2024
Hierarchical mixtures of Unigram models for short text clustering: The role of Beta-Liouville priors
Massimo Bilancia
Samuele Magro
58
0
0
29 Oct 2024
MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression
Noel Elias
H. Esfahanizadeh
Kaan Kale
S. Vishwanath
Muriel Médard
50
0
0
28 Oct 2024
SubjECTive-QA: Measuring Subjectivity in Earnings Call Transcripts' QA Through Six-Dimensional Feature Analysis
Huzaifa Pardawala
Siddhant Sukhani
Agam Shah
Veer Kejriwal
Abhishek Pillai
Rohan Bhasin
Andrew DiBiasio
Tarun Mandapati
Dhruv Adha
Sudheer Chava
163
1
0
28 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
64
3
0
28 Oct 2024
Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification
Hsun-Yu Kuo
Yin-Hsiang Liao
Yu-Chieh Chao
Wei-Yun Ma
Pu-Jen Cheng
SyDa
85
3
0
28 Oct 2024
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
71
3
0
28 Oct 2024
MAMMAL -- Molecular Aligned Multi-Modal Architecture and Language
Yoel Shoshan
Moshiko Raboh
Michal Ozery-Flato
Vadim Ratner
Alex Golts
...
Sharon Kurant
Joseph A. Morrone
Parthasarathy Suryanarayanan
Michal Rosen-Zvi
Efrat Hexter
66
1
0
28 Oct 2024
Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation
Mufei Li
Siqi Miao
Pan Li
RALM
110
11
0
28 Oct 2024
Estimating Causal Effects of Text Interventions Leveraging LLMs
Siyi Guo
Myrl G. Marmarelis
Fred Morstatter
Kristina Lerman
CML
382
0
0
28 Oct 2024
UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers
Dehai Min
Zhiyang Xu
Guilin Qi
Lifu Huang
Chenyu You
RALM
107
2
0
26 Oct 2024
Vulnerability of LLMs to Vertically Aligned Text Manipulations
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Zhen Xiong
Nanyun Peng
Kai-Wei Chang
104
1
0
26 Oct 2024
Chemical Language Model Linker: blending text and molecules with modular adapters
Yifan Deng
Spencer S. Ericksen
Anthony Gitter
73
2
0
26 Oct 2024
GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing
Hosam Elgendy
Ahmed Sharshar
Ahmed Aboeitta
Yasser Ashraf
Mohsen Guizani
54
2
0
25 Oct 2024
Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges
Farid Ariai
Gianluca Demartini
ELM
AILaw
VLM
57
4
0
25 Oct 2024
Graph Linearization Methods for Reasoning on Graphs with Large Language Models
Christos Xypolopoulos
Guokan Shang
Xiao Fei
Giannis Nikolentzos
Hadi Abdine
Iakovos Evdaimon
Michail Chatzianastasis
Giorgos Stamou
Michalis Vazirgiannis
62
1
0
25 Oct 2024
Probabilistic Language-Image Pre-Training
Sanghyuk Chun
Wonjae Kim
Song Park
Sangdoo Yun
MLLM
VLM
CLIP
365
4
2
24 Oct 2024
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey
Xinyu Wang
Wenbo Zhang
Sarah Rajtmajer
52
2
0
24 Oct 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
95
5
0
24 Oct 2024
End-to-end Training for Recommendation with Language-based User Profiles
Zhaolin Gao
Joyce Zhou
Yijia Dai
Thorsten Joachims
AI4Ed
91
3
0
24 Oct 2024
Cross-lingual Transfer of Reward Models in Multilingual Alignment
Jiwoo Hong
Noah Lee
Rodrigo Martínez-Castaño
César Rodríguez
James Thorne
76
5
0
23 Oct 2024
Lightweight Neural App Control
Filippos Christianos
Georgios Papoudakis
Thomas Coste
Jianye Hao
Jun Wang
Kun Shao
LM&Ro
69
5
0
23 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
124
0
0
23 Oct 2024
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Shansan Gong
Shivam Agarwal
Yizhe Zhang
Jiacheng Ye
Lin Zheng
...
Peilin Zhao
W. Bi
Jiawei Han
Hao Peng
Dianbo Sui
AI4CE
98
24
0
23 Oct 2024
MiniPLM: Knowledge Distillation for Pre-Training Language Models
Yuxian Gu
Hao Zhou
Fandong Meng
Jie Zhou
Minlie Huang
122
5
0
22 Oct 2024
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
Taewhoo Lee
Chanwoong Yoon
Kyochul Jang
Donghyeon Lee
Minju Song
Hyunjae Kim
Jaewoo Kang
ELM
61
1
0
22 Oct 2024
Conjuring Semantic Similarity
Tian Yu Liu
Stefano Soatto
DiffM
98
0
0
21 Oct 2024
Machine Learning Approaches for Mental Illness Detection on Social Media: A Systematic Review of Biases and Methodological Challenges
Yuchen Cao
Jianglai Dai
Zhongyan Wang
Yeyubei Zhang
Xiaorui Shen
Yunchong Liu
Yexin Tian
47
5
0
21 Oct 2024
Opportunities and Challenges of Generative-AI in Finance
Akshar Prabhu Desai
Ganesh Satish Mallya
Mohammad Luqman
Tejasvi Ravi
Nithya Kota
Pranjul Yadav
AIFin
84
3
0
21 Oct 2024
AUTALIC: A Dataset for Anti-AUTistic Ableist Language In Context
Naba Rizvi
Harper Strickland
Daniel Gitelman
Tristan Cooper
Alexis Morales-Flores
...
Haaset Owens
Saleha Ahmedi
Isha Khirwadkar
Imani Munyaka
Nedjma Ousidhoum
90
1
0
21 Oct 2024
Previous
1
2
3
...
11
12
13
...
23
24
25
Next