Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 1,211 papers shown
Title
CogLM: Tracking Cognitive Development of Large Language Models
Xinglin Wang
Peiwen Yuan
Shaoxiong Feng
Yiwei Li
Boyuan Pan
Heda Wang
Yao Hu
Kan Li
ELM
76
0
0
17 Aug 2024
Adaptive Uncertainty Quantification for Generative AI
Jungeum Kim
Sean O'Hagan
Veronika Rockova
MedIm
251
3
0
16 Aug 2024
W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering
Jinming Nian
Zhiyuan Peng
Qifan Wang
Yi Fang
RALM
106
2
0
15 Aug 2024
Normalized AOPC: Fixing Misleading Faithfulness Metrics for Feature Attribution Explainability
Joakim Edin
Andreas Geert Motzfeldt
Casper L. Christensen
Tuukka Ruotsalo
Lars Maaløe
Maria Maistro
92
4
0
15 Aug 2024
Masked Image Modeling: A Survey
Vlad Hondru
Florinel-Alin Croitoru
Shervin Minaee
Radu Tudor Ionescu
N. Sebe
104
8
0
13 Aug 2024
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models
Hila Gonen
Terra Blevins
Alisa Liu
Luke Zettlemoyer
Noah A. Smith
69
5
0
12 Aug 2024
VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing
Chunyu Qiang
Wang Geng
Yi Zhao
Ruibo Fu
Tao Wang
...
Chen Zhang
Hao Che
L. Wang
Jianwu Dang
J. Tao
AI4TS
60
0
0
11 Aug 2024
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning
Yuze Zhao
Jintao Huang
Jinghan Hu
Xingjun Wang
Yunlin Mao
...
Zhikai Wu
Baole Ai
Ang Wang
Wenmeng Zhou
Yingda Chen
64
36
0
10 Aug 2024
Range Membership Inference Attacks
Jiashu Tao
Reza Shokri
85
1
0
09 Aug 2024
Random Walk Diffusion for Efficient Large-Scale Graph Generation
Tobias Bernecker
Ghalia Rehawi
Francesco Paolo Casale
Janine Knauer-Arloth
Annalisa Marsico
53
1
0
08 Aug 2024
LLM-based MOFs Synthesis Condition Extraction using Few-Shot Demonstrations
Lei Shi
Zhimeng Liu
Yi Yang
Weize Wu
Yuyang Zhang
...
Zipeng Liu
Huobin Tan
Hongyi Gao
Yue Zhang
Ge Wang
77
0
0
06 Aug 2024
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Zi Liang
Haibo Hu
Qingqing Ye
Yaxin Xiao
Haoyang Li
AAML
ELM
SILM
102
6
0
05 Aug 2024
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
Haolin Jin
Linghan Huang
Haipeng Cai
Jun Yan
Bo Li
Huaming Chen
102
34
0
05 Aug 2024
Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services
Shaopeng Fu
Xuexue Sun
Ke Qing
Tianhang Zheng
Di Wang
AAML
MIACV
SILM
97
0
0
05 Aug 2024
A new approach for encoding code and assisting code understanding
Mengdan Fan
Changde Du
Haiyan Zhao
Zhi Jin
85
0
0
01 Aug 2024
State-observation augmented diffusion model for nonlinear assimilation with unknown dynamics
Zhuoyuan Li
Bin Dong
Linyue Chu
68
0
0
31 Jul 2024
GOProteinGNN: Leveraging Protein Knowledge Graphs for Protein Representation Learning
Dan Kalifa
Uriel Singer
Kira Radinsky
94
1
0
31 Jul 2024
Con4m: Context-aware Consistency Learning Framework for Segmented Time Series Classification
Junru Chen
Tianyu Cao
Ninon De Mecquenem
Jiahe Li
Zhilong Chen
F. Friederici
Yang Yang
81
1
0
31 Jul 2024
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
Jinfa Huang
Jinsheng Pan
Zhongwei Wan
Hanjia Lyu
Jiebo Luo
75
5
0
30 Jul 2024
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
Yupeng Chen
Senmiao Wang
Zhihang Lin
Zhihang Lin
Yushun Zhang
Tian Ding
Ruoyu Sun
Ruoyu Sun
CLL
125
3
0
30 Jul 2024
Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos
Dhruv Verma
Debaditya Roy
Basura Fernando
56
1
0
30 Jul 2024
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi
Hyeyoon Lee
Dain Kwon
Sunjong Park
Kyuyeun Kim
Noseong Park
Jinho Lee
Jinho Lee
MQ
84
1
0
29 Jul 2024
Learn while Unlearn: An Iterative Unlearning Framework for Generative Language Models
Haoyu Tang
Ye Liu
Xukai Liu
Xukai Liu
Yanghai Zhang
Kai Zhang
Xiaofang Zhou
Enhong Chen
MU
85
3
0
25 Jul 2024
Sentiment Reasoning for Healthcare
Khai-Nguyen Nguyen
Khai Le-Duc
Bach Phan Tat
Duy Le
Jerry Ngo
Long Vo-Dang
LRM
64
0
0
24 Jul 2024
NV-Retriever: Improving text embedding models with effective hard-negative mining
Gabriel de Souza P. Moreira
Radek Osmulski
Mengyao Xu
Ronay Ak
Benedikt Schifferer
Even Oldridge
RALM
81
40
0
22 Jul 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
82
7
0
22 Jul 2024
A Comprehensive Review of Few-shot Action Recognition
Yuyang Wanyan
Xiaoshan Yang
Weiming Dong
Changsheng Xu
VLM
118
3
0
20 Jul 2024
Evaluating the Reliability of Self-Explanations in Large Language Models
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
LRM
83
0
0
19 Jul 2024
Unipa-GPT: Large Language Models for university-oriented QA in Italian
Irene Siragusa
Roberto Pirrone
65
1
0
19 Jul 2024
Conversational Query Reformulation with the Guidance of Retrieved Documents
Jeonghyun Park
Hwanhee Lee
59
0
0
17 Jul 2024
Large Visual-Language Models Are Also Good Classifiers: A Study of In-Context Multimodal Fake News Detection
Ye Jiang
Yimin Wang
MLLM
84
1
0
16 Jul 2024
Exploring the Potentials and Challenges of Deep Generative Models in Product Design Conception
Phillip Mueller
Lars Mikelsons
AI4CE
69
2
0
15 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
66
2
0
12 Jul 2024
DAHRS: Divergence-Aware Hallucination-Remediated SRL Projection
Sangpil Youm
Brodie Mather
Chathuri Jayaweera
Juliana Prada
Bonnie J. Dorr
VLM
66
0
0
12 Jul 2024
Transformer Layers as Painters
Qi Sun
Marc Pickett
Aakash Kumar Nain
Llion Jones
AI4CE
71
18
0
12 Jul 2024
Enrich the content of the image Using Context-Aware Copy Paste
Qiushi Guo
VLM
110
0
0
11 Jul 2024
Bootstrapping Vision-language Models for Self-supervised Remote Physiological Measurement
Zijie Yue
Miaojing Shi
Hanli Wang
Shuai Ding
Qijun Chen
Shanlin Yang
65
0
0
11 Jul 2024
FsPONER: Few-shot Prompt Optimization for Named Entity Recognition in Domain-specific Scenarios
Yongjian Tang
Rakebul Hasan
Thomas Runkler
104
2
0
10 Jul 2024
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo
Florian E. Dorner
Moritz Hardt
ELM
84
7
1
10 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
102
10
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
72
49
0
09 Jul 2024
On Speeding Up Language Model Evaluation
Jin Peng Zhou
Christian K. Belardi
Ruihan Wu
Travis Zhang
Carla P. Gomes
Wen Sun
Kilian Q. Weinberger
78
1
0
08 Jul 2024
OneDiff: A Generalist Model for Image Difference Captioning
Erdong Hu
Longteng Guo
Tongtian Yue
Zijia Zhao
Shuning Xue
Jing Liu
VLM
55
2
0
08 Jul 2024
The infrastructure powering IBM's Gen AI model development
Talia Gershon
Seetharami Seelam
Brian M. Belgodere
Milton Bonilla
Lan Hoang
...
Ruchir Puri
Dakshi Agrawal
Drew Thorstensen
Joel Belog
Brent Tang
VLM
68
5
0
07 Jul 2024
A Principled Framework for Evaluating on Typologically Diverse Languages
Esther Ploeger
Wessel Poelman
Andreas Holck Høeg-Petersen
Anders Schlichtkrull
Miryam de Lhoneux
Johannes Bjerva
82
1
0
06 Jul 2024
CountGD: Multi-Modal Open-World Counting
Niki Amini-Naieni
Tengda Han
Andrew Zisserman
ObjD
104
11
0
05 Jul 2024
HAF-RM: A Hybrid Alignment Framework for Reward Model Training
Shujun Liu
Xiaoyu Shen
Yuhang Lai
Siyuan Wang
Shengbin Yue
Zengfeng Huang
Xuanjing Huang
Zhongyu Wei
47
1
0
04 Jul 2024
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models
Xiangyang Li
Kuicai Dong
Yi Quan Lee
Wei Xia
Yichun Yin
Xinyi Dai
Yasheng Wang
Ruiming Tang
120
16
0
03 Jul 2024
What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language Models
Shengqi Zhu
Jeffrey M. Rzeszotarski
KELM
99
1
0
02 Jul 2024
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Kaixin Xu
Zhe Wang
Chunyun Chen
Xue Geng
Jie Lin
Xulei Yang
Min-man Wu
Min Wu
Xiaoli Li
Weisi Lin
ViT
VLM
110
8
0
02 Jul 2024
Previous
1
2
3
...
15
16
17
...
23
24
25
Next