1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,814 papers shown
Title
Adversarial Transformer Language Models for Contextual Commonsense Inference
Pedro Colon-Hernandez
H. Lieberman
Yida Xin
Claire Yin
C. Breazeal
Peter Chin
85
2
0
10 Feb 2023
Realistic Conversational Question Answering with Answer Selection based on Calibrated Confidence and Uncertainty Measurement
Soyeong Jeong
Jinheon Baek
Sung Ju Hwang
Jong C. Park
68
2
0
10 Feb 2023
Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information
Yen-Ting Lin
Alexandros Papangelis
Seokhwan Kim
Sungjin Lee
Devamanyu Hazarika
Mahdi Namazifar
Di Jin
Yang Liu
Dilek Z. Hakkani-Tür
79
37
0
10 Feb 2023
Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval
Ben Chen
Linbo Jin
Xinxin Wang
D. Gao
Wen Jiang
Wei Ning
70
3
0
10 Feb 2023
ControversialQA: Exploring Controversy in Question Answering
Zhen Wang
Peide Zhu
Jie Yang
87
1
0
10 Feb 2023
Is Multimodal Vision Supervision Beneficial to Language?
Avinash Madasu
Vasudev Lal
66
4
0
10 Feb 2023
Event Temporal Relation Extraction with Bayesian Translational Model
Xingwei Tan
Gabriele Pergola
Yulan He
AI4TS
90
12
0
10 Feb 2023
Knowledge is a Region in Weight Space for Fine-tuned Language Models
Almog Gueta
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
90
52
0
09 Feb 2023
FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning
Yucheng Li
Shunyu Wang
Chenghua Lin
Frank Guerin
Loïc Barrault
73
27
0
09 Feb 2023
Efficient Attention via Control Variates
Lin Zheng
Jianbo Yuan
Chong-Jun Wang
Lingpeng Kong
139
20
0
09 Feb 2023
A Large-Scale Analysis of Persian Tweets Regarding Covid-19 Vaccination
Taha ShabaniMirzaei
Houmaan Chamani
Amirhossein Abaskohi
Zhivar Sourati Hassan Zadeh
B. Bahrak
40
1
0
09 Feb 2023
Global Constraints with Prompting for Zero-Shot Event Argument Classification
Zizheng Lin
Hongming Zhang
Yangqiu Song
70
16
0
09 Feb 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
Yue Wu
Yewen Fan
Paul Pu Liang
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
OffRL
91
53
0
09 Feb 2023
Enhancing E-Commerce Recommendation using Pre-Trained Language Model and Fine-Tuning
Nuofan Xu
Chenhui Hu
23
2
0
09 Feb 2023
Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow
Anjana Arunkumar
Swaroop Mishra
Bhavdeep Singh Sachdeva
Chitta Baral
Chris Bryan
56
0
0
09 Feb 2023
CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation Learning-Based Text Classification Model for Insurance Data
Amir Namavar Jahromi
Ebrahim Pourjafari
H. Karimipour
Amit Satpathy
Lovell Hodge
62
3
0
08 Feb 2023
DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule
Maor Ivgi
Oliver Hinder
Y. Carmon
ODL
157
66
0
08 Feb 2023
GPTScore: Evaluate as You Desire
Jinlan Fu
See-Kiong Ng
Zhengbao Jiang
Pengfei Liu
LM&MA
ALM
ELM
194
292
0
08 Feb 2023
Prompting for Multimodal Hateful Meme Classification
Rui Cao
Roy Ka-wei Lee
Wen-Haw Chong
Jing Jiang
VLM
85
83
0
08 Feb 2023
Training-free Lexical Backdoor Attacks on Language Models
Yujin Huang
Terry Yue Zhuo
Xingliang Yuan
Han Hu
Lizhen Qu
Chunyang Chen
SILM
97
46
0
08 Feb 2023
Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models
Mohammadreza Banaei
Klaudia Bałazy
Artur Kasymov
R. Lebret
Jacek Tabor
Karl Aberer
OffRL
52
0
0
08 Feb 2023
Leveraging Summary Guidance on Medical Report Summarization
Yunqi Zhu
Xuebing Yang
Yuanyuan Wu
Wensheng Zhang
63
11
0
08 Feb 2023
Is ChatGPT a General-Purpose Natural Language Processing Task Solver?
Chengwei Qin
Aston Zhang
Zhuosheng Zhang
Jiaao Chen
Michihiro Yasunaga
Diyi Yang
LM&MA
AI4MH
LRM
ELM
176
707
0
08 Feb 2023
Improving (Dis)agreement Detection with Inductive Social Relation Information From Comment-Reply Interactions
Yun Luo
Zihan Liu
Stan Z. Li
Yue Zhang
42
7
0
08 Feb 2023
CCRep: Learning Code Change Representations via Pre-Trained Code Model and Query Back
Zhongxin Liu
Zhijie Tang
Xin Xia
Xiaohu Yang
SSL
57
21
0
08 Feb 2023
COMBO: A Complete Benchmark for Open KG Canonicalization
Chengyue Jiang
Yong Jiang
Weiqi Wu
Yuting Zheng
Pengjun Xie
Kewei Tu
70
2
0
08 Feb 2023
Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories
Suyu Ge
Chenyan Xiong
Corby Rosset
Arnold Overwijk
Jiawei Han
Paul N. Bennett
VLM
65
6
0
07 Feb 2023
Temporal Robustness against Data Poisoning
Wenxiao Wang
Soheil Feizi
AAML
OOD
86
12
0
07 Feb 2023
Cluster-Level Contrastive Learning for Emotion Recognition in Conversations
Kailai Yang
Tianlin Zhang
Hassan Alhuzali
Sophia Ananiadou
87
44
0
07 Feb 2023
Entity-Aware Dual Co-Attention Network for Fake News Detection
Sin-Han Yang
Chung-Chi Chen
Hen-Hsen Huang
Hsin-Hsi Chen
75
7
0
07 Feb 2023
What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories
Oscar Sainz
Oier López de Lacalle
Eneko Agirre
German Rigau
77
7
0
07 Feb 2023
The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study
Yu Zhang
Bowen Jin
Qi Zhu
Yu Meng
Jiawei Han
92
20
0
07 Feb 2023
Continual Pre-training of Language Models
Zixuan Ke
Yijia Shao
Haowei Lin
Tatsuya Konishi
Gyuhak Kim
Bin Liu
CLL
KELM
159
140
0
07 Feb 2023
Capturing Topic Framing via Masked Language Modeling
Xiaobo Guo
Weicheng Ma
Soroush Vosoughi
48
2
0
07 Feb 2023
Data Selection for Language Models via Importance Resampling
Sang Michael Xie
Shibani Santurkar
Tengyu Ma
Percy Liang
134
196
0
06 Feb 2023
Techniques to Improve Neural Math Word Problem Solvers
Youyuan Zhang
AIMat
50
1
0
06 Feb 2023
Efficient and Flexible Topic Modeling using Pretrained Embeddings and Bag of Sentences
Johannes Schneider
97
3
0
06 Feb 2023
MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields
Jiaying Lu
Yongchen Qian
Shifan Zhao
Yuanzhe Xi
Carl Yang
VLM
76
4
0
06 Feb 2023
Computation vs. Communication Scaling for Future Transformers on Future Hardware
Suchita Pati
Shaizeen Aga
Mahzabeen Islam
Nuwan Jayasena
Matthew D. Sinclair
68
10
0
06 Feb 2023
Exploring Data Augmentation for Code Generation Tasks
Pinzhen Chen
Gerasimos Lampouras
103
10
0
05 Feb 2023
Precursor recommendation for inorganic synthesis by machine learning materials similarity from scientific literature
T. He
Haoyan Huo
Christopher J. Bartel
Zheren Wang
Kevin Cruse
Gerbrand Ceder
67
33
0
05 Feb 2023
Construction Grammar Provides Unique Insight into Neural Language Models
Leonie Weissweiler
Taiqi He
Naoki Otani
David R. Mortensen
Lori S. Levin
Hinrich Schütze
78
15
0
04 Feb 2023
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning
Jingqiang Chen
73
4
0
04 Feb 2023
The Science of Detecting LLM-Generated Texts
Ruixiang Tang
Yu-Neng Chuang
Helen Zhou
DeLMO
115
180
0
04 Feb 2023
Lived Experience Matters: Automatic Detection of Stigma on Social Media Toward People Who Use Substances
Salvatore Giorgi
Douglas Bellew
Daniel Roy Sadek Habib
G. Sherman
Joao Sedoc
Chase Smitterberg
Amanda Devoto
McKenzie Himelein-Wachowiak
Brenda L. Curtis
24
3
0
04 Feb 2023
Representation Deficiency in Masked Language Modeling
Yu Meng
Jitin Krishnan
Sinong Wang
Qifan Wang
Yuning Mao
Han Fang
Marjan Ghazvininejad
Jiawei Han
Luke Zettlemoyer
159
7
0
04 Feb 2023
Towards Few-Shot Identification of Morality Frames using In-Context Learning
Shamik Roy
Nishanth Nakshatri
Dan Goldwasser
92
11
0
03 Feb 2023
Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
K. Choromanski
Shanda Li
Valerii Likhosherstov
Kumar Avinava Dubey
Shengjie Luo
Di He
Yiming Yang
Tamás Sarlós
Thomas Weingarten
Adrian Weller
108
8
0
03 Feb 2023
Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach
Tanwi Mallick
Joshua Bergerson
Duane R. Verner
John K Hutchison
L. Levy
Prasanna Balaprakash
69
4
0
03 Feb 2023
LIQUID: A Framework for List Question Answering Dataset Generation
Seongyun Lee
Hyunjae Kim
Jaewoo Kang
RALM
81
19
0
03 Feb 2023