Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,783 papers shown
Title
Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification
Mujeen Sung
James Gung
Elman Mansimov
Nikolaos Pappas
Raphael Shu
Salvatore Romeo
Yi Zhang
Vittorio Castelli
65
7
0
24 May 2023
#REVAL: a semantic evaluation framework for hashtag recommendation
Areej Alsini
D. Huynh
A. Datta
46
0
0
24 May 2023
Machine Reading Comprehension using Case-based Reasoning
Dung Ngoc Thai
Dhruv Agarwal
Mudit Chaudhary
Wenlong Zhao
Rajarshi Das
Manzil Zaheer
J. Lee
Hannaneh Hajishirzi
Andrew McCallum
101
1
0
24 May 2023
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content
Shuyang Cao
Lu Wang
63
5
0
24 May 2023
Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models
Amirhossein Kazemnejad
Mehdi Rezagholizadeh
Prasanna Parthasarathi
Sarath Chandar
ELM
60
2
0
24 May 2023
David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs
Xiaochuang Han
Sachin Kumar
Yulia Tsvetkov
Marjan Ghazvininejad
DiffM
103
4
0
24 May 2023
Allies: Prompting Large Language Model with Beam Search
Hao Sun
Xiao Liu
Yeyun Gong
Yan Zhang
Daxin Jiang
Linjun Yang
Nan Duan
RALM
90
6
0
24 May 2023
Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization
Shoujie Tong
Heming Xia
Damai Dai
Runxin Xu
Tianyu Liu
Binghuai Lin
Yunbo Cao
Zhifang Sui
51
0
0
24 May 2023
DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade
Zefan Cai
Xin Zheng
Tianyu Liu
Xu Wang
H. Meng
Jiaqi Han
Gang Yuan
Binghuai Lin
Baobao Chang
Yunbo Cao
70
4
0
24 May 2023
ATLAS: Automatically Detecting Discrepancies Between Privacy Policies and Privacy Labels
Akshatha Jain
David Rodríguez Torrado
J. D. Álamo
Norman M. Sadeh
75
14
0
24 May 2023
SenteCon: Leveraging Lexicons to Learn Human-Interpretable Language Representations
Victoria Lin
Louis-Philippe Morency
MILM
63
1
0
24 May 2023
SELFOOD: Self-Supervised Out-Of-Distribution Detection via Learning to Rank
Dheeraj Mekala
Adithya Samavedhi
Chengyu Dong
Jingbo Shang
OODD
57
2
0
24 May 2023
A Causal View of Entity Bias in (Large) Language Models
Fei Wang
Wen-An Mo
Yiwei Wang
Wenxuan Zhou
Muhao Chen
84
15
0
24 May 2023
TACR: A Table-alignment-based Cell-selection and Reasoning Model for Hybrid Question-Answering
Jian Wu
Yicheng Xu
Yan Gao
Jian-Guang Lou
Börje F. Karlsson
Manabu Okumura
LMTD
59
3
0
24 May 2023
You Are What You Annotate: Towards Better Models through Annotator Representations
Naihao Deng
Xinliang Frederick Zhang
Siyang Liu
Winston Wu
Lu Wang
Rada Mihalcea
63
21
0
24 May 2023
Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition Extraction
Anna Martin-Boyle
Andrew Head
Kyle Lo
Risham Sidhu
Marti A. Hearst
Dongyeop Kang
55
1
0
24 May 2023
Abductive Commonsense Reasoning Exploiting Mutually Exclusive Explanations
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
69
20
0
24 May 2023
COMET-M: Reasoning about Multiple Events in Complex Sentences
Sahithya Ravi
R. Ng
Vered Shwartz
LRM
ReLM
72
3
0
24 May 2023
Learning Semantic Role Labeling from Compatible Label Sequences
Tao Li
Ghazaleh Kazeminejad
S. Brown
Martha Palmer
Vivek Srikumar
57
1
0
24 May 2023
Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations
James Y. Huang
Wenlin Yao
Kaiqiang Song
Hongming Zhang
Muhao Chen
Dong Yu
68
6
0
24 May 2023
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator
Ziwei He
Meng Yang
Minwei Feng
Jingcheng Yin
Xiang Wang
Jingwen Leng
Zhouhan Lin
ViT
97
14
0
24 May 2023
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding
Li Sun
F. Luisier
Kayhan Batmanghelich
D. Florêncio
Changrong Zhang
VLM
44
6
0
23 May 2023
Few-shot Unified Question Answering: Tuning Models or Prompts?
Srijan Bansal
Semih Yavuz
Bo Pang
Meghana Moorthy Bhat
Yingbo Zhou
104
2
0
23 May 2023
All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations
Yuxin Ren
Qipeng Guo
Zhijing Jin
Shauli Ravfogel
Mrinmaya Sachan
Bernhard Schölkopf
Ryan Cotterell
77
4
0
23 May 2023
Sources of Hallucination by Large Language Models on Inference Tasks
Nick McKenna
Tianyi Li
Liang Cheng
Mohammad Javad Hosseini
Mark Johnson
Mark Steedman
LRM
HILM
105
201
0
23 May 2023
Deduction under Perturbed Evidence: Probing Student Simulation Capabilities of Large Language Models
Shashank Sonkar
Richard G. Baraniuk
31
1
0
23 May 2023
Do prompt positions really matter?
Junyu Mao
Stuart E. Middleton
Mahesan Niranjan
VLM
65
6
0
23 May 2023
Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA
David Heineman
Yao Dou
Mounica Maddela
Wei Xu
100
17
0
23 May 2023
On Robustness of Finetuned Transformer-based NLP Models
Pavan Kalyan Reddy Neerudu
Subba Reddy Oota
Mounika Marreddy
Venkateswara Rao Kagita
Manish Gupta
83
9
0
23 May 2023
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions
Zhihan Zhang
Wenhao Yu
Zheng Ning
Mingxuan Ju
Meng Jiang
72
4
0
23 May 2023
Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation
Da Yin
Xiao Liu
Fan Yin
Ming Zhong
Hritik Bansal
Jiawei Han
Kai-Wei Chang
ALM
101
39
0
23 May 2023
TalkUp: Paving the Way for Understanding Empowering Language
Lucille Njoo
Chan Young Park
Octavia Stappart
Marvin Thielk
Yi Chu
Yulia Tsvetkov
88
3
0
23 May 2023
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
Zhongfu Chen
Kun Zhou
Beichen Zhang
Zheng Gong
Wayne Xin Zhao
Ji-Rong Wen
KELM
LRM
126
31
0
23 May 2023
QLoRA: Efficient Finetuning of Quantized LLMs
Tim Dettmers
Artidoro Pagnoni
Ari Holtzman
Luke Zettlemoyer
ALM
165
2,643
0
23 May 2023
Debiasing should be Good and Bad: Measuring the Consistency of Debiasing Techniques in Language Models
Robert D Morabito
Jad Kabbara
Ali Emami
42
7
0
23 May 2023
USB: A Unified Summarization Benchmark Across Tasks and Domains
Kundan Krishna
Prakhar Gupta
S. Ramprasad
Byron C. Wallace
Jeffrey P. Bigham
Zachary Chase Lipton
HILM
91
8
0
23 May 2023
Masked Path Modeling for Vision-and-Language Navigation
Zi-Yi Dou
Feng Gao
Nanyun Peng
LM&Ro
83
3
0
23 May 2023
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Manuel Tran
Yashin Dicente Cid
Amal Lahiani
Fabian J. Theis
Tingying Peng
Eldad Klaiman
56
2
0
23 May 2023
Revisiting Machine Translation for Cross-lingual Classification
Mikel Artetxe
Vedanuj Goswami
Shruti Bhosale
Angela Fan
Luke Zettlemoyer
LRM
102
39
0
23 May 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
80
4
0
23 May 2023
Domain Private Transformers for Multi-Domain Dialog Systems
Anmol Kabra
Ethan R. Elenberg
52
0
0
23 May 2023
DetGPT: Detect What You Need via Reasoning
Renjie Pi
Jiahui Gao
Shizhe Diao
Boyao Wang
Hanze Dong
...
Lewei Yao
Jianhua Han
Hang Xu
Lingpeng Kong Tong Zhang
Tong Zhang
LRM
LM&Ro
86
99
0
23 May 2023
Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection
David Dukić
Kiril Gashteovski
Goran Glavaš
Jan vSnajder
73
1
0
23 May 2023
WYWEB: A NLP Evaluation Benchmark For Classical Chinese
Bo Zhou
Qianglong Chen
Tianyu Wang
Xiaoshi Zhong
Yin Zhang
ELM
119
10
0
23 May 2023
CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Aswanth Kumar
Ratish Puduppully
Raj Dabre
Anoop Kunchukuttan
103
13
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
129
3
0
23 May 2023
Revisiting Acceptability Judgements
Hai Hu
Ziyin Zhang
Wei-Ping Huang
J. Lai
Aini Li
Yi Ma
Jiahui Huang
Peng Zhang
Chien-Jer Charles Lin
Rui Wang
71
2
0
23 May 2023
Disentangled Variational Autoencoder for Emotion Recognition in Conversations
Kailai Yang
Tianlin Zhang
Sophia Ananiadou
DRL
95
11
0
23 May 2023
Assessing Linguistic Generalisation in Language Models: A Dataset for Brazilian Portuguese
Rodrigo Wilkens
Leonardo Zilio
Aline Villavicencio
58
1
0
23 May 2023
Can Language Models Understand Physical Concepts?
Lei Li
Jingjing Xu
Qingxiu Dong
Ce Zheng
Qi Liu
Lingpeng Kong
Xu Sun
ALM
61
22
0
23 May 2023
Previous
1
2
3
...
101
102
103
...
214
215
216
Next