Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,845 papers shown
Title
BERT-Flow-VAE: A Weakly-supervised Model for Multi-Label Text Classification
Ziwen Liu
J. Grau-Bové
Scott Orr
82
1
0
27 Oct 2022
TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack
Yu Cao
Dianqi Li
Meng Fang
Dinesh Manocha
Jun Gao
Yibing Zhan
Dacheng Tao
AAML
83
17
0
27 Oct 2022
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning
Yue Yu
Chenyan Xiong
Si Sun
Chao Zhang
Arnold Overwijk
VLM
OOD
162
22
0
27 Oct 2022
Disentangled and Robust Representation Learning for Bragging Classification in Social Media
Xiang Li
Yucheng Zhou
96
3
0
27 Oct 2022
Dictionary-Assisted Supervised Contrastive Learning
Patrick Y. Wu
Richard Bonneau
Joshua A. Tucker
Jonathan Nagler
CLIP
70
0
0
27 Oct 2022
DyREx: Dynamic Query Representation for Extractive Question Answering
Urchade Zaratiana
Niama El Khbir
Dennis Núñez
Pierre Holat
Nadi Tomeh
Thierry Charnois
115
2
0
26 Oct 2022
Privately Fine-Tuning Large Language Models with Differential Privacy
R. Behnia
Mohammadreza Ebrahimi
Jason L. Pacheco
B. Padmanabhan
135
51
0
26 Oct 2022
Robust Domain Adaptation for Pre-trained Multilingual Neural Machine Translation Models
Mathieu Grosso
Pirashanth Ratnamogan
Alexis Mathey
William Vanhuffel
Michael Fotso Fotso
58
3
0
26 Oct 2022
MABEL: Attenuating Gender Bias using Textual Entailment Data
Jacqueline He
Mengzhou Xia
C. Fellbaum
Danqi Chen
60
32
0
26 Oct 2022
Causality Detection using Multiple Annotation Decisions
Quynh-Anh Nguyen
Arka Mitra
29
2
0
26 Oct 2022
Incorporating Pre-training Paradigm for Antibody Sequence-Structure Co-design
Kaiyuan Gao
Lijun Wu
Jinhua Zhu
Tianbo Peng
Yingce Xia
...
Shufang Xie
Tao Qin
Haiguang Liu
Kun He
Tie-Yan Liu
97
10
0
26 Oct 2022
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks
Colin Leong
Joshua Nemecek
Jacob Mansdorfer
Anna Filighera
A. Owodunni
Daniel Whitenack
VLM
AI4CE
171
29
0
26 Oct 2022
Learning on Large-scale Text-attributed Graphs via Variational Inference
Jianan Zhao
Meng Qu
Chaozhuo Li
Hao Yan
Qian Liu
Rui Li
Xing Xie
Jian Tang
VLM
145
142
0
26 Oct 2022
Analyzing Multi-Task Learning for Abstractive Text Summarization
Frederic Kirstein
Jan Philip Wahle
Terry Ruas
Bela Gipp
83
4
0
26 Oct 2022
Benchmarking Language Models for Code Syntax Understanding
Da Shen
Xinyun Chen
Chenguang Wang
Koushik Sen
Dawn Song
ELM
59
17
0
26 Oct 2022
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning
Yifan Chen
Devamanyu Hazarika
Mahdi Namazifar
Yang Liu
Di Jin
Dilek Z. Hakkani-Tür
71
4
0
26 Oct 2022
Discourse-Aware Emotion Cause Extraction in Conversations
Dexin Kong
Nan Yu
Yun Yuan
Guohong Fu
Chen Gong
55
2
0
26 Oct 2022
Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis
Sudhandar Balakrishnan
Yihao Fang
Xioadan Zhu
48
1
0
26 Oct 2022
Automatic extraction of materials and properties from superconductors scientific literature
Luca Foppiano
P. B. Castro
Pedro Ortiz Suarez
K. Terashima
Y. Takano
Masashi Ishii
74
12
0
26 Oct 2022
Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe
Xiang Yue
Huseyin A. Inan
Xuechen Li
Girish Kumar
Julia McAnallen
Hoda Shajari
Huan Sun
David Levitan
Robert Sim
154
86
0
25 Oct 2022
Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Aaron Mueller
Yudi Xia
Tal Linzen
MILM
113
10
0
25 Oct 2022
OpenStance: Real-world Zero-shot Stance Detection
Hanzi Xu
Slobodan Vučetić
Wenpeng Yin
69
22
0
25 Oct 2022
Universal Evasion Attacks on Summarization Scoring
Wenchuan Mu
Kwan Hui Lim
AAML
87
1
0
25 Oct 2022
MOFormer: Self-Supervised Transformer model for Metal-Organic Framework Property Prediction
Zhonglin Cao
Rishikesh Magar
Yuyang Wang
A. Farimani
AI4CE
106
101
0
25 Oct 2022
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Maximillian Chen
Alexandros Papangelis
Chenyang Tao
Andrew Rosenbaum
Seokhwan Kim
Yang Liu
Zhou Yu
Dilek Z. Hakkani-Tür
110
35
0
25 Oct 2022
PolyHope: Two-Level Hope Speech Detection from Tweets
F. Balouchzahi
Grigori Sidorov
Alexander Gelbukh
53
50
0
25 Oct 2022
Exploring Mode Connectivity for Pre-trained Language Models
Yujia Qin
Cheng Qian
Jing Yi
Weize Chen
Yankai Lin
Xu Han
Zhiyuan Liu
Maosong Sun
Jie Zhou
99
21
0
25 Oct 2022
Are All Spurious Features in Natural Language Alike? An Analysis through a Causal Lens
Nitish Joshi
X. Pan
Hengxing He
CML
126
30
0
25 Oct 2022
This joke is [MASK]: Recognizing Humor and Offense with Prompting
Junze Li
Mengjie Zhao
Yubo Xie
Antonis Maronikolakis
Pearl Pu
Hinrich Schütze
AAML
61
1
0
25 Oct 2022
SepLL: Separating Latent Class Labels from Weak Supervision Noise
Andreas Stephan
Vasiliki Kougia
Benjamin Roth
55
8
0
25 Oct 2022
Multilingual Relation Classification via Efficient and Effective Prompting
Yuxuan Chen
David Harbecke
Leonhard Hennig
LRM
87
12
0
25 Oct 2022
FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation
Chen Zhang
L. F. D’Haro
Qiquan Zhang
Thomas Friedrichs
Haizhou Li
88
16
0
25 Oct 2022
Improving Imbalanced Text Classification with Dynamic Curriculum Learning
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
67
4
0
25 Oct 2022
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach
Xulong Zhang
Jianzong Wang
Ning Cheng
Kexin Zhu
Jing Xiao
71
1
0
25 Oct 2022
Parameter-Efficient Legal Domain Adaptation
Jonathan Li
R. Bhambhoria
Xiao-Dan Zhu
ELM
AILaw
ALM
103
14
0
25 Oct 2022
Evaluating Parameter Efficient Learning for Generation
Peng Xu
M. Patwary
Shrimai Prabhumoye
Virginia Adams
R. Prenger
Ming-Yu Liu
Nayeon Lee
Mohammad Shoeybi
Bryan Catanzaro
MoE
72
3
0
25 Oct 2022
Toward an Intelligent Tutoring System for Argument Mining in Legal Texts
Hannes Westermann
Jaromír Šavelka
Vern R. Walker
Kevin D. Ashley
Karim Benyekhlef
60
4
0
24 Oct 2022
Datavoidant: An AI System for Addressing Political Data Voids on Social Media
Claudia Flores-Saviaga
Shangbin Feng
Saiph Savage
86
16
0
24 Oct 2022
ExPUNations: Augmenting Puns with Keywords and Explanations
Jiao Sun
Anjali Narayan-Chen
Shereen Oraby
Alessandra Cervone
Tagyoung Chung
Jing Huang
Yang Liu
Nanyun Peng
81
10
0
24 Oct 2022
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
110
171
0
24 Oct 2022
Cascading Biases: Investigating the Effect of Heuristic Annotation Strategies on Data and Models
Chaitanya Malaviya
Sudeep Bhatia
Mark Yatskar
72
4
0
24 Oct 2022
Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models
Stelios Maroudas
Sotiris Legkas
Prodromos Malakasiotis
Ilias Chalkidis
VLM
AILaw
ALM
ELM
84
4
0
24 Oct 2022
A Unified Framework for Pun Generation with Humor Principles
Yufei Tian
Divyanshu Sheth
Nanyun Peng
89
14
0
24 Oct 2022
Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models
Syrielle Montariol
Arij Riabi
Djamé Seddah
88
12
0
24 Oct 2022
Investigating the detection of Tortured Phrases in Scientific Literature
Puthineath Lay
M. Lentschat
Cyril Labbe
67
5
0
24 Oct 2022
Modeling Information Change in Science Communication with Semantically Matched Paraphrases
Dustin Wright
Jiaxin Pei
David Jurgens
Isabelle Augenstein
95
16
0
24 Oct 2022
Language-free Training for Zero-shot Video Grounding
Dahye Kim
Jungin Park
Jiyoung Lee
S. Park
Kwanghoon Sohn
96
21
0
24 Oct 2022
Exploring Euphemism Detection in Few-Shot and Zero-Shot Settings
Sedrick Scott Keh
50
7
0
24 Oct 2022
Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction
Yue Yang
Artemis Panagopoulou
Marianna Apidianaki
Mark Yatskar
Chris Callison-Burch
113
2
0
24 Oct 2022
Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation
Junru Lu
Xingwei Tan
Gabriele Pergola
Lin Gui
Yulan He
98
11
0
24 Oct 2022
Previous
1
2
3
...
132
133
134
...
215
216
217
Next