Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,783 papers shown
Title
Machine-Created Universal Language for Cross-lingual Transfer
Yaobo Liang
Quanzhi Zhu
Junhe Zhao
Nan Duan
84
7
0
22 May 2023
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
Joe Stacey
Marek Rei
59
3
0
22 May 2023
DUMB: A Benchmark for Smart Evaluation of Dutch Models
Wietse de Vries
Martijn B. Wieling
Malvina Nissim
ELM
ALM
MoE
62
6
0
22 May 2023
Rethinking Semi-supervised Learning with Language Models
Zhengxiang Shi
Francesco Tonolini
Nikolaos Aletras
Emine Yilmaz
G. Kazai
Yunlong Jiao
97
21
0
22 May 2023
Bidirectional Transformer Reranker for Grammatical Error Correction
Ying Zhang
Hidetaka Kamigaito
Manabu Okumura
54
2
0
22 May 2023
Distilling ChatGPT for Explainable Automated Student Answer Assessment
Jiazheng Li
Lin Gui
Yuxiang Zhou
David West
Cesare Aloisi
Yulan He
79
28
0
22 May 2023
Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph
Jennifer D'Souza
Moussab Hrou
Sören Auer
77
2
0
22 May 2023
Automatic Code Summarization via ChatGPT: How Far Are We?
Weisong Sun
Chunrong Fang
Yudu You
Yun Miao
Yi Liu
...
Yuchen Chen
Quanjun Zhang
Hanwei Qian
Yang Liu
Zhenyu Chen
ELM
74
79
0
22 May 2023
On Bias and Fairness in NLP: Investigating the Impact of Bias and Debiasing in Language Models on the Fairness of Toxicity Detection
Fatma Elsafoury
Stamos Katsigiannis
71
1
0
22 May 2023
FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering
Megha Chakraborty
Khusbu Pahwa
Anku Rani
Shreyas Chatterjee
Dwip Dalal
...
Shreyash Mishra
K. Sensharma
Aman Chadha
Amit P. Sheth
Amitava Das
DiffM
77
8
0
22 May 2023
Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model
Xiao Wang
Wei Zhou
Qi Zhang
Jie Zhou
Songyang Gao
Junzhe Wang
Menghan Zhang
Xiang Gao
Yunwen Chen
Tao Gui
129
10
0
22 May 2023
Investigating Agency of LLMs in Human-AI Collaboration Tasks
Ashish Sharma
Sudha Rao
Chris Brockett
Akanksha Malhotra
Nebojsa Jojic
W. Dolan
LLMAG
88
14
0
22 May 2023
Ultra-Fine Entity Typing with Prior Knowledge about Labels: A Simple Clustering Based Strategy
Na Li
Zied Bouraoui
Steven Schockaert
71
10
0
22 May 2023
MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space
Hanxing Ding
Liang Pang
Zihao Wei
Huawei Shen
Xueqi Cheng
Tat-Seng Chua
DiffM
73
10
0
22 May 2023
Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization
Liang Chen
Hongru Wang
Yang Deng
Wai-Chung Kwan
Zezhong Wang
Kam-Fai Wong
94
15
0
22 May 2023
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods by Language Models
Hao Wang
Hirofumi Shimizu
Daisuke Kawahara
75
1
0
22 May 2023
A Benchmark on Extremely Weakly Supervised Text Classification: Reconcile Seed Matching and Prompting Approaches
Zihan Wang
Tianle Wang
Dheeraj Mekala
Jingbo Shang
VLM
82
8
0
22 May 2023
Fact-Checking Complex Claims with Program-Guided Reasoning
Liangming Pan
Xiaobao Wu
Xinyuan Lu
Anh Tuan Luu
William Yang Wang
Min-Yen Kan
Preslav Nakov
LRM
96
136
0
22 May 2023
TADA: Efficient Task-Agnostic Domain Adaptation for Transformers
Chia-Chien Hung
Lukas Lange
Jannik Strötgen
94
10
0
22 May 2023
Learning Interpretable Style Embeddings via Prompting LLMs
Ajay Patel
D. Rao
Ansh Kothary
Kathleen McKeown
Chris Callison-Burch
93
26
0
22 May 2023
MetaAdapt: Domain Adaptive Few-Shot Misinformation Detection via Meta Learning
Zhenrui Yue
Huimin Zeng
Yang Zhang
Lanyu Shang
Dong Wang
79
17
0
22 May 2023
G3Detector: General GPT-Generated Text Detector
Haolan Zhan
Xuanli He
Xingliang Yuan
Yuxiang Wu
Pontus Stenetorp
DeLMO
77
24
0
22 May 2023
Tokenized Graph Transformer with Neighborhood Augmentation for Node Classification in Large Graphs
Jinsong Chen
Chang-Shu Liu
Kai-Xin Gao
Gaichao Li
Kun He
94
4
0
22 May 2023
Transferring Fairness using Multi-Task Learning with Limited Demographic Information
Carlos Alejandro Aguirre
Mark Dredze
114
0
0
22 May 2023
Data-efficient Active Learning for Structured Prediction with Partial Annotation and Self-Training
Zhisong Zhang
Emma Strubell
Eduard H. Hovy
77
1
0
22 May 2023
PrOnto: Language Model Evaluations for 859 Languages
Luke Gessler
76
1
0
22 May 2023
PRODIGY: Enabling In-context Learning Over Graphs
Qian Huang
Hongyu Ren
Peng Chen
Gregor Krvzmanc
D. Zeng
Percy Liang
J. Leskovec
122
77
0
21 May 2023
Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies
Linyong Nan
Yilun Zhao
Weijin Zou
Narutatsu Ri
Jaesung Tae
Ellen Zhang
Arman Cohan
Dragomir R. Radev
LMTD
86
51
0
21 May 2023
Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
Linyuan Gong
Chenyan Xiong
Xiaodong Liu
Payal Bajaj
Yiqing Xie
Alvin Cheung
Jianfeng Gao
Xia Song
VLM
AI4CE
70
2
0
21 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
139
6
0
21 May 2023
BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer
Piyush Jha
Joseph Scott
Jaya Sriram Ganeshna
M. Singh
Vijay Ganesh
49
5
0
21 May 2023
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection
Xiao Yu
Yuang Qi
Kejiang Chen
Guoqiang Chen
Xi Yang
Pengyuan Zhu
Xiuwei Shang
Weiming Zhang
Neng H. Yu
DeLMO
73
11
0
21 May 2023
A Deeper (Autoregressive) Approach to Non-Convergent Discourse Parsing
Yoav Tulpan
Oren Tsur
50
0
0
21 May 2023
PanoContext-Former: Panoramic Total Scene Understanding with a Transformer
Yuan Dong
C. Fang
Liefeng Bo
Zilong Dong
Ping Tan
MDE
ViT
65
11
0
21 May 2023
Model Analysis & Evaluation for Ambiguous Question Answering
Konstantinos Papakostas
Irene Papadopoulou
ELM
23
1
0
21 May 2023
Continually Improving Extractive QA via Human Feedback
Ge Gao
Hung-Ting Chen
Yoav Artzi
Eunsol Choi
87
12
0
21 May 2023
Infor-Coef: Information Bottleneck-based Dynamic Token Downsampling for Compact and Efficient language model
Wenxin Tan
50
1
0
21 May 2023
F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks
Xiangxiang Gao
Wei-wei Zhu
Jiasheng Gao
Congrui Yin
VLM
92
12
0
21 May 2023
A Dual-level Detection Method for Video Copy Detection
Tianyi Wang
Feipeng Ma
Zhenhua Liu
Fengyun Rao
80
3
0
21 May 2023
Are Your Explanations Reliable? Investigating the Stability of LIME in Explaining Text Classifiers by Marrying XAI and Adversarial Attack
Christopher Burger
Lingwei Chen
Thai Le
FAtt
AAML
90
11
0
21 May 2023
PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation
Eli Chien
Jiong Zhang
Cho-Jui Hsieh
Jyun-Yu Jiang
Wei-Cheng Chang
O. Milenkovic
Hsiang-Fu Yu
86
10
0
21 May 2023
Task-agnostic Distillation of Encoder-Decoder Language Models
Chen Zhang
Yang Yang
Jingang Wang
Dawei Song
64
5
0
21 May 2023
Lifelong Language Pretraining with Distribution-Specialized Experts
Wuyang Chen
Yan-Quan Zhou
Nan Du
Yanping Huang
James Laudon
Zhiwen Chen
Claire Cu
KELM
111
52
0
20 May 2023
Patton: Language Model Pretraining on Text-Rich Networks
Bowen Jin
Wentao Zhang
Yu Zhang
Yu Meng
Xinyang Zhang
Qi Zhu
Jiawei Han
VLM
112
46
0
20 May 2023
Self-supervised representations in speech-based depression detection
Wen Wu
Chuxu Zhang
P. Woodland
74
24
0
20 May 2023
SEntFiN 1.0: Entity-Aware Sentiment Analysis for Financial News
Ankur Sinha
Satishwar Kedas
Rishu Kumar
P. Malo
AIFin
49
51
0
20 May 2023
Brain encoding models based on multimodal transformers can transfer across language and vision
Jerry Tang
Meng Du
Vy A. Vo
Vasudev Lal
Alexander G. Huth
96
33
0
20 May 2023
Dynamic Transformers Provide a False Sense of Efficiency
Yiming Chen
Simin Chen
Zexin Li
Wei Yang
Cong Liu
R. Tan
Haizhou Li
AAML
90
12
0
20 May 2023
Collaborative Development of NLP models
Fereshte Khani
Marco Tulio Ribeiro
80
2
0
20 May 2023
PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search
Mozhi Zhang
Hang Yan
Yaqian Zhou
Xipeng Qiu
76
10
0
20 May 2023
Previous
1
2
3
...
103
104
105
...
214
215
216
Next