1907.11692
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,764 papers shown
CUE: An Uncertainty Interpretation Framework for Text Classifiers Built on Pre-Trained Language Models
Jiazheng Li
Zhaoyue Sun
Bin Liang
Lin Gui
Yulan He
79
2
0
06 Jun 2023
Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search
Zhiyu Zoey Chen
J. Choi
B. Fetahu
Oleg Rokhlenko
S. Malmasi
RALM
41
6
0
06 Jun 2023
Deep neural networks architectures from the perspective of manifold learning
German Magai
AAML
AI4CE
61
6
0
06 Jun 2023
BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs
Zhiyong Yang
Tinglin Huang
Ming Ding
Yuxiao Dong
Rex Ying
Yukuo Cen
Yangli-ao Geng
Jie Tang
SSL
VLM
91
9
0
06 Jun 2023
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning
Chujie Zheng
Pei Ke
Zheng Zhang
Minlie Huang
BDL
83
34
0
06 Jun 2023
Security Knowledge-Guided Fuzzing of Deep Learning Libraries
Nima Shiri Harzevili
Mohammad Mahdi Mohajer
Moshi Wei
H. Pham
Song Wang
AAML
AI4CE
56
1
0
05 Jun 2023
Information Flow Control in Machine Learning through Modular Model Architecture
Trishita Tiwari
Suchin Gururangan
Chuan Guo
Weizhe Hua
Sanjay Kariyappa
Udit Gupta
Wenjie Xiong
Kiwan Maeng
Hsien-Hsin S. Lee
G. E. Suh
75
6
0
05 Jun 2023
NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks
Jean-Michel Attendu
Jean-Philippe Corbeil
88
16
0
05 Jun 2023
How Can We Train Deep Learning Models Across Clouds and Continents? An Experimental Study
Alexander Isenko
R. Mayer
Hans-Arno Jacobsen
78
8
0
05 Jun 2023
Machine Learning and Statistical Approaches to Measuring Similarity of Political Parties
Daria Boratyn
Damian Brzyski
Beata Kosowska-Gąstoł
Jan Rybicki
Wojciech Słomczyński
Dariusz Stolicki
15
0
0
05 Jun 2023
Structured Voronoi Sampling
Afra Amini
Li Du
Ryan Cotterell
DiffM
97
2
0
05 Jun 2023
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Junling Liu
Peilin Zhou
Yining Hua
Dading Chong
Zhongyu Tian
...
Helin Wang
Chenyu You
Zhenhua Guo
Lei Zhu
Michael Lingzhi Li
LM&MA
ELM
111
79
0
05 Jun 2023
Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified?
D. Furman
Pablo E. Torres
José Raúl Rodríguez Rodríguez
Diego Letzen
María Vanina Martínez
Laura Alonso Alemany
32
7
0
05 Jun 2023
A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires
Hoyun Song
Jisu Shin
Huije Lee
Jong C. Park
72
7
0
05 Jun 2023
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition
Ali Modarressi
Mohsen Fayyaz
Ehsan Aghazadeh
Yadollah Yaghoobzadeh
Mohammad Taher Pilehvar
100
28
0
05 Jun 2023
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Alham Fikri Aji
Genta Indra Winata
Radityo Eko Prasojo
Phil Blunsom
A. Kuncoro
65
8
0
05 Jun 2023
Leveraging Large Language Models for Topic Classification in the Domain of Public Affairs
Alejandro Peña
Aythami Morales
Julian Fierrez
Ignacio Serna
J. Ortega-Garcia
Iñigo Puente
Jorge Cordova
Gonzalo Cordova
87
20
0
05 Jun 2023
UNIDECOR: A Unified Deception Corpus for Cross-Corpus Deception Detection
Aswathy Velutharambath
Roman Klinger
35
3
0
05 Jun 2023
Enhancing Language Representation with Constructional Information for Natural Language Understanding
Lvxiaowei Xu
Jian Wu
Jiawei Peng
Zhilin Gong
Ming Cai
Tianxiang Wang
61
3
0
05 Jun 2023
CELDA: Leveraging Black-box Language Model as Enhanced Classifier without Labels
Hyunsoo Cho
Youna Kim
Sang-goo Lee
42
3
0
05 Jun 2023
Joint Pre-training and Local Re-training: Transferable Representation Learning on Multi-source Knowledge Graphs
Zequn Sun
Jiacheng Huang
Jing-Rong Lin
Xiaozhou Xu
Qijin Chen
Wei Hu
62
2
0
05 Jun 2023
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models
Jiabang He
Yilang Hu
Lei Wang
Xingdong Xu
Ning Liu
Hui-juan Liu
Hengtao Shen
VLM
OOD
68
3
0
05 Jun 2023
Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications
Han Xie
Da Zheng
Jun Ma
Houyu Zhang
V. Ioannidis
...
Sheng Wang
Carl Yang
Yi Xu
Belinda Zeng
Trishul Chilimbi
AI4CE
90
40
0
05 Jun 2023
Introduction to Latent Variable Energy-Based Models: A Path Towards Autonomous Machine Intelligence
Anna Dawid
Yann LeCun
DRL
104
31
0
05 Jun 2023
Prompt to be Consistent is Better than Self-Consistent? Few-Shot and Zero-Shot Fact Verification with Pre-trained Language Models
Fengzhu Zeng
Wei Gao
69
7
0
05 Jun 2023
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
Dongfu Jiang
Xiang Ren
Bill Yuchen Lin
ELM
144
334
0
05 Jun 2023
Learning to Relate to Previous Turns in Conversational Search
Fengran Mo
J. Nie
Kaiyu Huang
Kelong Mao
Yutao Zhu
Peng Li
Yang Liu
103
28
0
05 Jun 2023
RadLing: Towards Efficient Radiology Report Understanding
Rikhiya Ghosh
Sanjeev Kumar Karn
Manuela Danu
Larisa Micu
Ramya Vunikili
Oladimeji Farri
MedIm
61
6
0
04 Jun 2023
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet
Omar Shaikh
Caleb Ziems
William B. Held
Aryan Pariani
Fred Morstatter
Diyi Yang
85
14
0
04 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELM
FedML
79
4
0
04 Jun 2023
Leverage Points in Modality Shifts: Comparing Language-only and Multimodal Word Representations
Aleksey Tikhonov
Lisa Bylinina
Denis Paperno
53
2
0
04 Jun 2023
Probing Physical Reasoning with Counter-Commonsense Context
Kazushi Kondo
Saku Sugawara
Akiko Aizawa
LRM
74
4
0
04 Jun 2023
A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models
H. Ko
Kichang Yang
Minho Ryu
Taekyoon Choi
Seungmu Yang
Jiwung Hyun
Sung-Yong Park
Kyubyong Park
93
30
0
04 Jun 2023
Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
Lingfeng Shen
Haiyun Jiang
Lemao Liu
Shuming Shi
61
2
0
04 Jun 2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Hui Yang
Sifu Yue
Yunzhong He
RALM
72
172
0
04 Jun 2023
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning
Jianghui Wang
Yuxuan Wang
Dongyan Zhao
Zilong Zheng
87
1
0
04 Jun 2023
Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis
Daniela Teodorescu
Saif M. Mohammad
59
13
0
03 Jun 2023
Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection
Minh Le Nguyen
K. Kishan
Toan Q. Nguyen
Thien Huu Nguyen
Ankit Chadha
Thuy Vu
39
0
0
03 Jun 2023
Stubborn Lexical Bias in Data and Models
Sofia Serrano
Jesse Dodge
Noah A. Smith
97
2
0
03 Jun 2023
Towards Coding Social Science Datasets with Language Models
Anonymous Acl
Taylor Sorensen
Lisa P. Argyle
Ethan C. Busby
Nancy Fulda
Joshua R Gubler
David Wingate
ALM
SyDa
55
11
0
03 Jun 2023
Incorporating Deep Syntactic and Semantic Knowledge for Chinese Sequence Labeling with GCN
Xuemei Tang
Jun Wang
Qi Su
GNN
50
0
0
03 Jun 2023
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Georgios Peikos
S. Symeonidis
Pranav Kasela
G. Pasi
LM&MA
50
13
0
03 Jun 2023
MultiLegalPile: A 689GB Multilingual Legal Corpus
Joel Niklaus
Veton Matoshi
Matthias Stürmer
Ilias Chalkidis
Daniel E. Ho
AILaw
ELM
120
44
0
03 Jun 2023
A Comprehensive Survey on Relation Extraction: Recent Advances and New Frontiers
Xiaoyan Zhao
Yang Deng
Min Yang
Lingzhi Wang
Rui Zhang
Hong Cheng
W. Lam
Ying Shen
Ruifeng Xu
KELM
98
36
0
03 Jun 2023
Span Identification of Epistemic Stance-Taking in Academic Written English
Masaki Eguchi
K. Kyle
35
6
0
03 Jun 2023
LIC-GAN: Language Information Conditioned Graph Generative GAN Model
Robert Lo
Arnhav Datar
Abishek Sridhar
GAN
73
2
0
02 Jun 2023
A Simple yet Effective Self-Debiasing Framework for Transformer Models
Xiaoyue Wang
Lijie Wang
Xin Liu
Suhang Wu
Jinsong Su
Huasen Wu
68
4
0
02 Jun 2023
Revisiting the Role of Language Priors in Vision-Language Models
Zhiqiu Lin
Xinyue Chen
Deepak Pathak
Pengchuan Zhang
Deva Ramanan
VLM
159
27
0
02 Jun 2023
Knowledge of cultural moral norms in large language models
Aida Ramezani
Yang Xu
ELM
AILaw
72
51
0
02 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Michael Stephen Saxon
William Yang Wang
EGVM
80
9
0
02 Jun 2023