1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,811 papers shown
Title
IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research
Arpandeep Khatua
Vikram Sharma Mailthody
Bhagyashree Taleka
Tengfei Ma
Xiang Song
Wen-mei W. Hwu
AI4CE
111
39
0
27 Feb 2023
Prompt-based Learning for Text Readability Assessment
Bruce W. Lee
J. Lee
VLM
67
13
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Quanzheng Li
Dinggang Shen
Tianming Liu
Xiang Li
139
161
0
25 Feb 2023
Pre-Finetuning for Few-Shot Emotional Speech Recognition
Maximillian Chen
Zhou Yu
75
4
0
24 Feb 2023
Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Kashun Shum
Shizhe Diao
Tong Zhang
ReLM
LRM
119
138
0
24 Feb 2023
Modelling Temporal Document Sequences for Clinical ICD Coding
Clarence Boon Liang Ng
Diogo Santos
Marek Rei
64
8
0
24 Feb 2023
In-Depth Look at Word Filling Societal Bias Measures
Matúš Pikuliak
Ivana Benová
Viktor Bachratý
99
9
0
24 Feb 2023
VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining
Ramon Ruiz-Dolz
Javier Iranzo-Sánchez
36
3
0
24 Feb 2023
Deep Learning for Video-Text Retrieval: a Review
Cunjuan Zhu
Qi Jia
Wei Chen
Yanming Guo
Yu Liu
77
18
0
24 Feb 2023
Dual Path Modeling for Semantic Matching by Perceiving Subtle Conflicts
Chao Xue
Di Liang
Sirui Wang
Wei Wu
Jing Zhang
70
9
0
24 Feb 2023
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views
Katerina Margatina
Shuai Wang
Yogarshi Vyas
Neha Ann John
Yassine Benajiba
Miguel Ballesteros
68
17
0
23 Feb 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework
Paul Pu Liang
Yun Cheng
Xiang Fan
Chun Kai Ling
Suzanne Nie
...
Nicholas B. Allen
Randy P. Auerbach
Faisal Mahmood
Ruslan Salakhutdinov
Louis-Philippe Morency
116
37
0
23 Feb 2023
MCWDST: a Minimum-Cost Weighted Directed Spanning Tree Algorithm for Real-Time Fake News Mitigation in Social Media
Ciprian-Octavian Truică
Elena Simona Apostol
Radu-Cătălin Nicolescu
Panagiotis Karras
GNN
59
27
0
23 Feb 2023
KHAN: Knowledge-Aware Hierarchical Attention Networks for Accurate Political Stance Prediction
Yunyong Ko
Seongeun Ryu
Soeun Han
Youngseung Jeon
Jaehoon Kim
Sohyun Park
Kyungsik Han
Hanghang Tong
Sang-Wook Kim
117
15
0
23 Feb 2023
Data leakage in cross-modal retrieval training: A case study
Benno Weck
Xavier Serra
61
7
0
23 Feb 2023
Coarse-to-Fine Knowledge Selection for Document Grounded Dialogs
Yeqin Zhang
Haomin Fu
Cheng Fu
Haiyang Yu
Yongbin Li
Cam-Tu Nguyen
116
9
0
23 Feb 2023
FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering
Qichen Ye
Bowen Cao
Nuo Chen
Weiyuan Xu
Yuexian Zou
73
18
0
23 Feb 2023
Guiding Large Language Models via Directional Stimulus Prompting
Zekun Li
Baolin Peng
Pengcheng He
Michel Galley
Jianfeng Gao
Xi Yan
LLMAG
LRM
LM&Ro
136
101
0
22 Feb 2023
Topic-switch adapted Japanese Dialogue System based on PLATO-2
Donghuo Zeng
Jianming Wu
Yanan Wang
Kazunori Matsumoto
Gen Hattori
K. Ikeda
81
0
0
22 Feb 2023
Connecting Vision and Language with Video Localized Narratives
P. Voigtlaender
Soravit Changpinyo
Jordi Pont-Tuset
Radu Soricut
V. Ferrari
VGen
143
23
0
22 Feb 2023
DNG: Taxonomy Expansion by Exploring the Intrinsic Directed Structure on Non-gaussian Space
Songlin Zhai
Weiqing Wang
Yuanfa Li
Yuan Meng
96
6
0
22 Feb 2023
Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks
Bowen Jin
Yu Zhang
Yu Meng
Jiawei Han
97
31
0
21 Feb 2023
In-context Example Selection with Influences
Nguyen Tai
Eric Wong
94
54
0
21 Feb 2023
Learning to Retrieve Engaging Follow-Up Queries
Christopher Richardson
Sudipta Kar
Anjishnu Kumar
Anand Ramachandran
O. Khan
Zeynab Raeesy
A. Sethy
46
2
0
21 Feb 2023
Generic Dependency Modeling for Multi-Party Conversation
Weizhou Shen
Xiaojun Quan
Ke Yang
71
4
0
21 Feb 2023
Playing the Werewolf game with artificial intelligence for language understanding
Hisaichi Shibata
S. Miki
Yuta Nakamura
LLMAG
60
12
0
21 Feb 2023
Exploring the Limits of Transfer Learning with Unified Model in the Cybersecurity Domain
Kuntal Kumar Pal
Kazuaki Kashihara
Ujjwala Anantheswaran
Kirby Kuznia
S. Jagtap
Chitta Baral
AAML
34
3
0
20 Feb 2023
Hashtag-Guided Low-Resource Tweet Classification
Shizhe Diao
Sedrick Scott Keh
Liangming Pan
Zhiliang Tian
Yan Song
Tong Zhang
VLM
AI4TS
65
7
0
20 Feb 2023
Can discrete information extraction prompts generalize across language models?
Nathanaël Carraz Rakotonirina
Roberto Dessì
Fabio Petroni
Sebastian Riedel
Marco Baroni
72
8
0
20 Feb 2023
Knowledge-aware Bayesian Co-attention for Multimodal Emotion Recognition
Zihan Zhao
Yu Wang
Yanfeng Wang
63
18
0
20 Feb 2023
Unsupervised Layer-wise Score Aggregation for Textual OOD Detection
Maxime Darrin
Guillaume Staerman
Eduardo Dadalto Camara Gomes
Jackie CK Cheung
Pablo Piantanida
Pierre Colombo
OODD
451
12
0
20 Feb 2023
What happens before and after: Multi-Event Commonsense in Event Coreference Resolution
Sahithya Ravi
Christy Tanner
R. Ng
Vered Shwartz
76
19
0
20 Feb 2023
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization
Kayhan Behdin
Qingquan Song
Aman Gupta
S. Keerthi
Ayan Acharya
Borja Ocejo
Gregory Dexter
Rajiv Khanna
D. Durfee
Rahul Mazumder
AAML
71
7
0
19 Feb 2023
Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages
Ankan Mullick
Ishani Mondal
Sourjyadip Ray
R. Raghav
G. Chaitanya
Pawan Goyal
63
13
0
19 Feb 2023
Evaluating the Effectiveness of Pre-trained Language Models in Predicting the Helpfulness of Online Product Reviews
Ali Boluki
Javad Pourmostafa Roshan Sharami
D. Shterionov
54
1
0
19 Feb 2023
Multilingual Content Moderation: A Case Study on Reddit
Meng Ye
Karan Sikka
Katherine Atwell
Sabit Hassan
Ajay Divakaran
Malihe Alikhani
AI4MH
42
7
0
19 Feb 2023
Language-Specific Representation of Emotion-Concept Knowledge Causally Supports Emotion Inference
Ming Li
Yusheng Su
Hsiu-Yuan Huang
Jiali Cheng
Xin Hu
...
Yujia Qin
Xiaozhi Wang
Kristen A. Lindquist
Zhi-Yun Liu
Dan Zhang
74
6
0
19 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
136
245
0
19 Feb 2023
Few-shot Multimodal Multitask Multilingual Learning
Aman Chadha
Vinija Jain
125
0
0
19 Feb 2023
Learning Language Representations with Logical Inductive Bias
Jianshu Chen
NAI
AI4CE
LRM
53
3
0
19 Feb 2023
BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark
Dakuan Lu
Hengkui Wu
Jiaqing Liang
Yipei Xu
Qi He
Yipeng Geng
Mengkun Han
Ying Xin
Yanghua Xiao
94
62
0
18 Feb 2023
Improving the Out-Of-Distribution Generalization Capability of Language Models: Counterfactually-Augmented Data is not Enough
Caoyun Fan
Wenqing Chen
Jidong Tian
Yitian Li
Hao He
Yaohui Jin
CML
OODD
54
3
0
18 Feb 2023
Optimising Human-Machine Collaboration for Efficient High-Precision Information Extraction from Text Documents
Bradley Butcher
Miri Zilka
Darren Cook
Jiri Hron
Adrian Weller
73
4
0
18 Feb 2023
Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements
Jiawen Deng
Jiale Cheng
Hao Sun
Zhexin Zhang
Minlie Huang
LM&MA
ELM
95
17
0
18 Feb 2023
Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE
Qihuang Zhong
Liang Ding
Keqin Peng
Juhua Liu
Bo Du
Li Shen
Yibing Zhan
Dacheng Tao
VLM
81
13
0
18 Feb 2023
A Federated Approach for Hate Speech Detection
Jay Gala
Deep Gandhi
Jash Mehta
Zeerak Talat
58
4
0
18 Feb 2023
Scalable Prompt Generation for Semi-supervised Learning with Language Models
Yuhang Zhou
Suraj Maharjan
Bei Liu
VLM
94
14
0
18 Feb 2023
Transformadores: Fundamentos teoricos y Aplicaciones
J. D. L. Torre
173
0
0
18 Feb 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
214
16
0
17 Feb 2023
Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Philippe Laban
Chien-Sheng Wu
Lidiya Murakhovs'ka
Xiang 'Anthony' Chen
Caiming Xiong
45
8
0
17 Feb 2023