1907.11692
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,803 papers shown
Title
Convex Dual Theory Analysis of Two-Layer Convolutional Neural Networks with Soft-Thresholding
Chunyan Xiong
Meng Lu
Xiaotong Yu
Jian-Peng Cao
Zhong Chen
D. Guo
X. Qu
MLT
124
0
0
14 Apr 2023
HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion Recognition
Soumya Dutta
Sriram Ganapathy
117
18
0
14 Apr 2023
Evaluation of Social Biases in Recent Large Pre-Trained Models
Swapnil Sharma
Nikita Anand
V. KranthiKiranG.
Alind Jain
52
0
0
13 Apr 2023
Vax-Culture: A Dataset for Studying Vaccine Discourse on Twitter
M. Zarei
M. Christensen
S. Everts
Majid Komeili
39
1
0
13 Apr 2023
SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
Shamsuddeen Hassan Muhammad
Idris Abdulmumin
Seid Muhie Yimam
David Ifeoluwa Adelani
Ibrahim Said Ahmad
N. Ousidhoum
Abinew Ali Ayele
Saif M. Mohammad
Meriem Beloucif
Sebastian Ruder
80
70
0
13 Apr 2023
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Ashraf Haddad
N. Aaraj
Preslav Nakov
Septimiu Fabian Mare
48
6
0
13 Apr 2023
Graph2topic: an opensource topic modeling framework based on sentence embedding and community detection
Leihan Zhang
Jiapeng Liu
Qiang Yan
80
1
0
13 Apr 2023
PGTask: Introducing the Task of Profile Generation from Dialogues
Rui Ribeiro
Joao Paulo Carvalho
Luísa Coheur
37
1
0
13 Apr 2023
A-CAP: Anticipation Captioning with Commonsense Knowledge
D. Vo
Quoc-An Luong
Akihiro Sugimoto
Hideki Nakayama
70
1
0
13 Apr 2023
Computational modeling of semantic change
Nina Tahmasebi
Haim Dubossarsky
108
6
0
13 Apr 2023
Can Large Language Models Transform Computational Social Science?
Caleb Ziems
William B. Held
Omar Shaikh
Jiaao Chen
Zhehao Zhang
Diyi Yang
LLMAG
133
322
0
12 Apr 2023
Learning Homographic Disambiguation Representation for Neural Machine Translation
Weixuan Wang
Wei Peng
Qun Liu
52
0
0
12 Apr 2023
Measuring Normative and Descriptive Biases in Language Models Using Census Data
Samia Touileb
Lilja Ovrelid
Erik Velldal
88
4
0
12 Apr 2023
Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Chi-Liang Liu
Hao Wang
Nuwa Xi
Sendong Zhao
Bing Qin
VLM
79
1
0
12 Apr 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELM
LM&MA
69
291
0
12 Apr 2023
MoMo: A shared encoder Model for text, image and multi-Modal representations
Rakesh Chada
Zhao-Heng Zheng
P. Natarajan
ViT
64
4
0
11 Apr 2023
L3MVN: Leveraging Large Language Models for Visual Target Navigation
Bangguo Yu
Hamidreza Kasaei
M. Cao
LM&Ro
118
101
0
11 Apr 2023
Zero-shot Temporal Relation Extraction with ChatGPT
Chenhan Yuan
Qianqian Xie
Sophia Ananiadou
101
85
0
11 Apr 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
84
79
0
11 Apr 2023
Prompt Learning for News Recommendation
Zizhuo Zhang
Bang-wei Wang
AI4TS
73
66
0
11 Apr 2023
Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
Ensheng Shi
Yanlin Wang
Hongyu Zhang
Lun Du
Shi Han
Dongmei Zhang
Hongbin Sun
89
45
0
11 Apr 2023
Improving Vision-and-Language Navigation by Generating Future-View Image Semantics
Jialu Li
Joey Tianyi Zhou
97
37
0
11 Apr 2023
DISTO: Evaluating Textual Distractors for Multi-Choice Questions using Negative Sampling based Approach
Bilal Ghanem
Alona Fyshe
61
4
0
10 Apr 2023
Scallop: A Language for Neurosymbolic Programming
Ziyang Li
Jiani Huang
Mayur Naik
ReLM
LRM
NAI
100
34
0
10 Apr 2023
Exploring Effective Factors for Improving Visual In-Context Learning
Yanpeng Sun
Qiang Chen
Jian Wang
Jingdong Wang
Zechao Li
LRM
VLM
106
26
0
10 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
88
21
0
10 Apr 2023
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging
Aarne Talman
H. Çelikkanat
Sami Virpioja
Markus Heinonen
Jörg Tiedemann
BDL
UQCV
84
8
0
10 Apr 2023
SELFormer: Molecular Representation Learning via SELFIES Language Models
Atakan Yüksel
Erva Ulusoy
Atabey Ünlü
Tunca Dogan
101
61
0
10 Apr 2023
UATTA-EB: Uncertainty-Aware Test-Time Augmented Ensemble of BERTs for Classifying Common Mental Illnesses on Social Media Posts
Pratinav Seth
Mihir Agarwal
AI4MH
64
1
0
10 Apr 2023
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge
Wenyue Hua
Kai Mei
Jianchao Ji
Juntao Tan
Shuyuan Xu
Zelong Li
Yongfeng Zhang
VLM
LRM
122
232
0
10 Apr 2023
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain
Yanis Labrak
Adrien Bazoge
Richard Dufour
Mickael Rouvier
Emmanuel Morin
B. Daille
P. Gourraud
50
31
0
09 Apr 2023
Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language Understanding
Yuqing Wang
Yun Zhao
Linda R. Petzold
AI4MH
LM&MA
ELM
95
53
0
09 Apr 2023
Team QUST at SemEval-2023 Task 3: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting Online News Genre, Framing and Persuasion Techniques
Ye Jiang
71
10
0
09 Apr 2023
Continual Graph Convolutional Network for Text Classification
Tiandeng Wu
Qijiong Liu
Yinhao Cao
Yao Huang
Xiao-Ming Wu
Jiandong Ding
GNN
74
10
0
09 Apr 2023
Multi-class Categorization of Reasons behind Mental Disturbance in Long Texts
Muskan Garg
AI4MH
47
2
0
08 Apr 2023
Unsupervised Story Discovery from Continuous News Streams via Scalable Thematic Embedding
Susik Yoon
Dongha Lee
Yunyi Zhang
Jiawei Han
AI4TS
AIFin
147
8
0
08 Apr 2023
tmn at SemEval-2023 Task 9: Multilingual Tweet Intimacy Detection using XLM-T, Google Translate, and Ensemble Learning
Anna Glazkova
49
1
0
08 Apr 2023
Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP
Lama Alkhaled
Tosin Adewumi
Sana Sabah Sabry
86
8
0
08 Apr 2023
SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers
Alberto Marchisio
David Durà
Maurizio Capra
Maurizio Martina
Guido Masera
Mohamed Bennai
94
22
0
08 Apr 2023
ASPEST: Bridging the Gap Between Active Learning and Selective Prediction
Jiefeng Chen
Jinsung Yoon
Sayna Ebrahimi
Sercan O. Arik
S. Jha
Tomas Pfister
115
1
0
07 Apr 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe Lin
Yang Zhang
Shiyu Chang
DiffM
97
46
0
07 Apr 2023
Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering
Hung-Ting Su
Yulei Niu
Xudong Lin
Winston H. Hsu
Shih-Fu Chang
VGen
ELM
112
6
0
07 Apr 2023
Probing Conceptual Understanding of Large Visual-Language Models
Madeline Chantry Schiappa
Raiyaan Abdullah
Shehreen Azad
Jared Claypoole
Michael Cogswell
Ajay Divakaran
Yogesh S Rawat
81
16
0
07 Apr 2023
Revisiting Automated Prompting: Are We Actually Doing Better?
Yulin Zhou
Yiren Zhao
Ilia Shumailov
Robert D. Mullins
Y. Gal
117
8
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
105
43
0
07 Apr 2023
SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism using Majority Voted Fine-Tuned Transformers
Sriya Rallabandi
Sanchit Singhal
Pratinav Seth
21
3
0
07 Apr 2023
Architecture-Preserving Provable Repair of Deep Neural Networks
Zhe Tao
Stephanie Nawas
Jacqueline Mitchell
Aditya V. Thakur
AAML
64
11
0
07 Apr 2023
Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
Hanmeng Liu
Ruoxi Ning
Zhiyang Teng
Jian Liu
Qiji Zhou
Yuexin Zhang
ELM
ReLM
LRM
125
258
0
07 Apr 2023
Towards Corpus-Scale Discovery of Selection Biases in News Coverage: Comparing What Sources Say About Entities as a Start
Sihao Chen
William F. Bruno
Dan Roth
68
1
0
06 Apr 2023
Deep Learning for Opinion Mining and Topic Classification of Course Reviews
Anna Koufakou
69
20
0
06 Apr 2023