Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,734 papers shown
Title
A Dataset and Strong Baselines for Classification of Czech News Texts
Hynek Kydlívcek
Jindrich Libovický
42
1
0
20 Jul 2023
Exploring the Landscape of Natural Language Processing Research
Tim Schopf
Karim Arabi
Florian Matthes
74
14
0
20 Jul 2023
Instruction-following Evaluation through Verbalizer Manipulation
Shiyang Li
Jun Yan
Hai Wang
Zheng Tang
Xiang Ren
Vijay Srinivasan
Hongxia Jin
106
27
0
20 Jul 2023
Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models
Somayeh Ghanbarzadeh
Yan-ping Huang
Hamid Palangi
R. C. Moreno
Hamed Khanpour
68
12
0
20 Jul 2023
Building Socio-culturally Inclusive Stereotype Resources with Community Engagement
Sunipa Dev
J. Goyal
Dinesh Tewari
Shachi Dave
Vinodkumar Prabhakaran
69
26
0
20 Jul 2023
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Ashish Singh
Ashutosh Singh
Prateek R. Agarwal
Zixuan Huang
Arpita Singh
...
Ryan Rossi
Puneet Mathur
Erik Learned-Miller
Franck Dernoncourt
Ryan Rossi
104
8
0
20 Jul 2023
Abusing Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs
Eugene Bagdasaryan
Tsung-Yin Hsieh
Ben Nassi
Vitaly Shmatikov
82
86
0
19 Jul 2023
Improving the Reusability of Pre-trained Language Models in Real-world Applications
Somayeh Ghanbarzadeh
Hamid Palangi
Yan-ping Huang
R. C. Moreno
Hamed Khanpour
50
0
0
19 Jul 2023
Integrating a Heterogeneous Graph with Entity-aware Self-attention using Relative Position Labels for Reading Comprehension Model
Shima Foolad
Kourosh Kiani
47
1
0
19 Jul 2023
Exploring Transformer Extrapolation
Zhen Qin
Yiran Zhong
Huiyuan Deng
53
9
0
19 Jul 2023
DAPrompt: Deterministic Assumption Prompt Learning for Event Causality Identification
Wei Xiang
Chuanhong Zhan
Bang Wang
43
2
0
19 Jul 2023
Improving Domain Generalization for Sound Classification with Sparse Frequency-Regularized Transformer
Honglin Mu
Wentian Xia
Wanxiang Che
57
1
0
19 Jul 2023
Multi-Grained Multimodal Interaction Network for Entity Linking
Pengfei Luo
Tong Xu
Shiwei Wu
Chen Zhu
Linli Xu
Enhong Chen
87
11
0
19 Jul 2023
Thrust: Adaptively Propels Large Language Models with External Knowledge
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Jianshu Chen
KELM
156
5
0
19 Jul 2023
Traffic-Domain Video Question Answering with Automatic Captioning
Ehsan Qasemi
Jonathan M Francis
A. Oltramari
96
9
0
18 Jul 2023
Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers
Jaeyoung Kim
Kyuheon Jung
Dongbin Na
Sion Jang
Eunbin Park
Sungchul Choi
OODD
67
7
0
18 Jul 2023
OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Dongming Wu
Tiancai Wang
Yuang Zhang
Xiangyu Zhang
Jianbing Shen
VOS
84
39
0
18 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
510
12,128
0
18 Jul 2023
Linearized Relative Positional Encoding
Zhen Qin
Weixuan Sun
Kaiyue Lu
Huizhong Deng
Dong Li
Xiaodong Han
Yuchao Dai
Lingpeng Kong
Yiran Zhong
64
13
0
18 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
140
40
0
18 Jul 2023
Attention over pre-trained Sentence Embeddings for Long Document Classification
Amine Abdaoui
Sourav Dutta
52
1
0
18 Jul 2023
Does Visual Pretraining Help End-to-End Reasoning?
Chen Sun
Calvin Luo
Xingyi Zhou
Anurag Arnab
Cordelia Schmid
OCL
LRM
ViT
78
3
0
17 Jul 2023
Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Huachuan Qiu
Shuai Zhang
Anqi Li
Hongliang He
Zhenzhong Lan
ALM
94
53
0
17 Jul 2023
ChatGPT is Good but Bing Chat is Better for Vietnamese Students
Xuan-Quy Dao
Ngoc-Bich Le
41
9
0
17 Jul 2023
Unified Open-Vocabulary Dense Visual Prediction
Hengcan Shi
Munawar Hayat
Jianfei Cai
ObjD
VLM
76
24
0
17 Jul 2023
Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling
Longyue Wang
Zefeng Du
Donghua Liu
Cai Deng
Dian Yu
Haiyun Jiang
Yan Wang
Leyang Cui
Shuming Shi
Zhaopeng Tu
CoGe
97
6
0
16 Jul 2023
A Neural-Symbolic Approach Towards Identifying Grammatically Correct Sentences
Nicos Isaak
NAI
55
1
0
16 Jul 2023
DocTr: Document Transformer for Structured Information Extraction in Documents
Haofu Liao
Aruni RoyChowdhury
Weijian Li
Ankan Bansal
Yuting Zhang
Zhuowen Tu
R. Satzoda
R. Manmatha
Vijay Mahadevan
70
12
0
16 Jul 2023
SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Yi-Syuan Chen
Yun-Zhu Song
Cheng Yu Yeo
Bei Liu
Jianlong Fu
Hong-Han Shuai
VLM
LRM
92
4
0
15 Jul 2023
Intuitive Access to Smartphone Settings Using Relevance Model Trained by Contrastive Learning
Joonyoung Kim
Kangwook Lee
Haebin Shin
Hurnjoo Lee
Sechun Kang
Byunguk Choi
Dong Shin
Joohyung Lee
49
0
0
15 Jul 2023
Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph
Jiashuo Sun
Chengjin Xu
Lumingyuan Tang
Saizhuo Wang
Chen Lin
Yeyun Gong
Lionel M. Ni
H. Shum
Jian Guo
LRM
108
83
0
15 Jul 2023
Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT
Souvik Kundu
S. Nittur
Maciej Szankin
Sairam Sundaresan
MQ
62
2
0
14 Jul 2023
Can Large Language Models Empower Molecular Property Prediction?
Chao Qian
Huayi Tang
Zhi-Jiang Yang
Hongsi Liang
Y. Liu
AI4CE
91
41
0
14 Jul 2023
Learning Sparse Neural Networks with Identity Layers
Mingjian Ni
Guangyao Chen
Xiawu Zheng
Peixi Peng
Liuliang Yuan
Yonghong Tian
64
0
0
14 Jul 2023
Composition-contrastive Learning for Sentence Embeddings
Sachin Chanchani
Ruihong Huang
108
14
0
14 Jul 2023
Learn from Incomplete Tactile Data: Tactile Representation Learning with Masked Autoencoders
G. Cao
Jiaqi Jiang
Danushka Bollegala
Shan Luo
86
14
0
14 Jul 2023
Time for aCTIon: Automated Analysis of Cyber Threat Intelligence in the Wild
G. Siracusano
D. Sanvito
Roberto González
Manikantan Srinivasan
Sivakaman Kamatchi
Wataru Takahashi
Masaru Kawakita
Takahiro Kakumaru
R. Bifulco
89
16
0
14 Jul 2023
Unsupervised Domain Adaptation using Lexical Transformations and Label Injection for Twitter Data
Akshat Gupta
Xiaomo Liu
Sameena Shah
19
0
0
14 Jul 2023
Improving BERT with Hybrid Pooling Network and Drop Mask
Qian Chen
Wen Wang
Qinglin Zhang
Chong Deng
Ma Yukun
Siqi Zheng
38
1
0
14 Jul 2023
MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System
Libo Qin
Shijue Huang
Qiguang Chen
Chenran Cai
Yudi Zhang
Bin Liang
Wanxiang Che
Ruifeng Xu
59
35
0
14 Jul 2023
Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation
Letian Peng
Yuwei Zhang
Jingbo Shang
LRM
44
8
0
14 Jul 2023
MegaWika: Millions of reports and their sources across 50 diverse languages
Samuel Barham
Orion Weller
Michelle Yuan
Kenton W. Murray
M. Yarmohammadi
...
Alexander Martin
Anqi Liu
Aaron Steven White
Jordan L. Boyd-Graber
Benjamin Van Durme
SyDa
73
5
0
13 Jul 2023
Classical Out-of-Distribution Detection Methods Benchmark in Text Classification Tasks
M. Baran
Joanna Baran
Mateusz Wójcik
Maciej Ziȩba
Adam Gonczarek
OODD
74
5
0
13 Jul 2023
Intent-calibrated Self-training for Answer Selection in Open-domain Dialogues
Wentao Deng
Jiahuan Pei
Zhaochun Ren
Zhumin Chen
Pengjie Ren
65
5
0
13 Jul 2023
SecureFalcon: Are We There Yet in Automated Software Vulnerability Detection with LLMs?
M. Ferrag
A. Battah
Norbert Tihanyi
Ridhi Jain
Diana Maimut
...
Thierry Lestable
Narinderjit Singh Thandi
Abdechakour Mechri
Merouane Debbah
Lucas C. Cordeiro
87
10
0
13 Jul 2023
A Comprehensive Overview of Large Language Models
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Ajmal Mian
OffRL
257
622
0
12 Jul 2023
Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers
S. Latif
Muhammad Usama
Mohammad Ibrahim Malik
Björn W. Schuller
101
16
0
12 Jul 2023
DDNAS: Discretized Differentiable Neural Architecture Search for Text Classification
Kuan-Yu Chen
Cheng Li
Kuo-Jung Lee
82
1
0
12 Jul 2023
Stability Guarantees for Feature Attributions with Multiplicative Smoothing
Anton Xue
Rajeev Alur
Eric Wong
117
6
0
12 Jul 2023
Exploring the Emotional and Mental Well-Being of Individuals with Long COVID Through Twitter Analysis
Guocheng Feng
Huaiyu Cai
Wei Quan
17
1
0
11 Jul 2023
Previous
1
2
3
...
89
90
91
...
213
214
215
Next