Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,810 papers shown
Title
The Life Cycle of Knowledge in Big Language Models: A Survey
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
95
29
0
14 Mar 2023
Input-length-shortening and text generation via attention values
Necset Ozkan Tan
A. Peng
Joshua Bensemann
Qiming Bao
Tim Hartill
M. Gahegan
Michael Witbrock
84
1
0
14 Mar 2023
Architext: Language-Driven Generative Architecture Design
Theodoros Galanos
Antonios Liapis
Georgios N. Yannakakis
VLM
AI4CE
73
6
0
13 Mar 2023
AMOM: Adaptive Masking over Masking for Conditional Masked Language Model
Yisheng Xiao
Ruiyang Xu
Lijun Wu
Juntao Li
Tao Qin
Yan-Tie Liu
Hao Fei
44
9
0
13 Mar 2023
Model-tuning Via Prompts Makes NLP Models Adversarially Robust
Mrigank Raman
Pratyush Maini
J. Zico Kolter
Zachary Chase Lipton
Danish Pruthi
AAML
71
17
0
13 Mar 2023
Meet in the Middle: A New Pre-training Paradigm
A. Nguyen
Nikos Karampatziakis
Weizhu Chen
62
21
0
13 Mar 2023
Transformer-based approaches to Sentiment Detection
O. E. Ojo
Hoang Thang Ta
Alexander Gelbukh
Hiram Calvo
O. O. Adebanji
Grigori Sidorov
31
7
0
13 Mar 2023
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
Bo He
Jun Wang
Jielin Qiu
Trung Bui
Abhinav Shrivastava
Zhaowen Wang
91
71
0
13 Mar 2023
DPPMask: Masked Image Modeling with Determinantal Point Processes
Junde Xu
Zikai Lin
Donghao Zhou
Yao-Cheng Yang
Xiangyun Liao
Bian Wu
Guangyong Chen
Pheng-Ann Heng
86
1
0
13 Mar 2023
Addressing Biases in the Texts using an End-to-End Pipeline Approach
Shaina Raza
Syed Raza Bashir
Sneha
Urooj Qamar
57
0
0
13 Mar 2023
A Human Subject Study of Named Entity Recognition (NER) in Conversational Music Recommendation Queries
Elena V. Epure
Romain Hennequin
48
5
0
13 Mar 2023
LUKE-Graph: A Transformer-based Approach with Gated Relational Graph Attention for Cloze-style Reading Comprehension
Shima Foolad
Kourosh Kiani
39
3
0
12 Mar 2023
Improve Retrieval-based Dialogue System via Syntax-Informed Attention
Tengtao Song
Nuo Chen
Ji Jiang
Zhihong Zhu
Yuexian Zou
51
6
0
12 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A.B. Siddique
79
6
0
12 Mar 2023
Diffusion Models for Non-autoregressive Text Generation: A Survey
Yifan Li
Kun Zhou
Wayne Xin Zhao
Ji-Rong Wen
MedIm
DiffM
116
36
0
12 Mar 2023
Compressed Heterogeneous Graph for Abstractive Multi-Document Summarization
Miao Li
Jianzhong Qi
Jey Han Lau
70
11
0
12 Mar 2023
Multimodal Data Integration for Oncology in the Era of Deep Neural Networks: A Review
Asim Waqas
Aakash Tripathi
Ravichandran Ramachandran
Paul Stewart
Ghulam Rasool
AI4CE
121
37
0
11 Mar 2023
Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation
Bing He
M. Ahamad
Srijan Kumar
OffRL
60
46
0
11 Mar 2023
Consistency Analysis of ChatGPT
Myeongjun Jang
Thomas Lukasiewicz
95
56
0
11 Mar 2023
Do large language models resemble humans in language use?
Zhenguang G. Cai
Xufeng Duan
David A. Haslett
Shuqi Wang
M. Pickering
ALM
127
41
0
10 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
95
47
0
10 Mar 2023
Detection of Abuse in Financial Transaction Descriptions Using Machine Learning
A. Leontjeva
Genevieve Richards
Kaavya Sriskandaraja
Jessica Perchman
L. Pizzato
13
0
0
10 Mar 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo
James R. Glass
NAI
59
7
0
10 Mar 2023
Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors
Mesut Erhan Unal
Adriana Kovashka
VLM
75
5
0
09 Mar 2023
Dynamic Stashing Quantization for Efficient Transformer Training
Guofu Yang
Daniel Lo
Robert D. Mullins
Yiren Zhao
MQ
88
8
0
09 Mar 2023
Detecting Images Generated by Diffusers
D. Coccomini
Andrea Esuli
Fabrizio Falchi
Claudio Gennaro
Giuseppe Amato
DiffM
88
15
0
09 Mar 2023
Can a Frozen Pretrained Language Model be used for Zero-shot Neural Retrieval on Entity-centric Questions?
Yasuto Hoshi
Daisuke Miyashita
Yasuhiro Morioka
Youyang Ng
Osamu Torii
J. Deguchi
58
0
0
09 Mar 2023
Multi-Stage Coarse-to-Fine Contrastive Learning for Conversation Intent Induction
Caiyuan Chu
Ya Li
Yifan Liu
Jia-Chen Gu
Quan Liu
Yongxin Ge
Guoping Hu
97
0
0
09 Mar 2023
Lexical Complexity Prediction: An Overview
Kai North
Marcos Zampieri
Matthew Shardlow
63
26
0
08 Mar 2023
RAF: Holistic Compilation for Deep Learning Model Training
Cody Hao Yu
Haozheng Fan
Guangtai Huang
Zhen Jia
Yizhi Liu
...
Yuan Zhou
Haichen Shen
Junru Shao
Mu Li
Yida Wang
72
3
0
08 Mar 2023
Extrapolative Controlled Sequence Generation via Iterative Refinement
Vishakh Padmakumar
Richard Yuanzhe Pang
He He
Ankur P. Parikh
82
10
0
08 Mar 2023
Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search
Guanshuo Wang
Fufu Yu
Jianing Li
Qiong Jia
Shouhong Ding
66
18
0
08 Mar 2023
Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Ruixiang Tang
Xiaotian Han
Xiaoqian Jiang
Helen Zhou
LM&MA
AI4MH
SyDa
104
186
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
120
554
0
07 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
81
122
0
07 Mar 2023
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification
Taja Kuzman
I. Mozetič
Nikola Ljubesic
114
94
0
07 Mar 2023
A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course Summarization
Griffin Adams
Jason Zucker
Noémie Elhadad
93
23
0
07 Mar 2023
A Challenging Benchmark for Low-Resource Learning
Yudong Wang
Chang Ma
Qingxiu Dong
Lingpeng Kong
Jingjing Xu
88
4
0
07 Mar 2023
German BERT Model for Legal Named Entity Recognition
Harsh Darji
Jelena Mitrović
Michael Granitzer
AILaw
31
14
0
07 Mar 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
53
28
0
07 Mar 2023
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models
Jinjie Ni
Yukun Ma
Wen Wang
Qian Chen
Dianwen Ng
Han Lei
Trung Hieu Nguyen
Chong Zhang
B. Ma
Min Zhang
43
2
0
07 Mar 2023
ADELT: Transpilation Between Deep Learning Frameworks
Linyuan Gong
Jiayi Wang
Alvin Cheung
59
3
0
07 Mar 2023
Two-stage Pipeline for Multilingual Dialect Detection
Ankit Vaidya
Aditya Kane
80
5
0
06 Mar 2023
Depression Detection Using Digital Traces on Social Media: A Knowledge-aware Deep Learning Approach
Wenli Zhang
Jiaheng Xie
Zhuocheng Zhang
Xiang Liu
76
10
0
06 Mar 2023
Referring Multi-Object Tracking
Dongming Wu
Wencheng Han
Tiancai Wang
Xingping Dong
Xiangyu Zhang
Jianbing Shen
114
80
0
06 Mar 2023
AmQA: Amharic Question Answering Dataset
Tilahun Abedissa
Ricardo Usbeck
Yaregal Assabie
71
1
0
06 Mar 2023
SC-Block: Supervised Contrastive Blocking within Entity Resolution Pipelines
Alexander Brinkmann
Roee Shraga
Christian Bizer
91
10
0
06 Mar 2023
IFAN: An Explainability-Focused Interaction Framework for Humans and NLP Models
Edoardo Mosca
Daryna Dementieva
Tohid Ebrahim Ajdari
Maximilian Kummeth
Kirill Gringauz
Yutong Zhou
Georg Groh
100
8
0
06 Mar 2023
Dynamic Prompting: A Unified Framework for Prompt Tuning
Xianjun Yang
Wei Cheng
Xujiang Zhao
Wenchao Yu
Linda R. Petzold
Haifeng Chen
VLM
119
16
0
06 Mar 2023
A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification
Xiang Hu
Xinyu Kong
Kewei Tu
MILM
BDL
67
5
0
06 Mar 2023
Previous
1
2
3
...
115
116
117
...
215
216
217
Next