Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 4,869 papers shown
Title
CoMPM: Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation
Joosung Lee
Woo-Ri Lee
30
77
0
26 Aug 2021
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models
Yiming Cui
Weinan Zhang
Wanxiang Che
Ting Liu
Zhigang Chen
Shijin Wang
LRM
25
9
0
26 Aug 2021
Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral
Lucio Dery
Yann N. Dauphin
David Grangier
MoMe
26
29
0
25 Aug 2021
Exploring the Promises of Transformer-Based LMs for the Representation of Normative Claims in the Legal Domain
Reto Gubelmann
Peter Hongler
Siegfried Handschuh
AILaw
19
0
0
25 Aug 2021
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens
Itay Itzhak
Omer Levy
17
18
0
25 Aug 2021
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts
Charangan Vasantharajan
Uthayasanker Thayasivam
28
38
0
24 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM
MLLM
51
782
0
24 Aug 2021
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
36
11
0
24 Aug 2021
Prompt-Learning for Fine-Grained Entity Typing
Ning Ding
Yulin Chen
Xu Han
Guangwei Xu
Pengjun Xie
Haitao Zheng
Zhiyuan Liu
Juan-Zi Li
Hong-Gee Kim
33
156
0
24 Aug 2021
Regularizing Transformers With Deep Probabilistic Layers
Aurora Cobo Aguilera
Pablo Martínez Olmos
Antonio Artés-Rodríguez
Fernando Pérez-Cruz
41
7
0
23 Aug 2021
Sarcasm Detection in Twitter -- Performance Impact while using Data Augmentation: Word Embeddings
Alif Tri Handoyo
Hidayat Ur Rahman
Derwin Suhartono
19
5
0
23 Aug 2021
Metric Learning in Multilingual Sentence Similarity Measurement for Document Alignment
Charith Rajitha
Lakmali Piyarathne
Dilan Sachintha
Surangika Ranathunga
24
3
0
21 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
26
77
0
20 Aug 2021
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang
Zerui Cai
Chengyu Wang
Minghui Qiu
Bite Yang
Xiaofeng He
AI4MH
28
52
0
20 Aug 2021
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Jianmo Ni
Gustavo Hernández Ábrego
Noah Constant
Ji Ma
Keith B. Hall
Daniel Cer
Yinfei Yang
63
532
0
19 Aug 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
34
11
0
19 Aug 2021
Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes
Mingjun Yin
Shasha Li
Zikui Cai
Chengyu Song
Ulugbek S. Kamilov
Amit K. Roy-Chowdhury
S. Krishnamurthy
AAML
19
19
0
19 Aug 2021
An Effective System for Multi-format Information Extraction
Yaduo Liu
Longhui Zhang
Shujuan Yin
Xiaofeng Zhao
Feiliang Ren
32
1
0
16 Aug 2021
On Multi-Modal Learning of Editing Source Code
Saikat Chakraborty
Baishakhi Ray
KELM
36
59
0
15 Aug 2021
MUSIQ: Multi-scale Image Quality Transformer
Junjie Ke
Qifei Wang
Yilin Wang
P. Milanfar
Feng Yang
177
632
0
12 Aug 2021
Bursting Scientific Filter Bubbles: Boosting Innovation via Novel Author Discovery
Jason Portenoy
Marissa Radensky
Jevin D. West
Eric Horvitz
Daniel S. Weld
Tom Hope
94
31
0
12 Aug 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Luyu Gao
Jamie Callan
RALM
175
330
0
12 Aug 2021
DeliData: A dataset for deliberation in multi-party problem solving
Georgi Karadzhov
Tom Stafford
Andreas Vlachos
34
16
0
11 Aug 2021
A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models
Zehao Yu
Xi Yang
Chong Dang
Songzi Wu
P. Adekkanattu
...
T. George
William R. Hogan
Yi Guo
Jiang Bian
Yonghui Wu
18
35
0
10 Aug 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
39
114
0
10 Aug 2021
COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews
Shruti Singh
M. Singh
Pawan Goyal
27
8
0
09 Aug 2021
Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images
Zhongwei Xie
Ling Liu
Lin Li
Luo Zhong
11
10
0
09 Aug 2021
Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
Kyunghoon Hur
Jiyoung Lee
Jungwoo Oh
Wesley Price
Young-Hak Kim
Edward Choi
46
17
0
08 Aug 2021
Facebook AI WMT21 News Translation Task Submission
C. Tran
Shruti Bhosale
James Cross
Philipp Koehn
Sergey Edunov
Angela Fan
VLM
139
81
0
06 Aug 2021
Decoupled Transformer for Scalable Inference in Open-domain Question Answering
Haytham ElFadeel
Stanislav Peshterliev
40
1
0
05 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDL
AI4CE
48
76
0
04 Aug 2021
HTTP2vec: Embedding of HTTP Requests for Detection of Anomalous Traffic
Mateusz Gniewkowski
H. Maciejewski
T. Surmacz
Wiktor Walentynowicz
35
8
0
03 Aug 2021
Linking Common Vulnerabilities and Exposures to the MITRE ATT&CK Framework: A Self-Distillation Approach
Benjamin Ampel
Sagar Samtani
Steven Ullman
Hsinchun Chen
25
35
0
03 Aug 2021
Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation
Zaid Khan
Y. Fu
43
132
0
03 Aug 2021
More but Correct: Generating Diversified and Entity-revised Medical Response
Bin Li
Encheng Chen
Hongrui Liu
Yixuan Weng
Bin Sun
Shutao Li
Yongping Bai
Meiling Hu
MedIm
27
11
0
03 Aug 2021
Representation learning for neural population activity with Neural Data Transformers
Joel Ye
C. Pandarinath
AI4TS
AI4CE
13
53
0
02 Aug 2021
PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors
Andrei-Marius Avram
V. Pais
D. Tufis
AILaw
VLM
29
17
0
02 Aug 2021
LICHEE: Improving Language Model Pre-training with Multi-grained Tokenization
Weidong Guo
Mingjun Zhao
Lusheng Zhang
Di Niu
Jinwen Luo
Zhenhua Liu
Zhenyang Li
J. Tang
32
8
0
02 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
38
44
0
02 Aug 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
32
238
0
02 Aug 2021
EmailSum: Abstractive Email Thread Summarization
Shiyue Zhang
Asli Celikyilmaz
Jianfeng Gao
Joey Tianyi Zhou
30
38
0
30 Jul 2021
IIITG-ADBU@HASOC-Dravidian-CodeMix-FIRE2020: Offensive Content Detection in Code-Mixed Dravidian Text
Arup Baruah
K. Das
F. Barbhuiya
Kuntal Dey
27
12
0
29 Jul 2021
Local Structure Matters Most: Perturbation Study in NLU
Louis Clouâtre
Prasanna Parthasarathi
Amal Zouaq
Sarath Chandar
32
13
0
29 Jul 2021
Domain-matched Pre-training Tasks for Dense Retrieval
Barlas Oğuz
Kushal Lakhotia
Anchit Gupta
Patrick Lewis
Vladimir Karpukhin
...
Xilun Chen
Sebastian Riedel
Wen-tau Yih
Sonal Gupta
Yashar Mehdad
RALM
33
66
0
28 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
120
3,858
0
28 Jul 2021
Sentiment Analysis of the COVID-related r/Depression Posts
Zihan Chen
Marina Sokolova
27
4
0
28 Jul 2021
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving
Zhenwen Liang
Jipeng Zhang
Lei Wang
Wei Qin
Yunshi Lan
Jie Shao
Xiangliang Zhang
AIMat
44
62
0
28 Jul 2021
An Evaluation of Generative Pre-Training Model-based Therapy Chatbot for Caregivers
Lu Wang
Munif Ishad Mujib
Jake Williams
G. Demiris
Jina Huh-Yoo
AI4MH
37
32
0
28 Jul 2021
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Cameron R. Wolfe
Keld T. Lundgaard
VLM
45
2
0
27 Jul 2021
Cross-lingual Transferring of Pre-trained Contextualized Language Models
Zuchao Li
Kevin Parnow
Hai Zhao
Zhuosheng Zhang
Rui Wang
Masao Utiyama
Eiichiro Sumita
13
8
0
27 Jul 2021
Previous
1
2
3
...
74
75
76
...
96
97
98
Next