RoBERTa: A Robustly Optimized BERT Pretraining Approach


26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 4,869 papers shown
CoMPM: Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation
Joosung Lee
Woo-Ri Lee
30
77
0
26 Aug 2021
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models
Yiming Cui
Weinan Zhang
Wanxiang Che
Ting Liu
Zhigang Chen
Shijin Wang
LRM
25
9
0
26 Aug 2021
Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral
Lucio Dery
Yann N. Dauphin
David Grangier
MoMe
26
29
0
25 Aug 2021
Exploring the Promises of Transformer-Based LMs for the Representation of Normative Claims in the Legal Domain
Reto Gubelmann
Peter Hongler
Siegfried Handschuh
AILaw
19
0
0
25 Aug 2021
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens
Itay Itzhak
Omer Levy
17
18
0
25 Aug 2021
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts
Charangan Vasantharajan
Uthayasanker Thayasivam
28
38
0
24 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM
MLLM
51
782
0
24 Aug 2021
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
36
11
0
24 Aug 2021
Prompt-Learning for Fine-Grained Entity Typing
Ning Ding
Yulin Chen
Xu Han
Guangwei Xu
Pengjun Xie
Haitao Zheng
Zhiyuan Liu
Juan-Zi Li
Hong-Gee Kim
33
156
0
24 Aug 2021
Regularizing Transformers With Deep Probabilistic Layers
Aurora Cobo Aguilera
Pablo Martínez Olmos
Antonio Artés-Rodríguez
Fernando Pérez-Cruz
41
7
0
23 Aug 2021
Sarcasm Detection in Twitter -- Performance Impact while using Data Augmentation: Word Embeddings
Alif Tri Handoyo
Hidayat Ur Rahman
Derwin Suhartono
19
5
0
23 Aug 2021
Metric Learning in Multilingual Sentence Similarity Measurement for Document Alignment
Charith Rajitha
Lakmali Piyarathne
Dilan Sachintha
Surangika Ranathunga
24
3
0
21 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
26
77
0
20 Aug 2021
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang
Zerui Cai
Chengyu Wang
Minghui Qiu
Bite Yang
Xiaofeng He
AI4MH
28
52
0
20 Aug 2021
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Jianmo Ni
Gustavo Hernández Ábrego
Noah Constant
Ji Ma
Keith B. Hall
Daniel Cer
Yinfei Yang
63
532
0
19 Aug 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
34
11
0
19 Aug 2021
Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes
Mingjun Yin
Shasha Li
Zikui Cai
Chengyu Song
Ulugbek S. Kamilov
Amit K. Roy-Chowdhury
S. Krishnamurthy
AAML
19
19
0
19 Aug 2021
An Effective System for Multi-format Information Extraction
Yaduo Liu
Longhui Zhang
Shujuan Yin
Xiaofeng Zhao
Feiliang Ren
32
1
0
16 Aug 2021
On Multi-Modal Learning of Editing Source Code
Saikat Chakraborty
Baishakhi Ray
KELM
36
59
0
15 Aug 2021
MUSIQ: Multi-scale Image Quality Transformer
Junjie Ke
Qifei Wang
Yilin Wang
P. Milanfar
Feng Yang
177
632
0
12 Aug 2021
Bursting Scientific Filter Bubbles: Boosting Innovation via Novel Author Discovery
Jason Portenoy
Marissa Radensky
Jevin D. West
Eric Horvitz
Daniel S. Weld
Tom Hope
94
31
0
12 Aug 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Luyu Gao
Jamie Callan
RALM
175
330
0
12 Aug 2021
DeliData: A dataset for deliberation in multi-party problem solving
Georgi Karadzhov
Tom Stafford
Andreas Vlachos
34
16
0
11 Aug 2021
A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models
Zehao Yu
Xi Yang
Chong Dang
Songzi Wu
P. Adekkanattu
...
T. George
William R. Hogan
Yi Guo
Jiang Bian
Yonghui Wu
18
35
0
10 Aug 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
39
114
0
10 Aug 2021
COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews
Shruti Singh
M. Singh
Pawan Goyal
27
8
0
09 Aug 2021
Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images
Zhongwei Xie
Ling Liu
Lin Li
Luo Zhong
11
10
0
09 Aug 2021
Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
Kyunghoon Hur
Jiyoung Lee
Jungwoo Oh
Wesley Price
Young-Hak Kim
Edward Choi
46
17
0
08 Aug 2021
Facebook AI WMT21 News Translation Task Submission
C. Tran
Shruti Bhosale
James Cross
Philipp Koehn
Sergey Edunov
Angela Fan
VLM
139
81
0
06 Aug 2021
Decoupled Transformer for Scalable Inference in Open-domain Question Answering
Haytham ElFadeel
Stanislav Peshterliev
40
1
0
05 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDL
AI4CE
48
76
0
04 Aug 2021
HTTP2vec: Embedding of HTTP Requests for Detection of Anomalous Traffic
Mateusz Gniewkowski
H. Maciejewski
T. Surmacz
Wiktor Walentynowicz
35
8
0
03 Aug 2021
Linking Common Vulnerabilities and Exposures to the MITRE ATT&CK Framework: A Self-Distillation Approach
Benjamin Ampel
Sagar Samtani
Steven Ullman
Hsinchun Chen
25
35
0
03 Aug 2021
Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation
Zaid Khan
Y. Fu
43
132
0
03 Aug 2021
More but Correct: Generating Diversified and Entity-revised Medical Response
Bin Li
Encheng Chen
Hongrui Liu
Yixuan Weng
Bin Sun
Shutao Li
Yongping Bai
Meiling Hu
MedIm
27
11
0
03 Aug 2021
Representation learning for neural population activity with Neural Data Transformers
Joel Ye
C. Pandarinath
AI4TS
AI4CE
13
53
0
02 Aug 2021
PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors
Andrei-Marius Avram
V. Pais
D. Tufis
AILaw
VLM
29
17
0
02 Aug 2021
LICHEE: Improving Language Model Pre-training with Multi-grained Tokenization
Weidong Guo
Mingjun Zhao
Lusheng Zhang
Di Niu
Jinwen Luo
Zhenhua Liu
Zhenyang Li
J. Tang
32
8
0
02 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
38
44
0
02 Aug 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
32
238
0
02 Aug 2021
EmailSum: Abstractive Email Thread Summarization
Shiyue Zhang
Asli Celikyilmaz
Jianfeng Gao
Joey Tianyi Zhou
30
38
0
30 Jul 2021
IIITG-ADBU@HASOC-Dravidian-CodeMix-FIRE2020: Offensive Content Detection in Code-Mixed Dravidian Text
Arup Baruah
K. Das
F. Barbhuiya
Kuntal Dey
27
12
0
29 Jul 2021
Local Structure Matters Most: Perturbation Study in NLU
Louis Clouâtre
Prasanna Parthasarathi
Amal Zouaq
Sarath Chandar
32
13
0
29 Jul 2021
Domain-matched Pre-training Tasks for Dense Retrieval
Barlas Oğuz
Kushal Lakhotia
Anchit Gupta
Patrick Lewis
Vladimir Karpukhin
...
Xilun Chen
Sebastian Riedel
Wen-tau Yih
Sonal Gupta
Yashar Mehdad
RALM
33
66
0
28 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
120
3,858
0
28 Jul 2021
Sentiment Analysis of the COVID-related r/Depression Posts
Zihan Chen
Marina Sokolova
27
4
0
28 Jul 2021
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving
Zhenwen Liang
Jipeng Zhang
Lei Wang
Wei Qin
Yunshi Lan
Jie Shao
Xiangliang Zhang
AIMat
44
62
0
28 Jul 2021
An Evaluation of Generative Pre-Training Model-based Therapy Chatbot for Caregivers
Lu Wang
Munif Ishad Mujib
Jake Williams
G. Demiris
Jina Huh-Yoo
AI4MH
37
32
0
28 Jul 2021
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Cameron R. Wolfe
Keld T. Lundgaard
VLM
45
2
0
27 Jul 2021
Cross-lingual Transferring of Pre-trained Contextualized Language Models
Zuchao Li
Kevin Parnow
Hai Zhao
Zhuosheng Zhang
Rui Wang
Masao Utiyama
Eiichiro Sumita
13
8
0
27 Jul 2021