ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,845 papers shown
Title
BERT-Flow-VAE: A Weakly-supervised Model for Multi-Label Text
  Classification
BERT-Flow-VAE: A Weakly-supervised Model for Multi-Label Text Classification
Ziwen Liu
J. Grau-Bové
Scott Orr
82
1
0
27 Oct 2022
TASA: Deceiving Question Answering Models by Twin Answer Sentences
  Attack
TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack
Yu Cao
Dianqi Li
Meng Fang
Dinesh Manocha
Jun Gao
Yibing Zhan
Dacheng Tao
AAML
83
17
0
27 Oct 2022
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with
  Contrastive and Distributionally Robust Learning
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning
Yue Yu
Chenyan Xiong
Si Sun
Chao Zhang
Arnold Overwijk
VLMOOD
162
22
0
27 Oct 2022
Disentangled and Robust Representation Learning for Bragging
  Classification in Social Media
Disentangled and Robust Representation Learning for Bragging Classification in Social Media
Xiang Li
Yucheng Zhou
96
3
0
27 Oct 2022
Dictionary-Assisted Supervised Contrastive Learning
Dictionary-Assisted Supervised Contrastive Learning
Patrick Y. Wu
Richard Bonneau
Joshua A. Tucker
Jonathan Nagler
CLIP
70
0
0
27 Oct 2022
DyREx: Dynamic Query Representation for Extractive Question Answering
DyREx: Dynamic Query Representation for Extractive Question Answering
Urchade Zaratiana
Niama El Khbir
Dennis Núñez
Pierre Holat
Nadi Tomeh
Thierry Charnois
115
2
0
26 Oct 2022
Privately Fine-Tuning Large Language Models with Differential Privacy
Privately Fine-Tuning Large Language Models with Differential Privacy
R. Behnia
Mohammadreza Ebrahimi
Jason L. Pacheco
B. Padmanabhan
135
51
0
26 Oct 2022
Robust Domain Adaptation for Pre-trained Multilingual Neural Machine
  Translation Models
Robust Domain Adaptation for Pre-trained Multilingual Neural Machine Translation Models
Mathieu Grosso
Pirashanth Ratnamogan
Alexis Mathey
William Vanhuffel
Michael Fotso Fotso
58
3
0
26 Oct 2022
MABEL: Attenuating Gender Bias using Textual Entailment Data
MABEL: Attenuating Gender Bias using Textual Entailment Data
Jacqueline He
Mengzhou Xia
C. Fellbaum
Danqi Chen
60
32
0
26 Oct 2022
Causality Detection using Multiple Annotation Decisions
Causality Detection using Multiple Annotation Decisions
Quynh-Anh Nguyen
Arka Mitra
29
2
0
26 Oct 2022
Incorporating Pre-training Paradigm for Antibody Sequence-Structure
  Co-design
Incorporating Pre-training Paradigm for Antibody Sequence-Structure Co-design
Kaiyuan Gao
Lijun Wu
Jinhua Zhu
Tianbo Peng
Yingce Xia
...
Shufang Xie
Tao Qin
Haiguang Liu
Kun He
Tie-Yan Liu
97
10
0
26 Oct 2022
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of
  Downstream Tasks
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks
Colin Leong
Joshua Nemecek
Jacob Mansdorfer
Anna Filighera
A. Owodunni
Daniel Whitenack
VLMAI4CE
171
29
0
26 Oct 2022
Learning on Large-scale Text-attributed Graphs via Variational Inference
Learning on Large-scale Text-attributed Graphs via Variational Inference
Jianan Zhao
Meng Qu
Chaozhuo Li
Hao Yan
Qian Liu
Rui Li
Xing Xie
Jian Tang
VLM
145
142
0
26 Oct 2022
Analyzing Multi-Task Learning for Abstractive Text Summarization
Analyzing Multi-Task Learning for Abstractive Text Summarization
Frederic Kirstein
Jan Philip Wahle
Terry Ruas
Bela Gipp
83
4
0
26 Oct 2022
Benchmarking Language Models for Code Syntax Understanding
Benchmarking Language Models for Code Syntax Understanding
Da Shen
Xinyun Chen
Chenguang Wang
Koushik Sen
Dawn Song
ELM
59
17
0
26 Oct 2022
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning
Yifan Chen
Devamanyu Hazarika
Mahdi Namazifar
Yang Liu
Di Jin
Dilek Z. Hakkani-Tür
71
4
0
26 Oct 2022
Discourse-Aware Emotion Cause Extraction in Conversations
Discourse-Aware Emotion Cause Extraction in Conversations
Dexin Kong
Nan Yu
Yun Yuan
Guohong Fu
Chen Gong
55
2
0
26 Oct 2022
Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in
  Financial Sentiment Analysis
Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis
Sudhandar Balakrishnan
Yihao Fang
Xioadan Zhu
48
1
0
26 Oct 2022
Automatic extraction of materials and properties from superconductors
  scientific literature
Automatic extraction of materials and properties from superconductors scientific literature
Luca Foppiano
P. B. Castro
Pedro Ortiz Suarez
K. Terashima
Y. Takano
Masashi Ishii
74
12
0
26 Oct 2022
Synthetic Text Generation with Differential Privacy: A Simple and
  Practical Recipe
Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe
Xiang Yue
Huseyin A. Inan
Xuechen Li
Girish Kumar
Julia McAnallen
Hoda Shajari
Huan Sun
David Levitan
Robert Sim
154
86
0
25 Oct 2022
Causal Analysis of Syntactic Agreement Neurons in Multilingual Language
  Models
Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Aaron Mueller
Yudi Xia
Tal Linzen
MILM
113
10
0
25 Oct 2022
OpenStance: Real-world Zero-shot Stance Detection
OpenStance: Real-world Zero-shot Stance Detection
Hanzi Xu
Slobodan Vučetić
Wenpeng Yin
69
22
0
25 Oct 2022
Universal Evasion Attacks on Summarization Scoring
Universal Evasion Attacks on Summarization Scoring
Wenchuan Mu
Kwan Hui Lim
AAML
87
1
0
25 Oct 2022
MOFormer: Self-Supervised Transformer model for Metal-Organic Framework
  Property Prediction
MOFormer: Self-Supervised Transformer model for Metal-Organic Framework Property Prediction
Zhonglin Cao
Rishikesh Magar
Yuyang Wang
A. Farimani
AI4CE
106
101
0
25 Oct 2022
Weakly Supervised Data Augmentation Through Prompting for Dialogue
  Understanding
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Maximillian Chen
Alexandros Papangelis
Chenyang Tao
Andrew Rosenbaum
Seokhwan Kim
Yang Liu
Zhou Yu
Dilek Z. Hakkani-Tür
110
35
0
25 Oct 2022
PolyHope: Two-Level Hope Speech Detection from Tweets
PolyHope: Two-Level Hope Speech Detection from Tweets
F. Balouchzahi
Grigori Sidorov
Alexander Gelbukh
53
50
0
25 Oct 2022
Exploring Mode Connectivity for Pre-trained Language Models
Exploring Mode Connectivity for Pre-trained Language Models
Yujia Qin
Cheng Qian
Jing Yi
Weize Chen
Yankai Lin
Xu Han
Zhiyuan Liu
Maosong Sun
Jie Zhou
99
21
0
25 Oct 2022
Are All Spurious Features in Natural Language Alike? An Analysis through
  a Causal Lens
Are All Spurious Features in Natural Language Alike? An Analysis through a Causal Lens
Nitish Joshi
X. Pan
Hengxing He
CML
126
30
0
25 Oct 2022
This joke is [MASK]: Recognizing Humor and Offense with Prompting
This joke is [MASK]: Recognizing Humor and Offense with Prompting
Junze Li
Mengjie Zhao
Yubo Xie
Antonis Maronikolakis
Pearl Pu
Hinrich Schütze
AAML
61
1
0
25 Oct 2022
SepLL: Separating Latent Class Labels from Weak Supervision Noise
SepLL: Separating Latent Class Labels from Weak Supervision Noise
Andreas Stephan
Vasiliki Kougia
Benjamin Roth
55
8
0
25 Oct 2022
Multilingual Relation Classification via Efficient and Effective
  Prompting
Multilingual Relation Classification via Efficient and Effective Prompting
Yuxuan Chen
David Harbecke
Leonhard Hennig
LRM
87
12
0
25 Oct 2022
FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation
FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation
Chen Zhang
L. F. D’Haro
Qiquan Zhang
Thomas Friedrichs
Haizhou Li
88
16
0
25 Oct 2022
Improving Imbalanced Text Classification with Dynamic Curriculum
  Learning
Improving Imbalanced Text Classification with Dynamic Curriculum Learning
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
67
4
0
25 Oct 2022
Improving Speech Representation Learning via Speech-level and
  Phoneme-level Masking Approach
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach
Xulong Zhang
Jianzong Wang
Ning Cheng
Kexin Zhu
Jing Xiao
71
1
0
25 Oct 2022
Parameter-Efficient Legal Domain Adaptation
Parameter-Efficient Legal Domain Adaptation
Jonathan Li
R. Bhambhoria
Xiao-Dan Zhu
ELMAILawALM
103
14
0
25 Oct 2022
Evaluating Parameter Efficient Learning for Generation
Evaluating Parameter Efficient Learning for Generation
Peng Xu
M. Patwary
Shrimai Prabhumoye
Virginia Adams
R. Prenger
Ming-Yu Liu
Nayeon Lee
Mohammad Shoeybi
Bryan Catanzaro
MoE
72
3
0
25 Oct 2022
Toward an Intelligent Tutoring System for Argument Mining in Legal Texts
Toward an Intelligent Tutoring System for Argument Mining in Legal Texts
Hannes Westermann
Jaromír Šavelka
Vern R. Walker
Kevin D. Ashley
Karim Benyekhlef
60
4
0
24 Oct 2022
Datavoidant: An AI System for Addressing Political Data Voids on Social
  Media
Datavoidant: An AI System for Addressing Political Data Voids on Social Media
Claudia Flores-Saviaga
Shangbin Feng
Saiph Savage
86
16
0
24 Oct 2022
ExPUNations: Augmenting Puns with Keywords and Explanations
ExPUNations: Augmenting Puns with Keywords and Explanations
Jiao Sun
Anjali Narayan-Chen
Shereen Oraby
Alessandra Cervone
Tagyoung Chung
Jing Huang
Yang Liu
Nanyun Peng
81
10
0
24 Oct 2022
MetaFormer Baselines for Vision
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
110
171
0
24 Oct 2022
Cascading Biases: Investigating the Effect of Heuristic Annotation
  Strategies on Data and Models
Cascading Biases: Investigating the Effect of Heuristic Annotation Strategies on Data and Models
Chaitanya Malaviya
Sudeep Bhatia
Mark Yatskar
72
4
0
24 Oct 2022
Legal-Tech Open Diaries: Lesson learned on how to develop and deploy
  light-weight models in the era of humongous Language Models
Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models
Stelios Maroudas
Sotiris Legkas
Prodromos Malakasiotis
Ilias Chalkidis
VLMAILawALMELM
84
4
0
24 Oct 2022
A Unified Framework for Pun Generation with Humor Principles
A Unified Framework for Pun Generation with Humor Principles
Yufei Tian
Divyanshu Sheth
Nanyun Peng
89
14
0
24 Oct 2022
Multilingual Auxiliary Tasks Training: Bridging the Gap between
  Languages for Zero-Shot Transfer of Hate Speech Detection Models
Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models
Syrielle Montariol
Arij Riabi
Djamé Seddah
88
12
0
24 Oct 2022
Investigating the detection of Tortured Phrases in Scientific Literature
Investigating the detection of Tortured Phrases in Scientific Literature
Puthineath Lay
M. Lentschat
Cyril Labbe
67
5
0
24 Oct 2022
Modeling Information Change in Science Communication with Semantically
  Matched Paraphrases
Modeling Information Change in Science Communication with Semantically Matched Paraphrases
Dustin Wright
Jiaxin Pei
David Jurgens
Isabelle Augenstein
95
16
0
24 Oct 2022
Language-free Training for Zero-shot Video Grounding
Language-free Training for Zero-shot Video Grounding
Dahye Kim
Jungin Park
Jiyoung Lee
S. Park
Kwanghoon Sohn
96
21
0
24 Oct 2022
Exploring Euphemism Detection in Few-Shot and Zero-Shot Settings
Exploring Euphemism Detection in Few-Shot and Zero-Shot Settings
Sedrick Scott Keh
50
7
0
24 Oct 2022
Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun
  Property Prediction
Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction
Yue Yang
Artemis Panagopoulou
Marianna Apidianaki
Mark Yatskar
Chris Callison-Burch
113
2
0
24 Oct 2022
Event-Centric Question Answering via Contrastive Learning and Invertible
  Event Transformation
Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation
Junru Lu
Xingwei Tan
Gabriele Pergola
Lin Gui
Yulan He
98
11
0
24 Oct 2022
Previous
123...132133134...215216217
Next