ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,764 papers shown
Title
Plug-and-Play Document Modules for Pre-trained Models
Plug-and-Play Document Modules for Pre-trained Models
Chaojun Xiao
Zhengyan Zhang
Xu Han
Chi-Min Chan
Yankai Lin
Zhiyuan Liu
Xiangyang Li
Zhonghua Li
Bo Zhao
Maosong Sun
KELM
102
6
0
28 May 2023
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge
  Interaction Graph for Lightweight Text-Image Retrieval
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Jiapeng Wang
Chengyu Wang
Xiaodan Wang
Jun Huang
Lianwen Jin
VLM
113
5
0
28 May 2023
AI Coach Assist: An Automated Approach for Call Recommendation in
  Contact Centers for Agent Coaching
AI Coach Assist: An Automated Approach for Call Recommendation in Contact Centers for Agent Coaching
Md Tahmid Rahman Laskar
Cheng Chen
Xue-Yong Fu
M. Azizi
Shashi Bhushan
Simon Corston-Oliver
56
2
0
28 May 2023
Diagnosing Transformers: Illuminating Feature Spaces for Clinical
  Decision-Making
Diagnosing Transformers: Illuminating Feature Spaces for Clinical Decision-Making
Aliyah R. Hsu
Yeshwanth Cherapanamjeri
Briton Park
Tristan Naumann
A. Odisho
Bin Yu
MedIm
60
0
0
27 May 2023
PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Qingqing Cao
Bhargavi Paranjape
Hannaneh Hajishirzi
MLLMVLM
75
27
0
27 May 2023
CIF-PT: Bridging Speech and Text Representations for Spoken Language
  Understanding via Continuous Integrate-and-Fire Pre-Training
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training
Linhao Dong
Zhecheng An
Peihao Wu
Jun Zhang
Lu Lu
Zejun Ma
49
6
0
27 May 2023
The Curse of Recursion: Training on Generated Data Makes Models Forget
The Curse of Recursion: Training on Generated Data Makes Models Forget
Ilia Shumailov
Zakhar Shumaylov
Yiren Zhao
Y. Gal
Nicolas Papernot
Ross J. Anderson
DiffM
95
301
0
27 May 2023
A Match Made in Heaven: A Multi-task Framework for Hyperbole and
  Metaphor Detection
A Match Made in Heaven: A Multi-task Framework for Hyperbole and Metaphor Detection
Naveen Badathala
Abisek Rajakumar Kalarani
Tejpalsingh Siledar
P. Bhattacharyya
49
12
0
27 May 2023
Fine-tuning Happens in Tiny Subspaces: Exploring Intrinsic Task-specific
  Subspaces of Pre-trained Language Models
Fine-tuning Happens in Tiny Subspaces: Exploring Intrinsic Task-specific Subspaces of Pre-trained Language Models
Zhong Zhang
Bang Liu
Junming Shao
83
9
0
27 May 2023
Query-Efficient Black-Box Red Teaming via Bayesian Optimization
Query-Efficient Black-Box Red Teaming via Bayesian Optimization
Deokjae Lee
JunYeong Lee
Jung-Woo Ha
Jin-Hwa Kim
Sang-Woo Lee
Hwaran Lee
Hyun Oh Song
AAML
91
25
0
27 May 2023
Weaker Than You Think: A Critical Look at Weakly Supervised Learning
Weaker Than You Think: A Critical Look at Weakly Supervised Learning
D. Zhu
Xiaoyu Shen
Marius Mosbach
Andreas Stephan
Dietrich Klakow
NoLa
87
9
0
27 May 2023
Modeling Adversarial Attack on Pre-trained Language Models as Sequential
  Decision Making
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making
Xuanjie Fang
Sijie Cheng
Yang Liu
Wen Wang
AAML
63
9
0
27 May 2023
Zero- and Few-Shot Event Detection via Prompt-Based Meta Learning
Zero- and Few-Shot Event Detection via Prompt-Based Meta Learning
Zhenrui Yue
Huimin Zeng
Mengfei Lan
Heng Ji
D. Wang
VLM
70
14
0
27 May 2023
Fine-Tuning Language Models with Just Forward Passes
Fine-Tuning Language Models with Just Forward Passes
Sadhika Malladi
Tianyu Gao
Eshaan Nichani
Alexandru Damian
Jason D. Lee
Danqi Chen
Sanjeev Arora
160
205
0
27 May 2023
Why Does Zero-Shot Cross-Lingual Generation Fail? An Explanation and a
  Solution
Why Does Zero-Shot Cross-Lingual Generation Fail? An Explanation and a Solution
Tianjian Li
Kenton W. Murray
104
26
0
27 May 2023
Slide, Constrain, Parse, Repeat: Synchronous SlidingWindows for Document
  AMR Parsing
Slide, Constrain, Parse, Repeat: Synchronous SlidingWindows for Document AMR Parsing
Yara Rizk
Tahira Naseem
Ramón Fernández Astudillo
Radu Florian
Salim Roukos
89
0
0
26 May 2023
Metaphor Detection via Explicit Basic Meanings Modelling
Metaphor Detection via Explicit Basic Meanings Modelling
Yucheng Li
Shunyu Wang
Chenghua Lin
Guerin Frank
133
20
0
26 May 2023
Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale
Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale
Vijeta Deshpande
Dan Pechi
Shree Thatte
Vladislav Lialin
Anna Rumshisky
121
8
0
26 May 2023
Entailment as Robust Self-Learner
Entailment as Robust Self-Learner
Jiaxin Ge
Hongyin Luo
Yoon Kim
James R. Glass
109
3
0
26 May 2023
Characterizing and Measuring Linguistic Dataset Drift
Characterizing and Measuring Linguistic Dataset Drift
Tyler A. Chang
Kishaloy Halder
Neha Ann John
Yogarshi Vyas
Yassine Benajiba
Miguel Ballesteros
Dan Roth
71
2
0
26 May 2023
Exploiting Abstract Meaning Representation for Open-Domain Question
  Answering
Exploiting Abstract Meaning Representation for Open-Domain Question Answering
Cunxiang Wang
Zhikun Xu
Qipeng Guo
Xiangkun Hu
Xuefeng Bai
Zheng Zhang
Yue Zhang
88
4
0
26 May 2023
SOC: Semantic-Assisted Object Cluster for Referring Video Object
  Segmentation
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
Zhuoyan Luo
Yicheng Xiao
Yong-Jin Liu
Shuyan Li
Yitong Wang
Yansong Tang
Xiu Li
Yujiu Yang
VOS
67
38
0
26 May 2023
NormBank: A Knowledge Bank of Situational Social Norms
NormBank: A Knowledge Bank of Situational Social Norms
Caleb Ziems
Jane Dwivedi-Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
109
45
0
26 May 2023
Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and
  Evaluation
Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation
Marius Mosbach
Tiago Pimentel
Shauli Ravfogel
Dietrich Klakow
Yanai Elazar
108
135
0
26 May 2023
UMSE: Unified Multi-scenario Summarization Evaluation
UMSE: Unified Multi-scenario Summarization Evaluation
Shen Gao
Zhitao Yao
Chongyang Tao
Preslav Nakov
Fajie Yuan
Zhaochun Ren
Zhumin Chen
86
5
0
26 May 2023
Model-Based Simulation for Optimising Smart Reply
Model-Based Simulation for Optimising Smart Reply
Benjamin Towle
Ke Zhou
69
1
0
26 May 2023
KNSE: A Knowledge-aware Natural Language Inference Framework for
  Dialogue Symptom Status Recognition
KNSE: A Knowledge-aware Natural Language Inference Framework for Dialogue Symptom Status Recognition
Wei Chen
Shiqi Wei
Zhongyu Wei
Xuanjing Huang
65
6
0
26 May 2023
On convex decision regions in deep network representations
On convex decision regions in deep network representations
Lenka Tvetková
Thea Brusch
Teresa Scheidt
Fabian Martin Mager
R. Aagaard
Jonathan Foldager
T. S. Alstrøm
Lars Kai Hansen
83
2
0
26 May 2023
Towards a Common Understanding of Contributing Factors for Cross-Lingual
  Transfer in Multilingual Language Models: A Review
Towards a Common Understanding of Contributing Factors for Cross-Lingual Transfer in Multilingual Language Models: A Review
Fred Philippy
Siwen Guo
Shohreh Haddadan
LRM
72
37
0
26 May 2023
Parameter-Efficient Fine-Tuning without Introducing New Latency
Parameter-Efficient Fine-Tuning without Introducing New Latency
Baohao Liao
Yan Meng
Christof Monz
59
56
0
26 May 2023
AlignScore: Evaluating Factual Consistency with a Unified Alignment
  Function
AlignScore: Evaluating Factual Consistency with a Unified Alignment Function
Yuheng Zha
Yichi Yang
Ruichen Li
Zhiting Hu
HILM
120
208
0
26 May 2023
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction
  Model
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model
I-Hung Hsu
Zhiyu Xie
Kuan-Hao Huang
Premkumar Natarajan
Nanyun Peng
64
43
0
26 May 2023
Automatic Emotion Experiencer Recognition
Automatic Emotion Experiencer Recognition
Maximilian Wegge
Roman Klinger
72
1
0
26 May 2023
RankCSE: Unsupervised Sentence Representations Learning via Learning to
  Rank
RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Wei Wu
Yunsen Xian
Dongyan Zhao
Kai Chen
Rui Yan
SSL
97
38
0
26 May 2023
Detect Any Shadow: Segment Anything for Video Shadow Detection
Detect Any Shadow: Segment Anything for Video Shadow Detection
Yonghui Wang
Wen-gang Zhou
Yunyao Mao
Houqiang Li
VLM
98
24
0
26 May 2023
TADA: Task-Agnostic Dialect Adapters for English
TADA: Task-Agnostic Dialect Adapters for English
William B. Held
Caleb Ziems
Diyi Yang
70
13
0
26 May 2023
Adversarial Multi-task Learning for End-to-end Metaphor Detection
Adversarial Multi-task Learning for End-to-end Metaphor Detection
Shenglong Zhang
Yang Liu
23
11
0
26 May 2023
Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for
  Financial Tasks
Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks
Agam Shah
Sudheer Chava
82
15
0
26 May 2023
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate
  Model
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
Yibo Miao
Hongcheng Gao
Hao Zhang
Zhijie Deng
DeLMO
78
20
0
26 May 2023
Discovering Novel Actions from Open World Egocentric Videos with
  Object-Grounded Visual Commonsense Reasoning
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning
Sanjoy Kundu
Shubham Trehan
Sathyanarayanan N. Aakur
LRMLM&Ro
73
3
0
26 May 2023
Neural Architecture Search for Parameter-Efficient Fine-tuning of Large
  Pre-trained Language Models
Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models
Neal Lawton
Anoop Kumar
Govind Thattai
Aram Galstyan
Greg Ver Steeg
47
19
0
26 May 2023
An Investigation of Noise in Morphological Inflection
An Investigation of Noise in Morphological Inflection
Adam Wiemerslage
Changbing Yang
Garrett Nicolai
Miikka Silfverberg
Katharina Kann
69
4
0
26 May 2023
Nichelle and Nancy: The Influence of Demographic Attributes and
  Tokenization Length on First Name Biases
Nichelle and Nancy: The Influence of Demographic Attributes and Tokenization Length on First Name Biases
Haozhe An
Rachel Rudinger
78
10
0
26 May 2023
Counterfactual reasoning: Testing language models' understanding of
  hypothetical scenarios
Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios
Jiaxuan Li
Lang-Chi Yu
Allyson Ettinger
LRMELM
65
27
0
26 May 2023
LANISTR: Multimodal Learning from Structured and Unstructured Data
LANISTR: Multimodal Learning from Structured and Unstructured Data
Sayna Ebrahimi
Sercan O. Arik
Yihe Dong
Tomas Pfister
57
4
0
26 May 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
Emergent Agentic Transformer from Chain of Hindsight Experience
Hao Liu
Pieter Abbeel
OffRL
93
29
0
26 May 2023
IMBERT: Making BERT Immune to Insertion-based Backdoor Attacks
IMBERT: Making BERT Immune to Insertion-based Backdoor Attacks
Xuanli He
Jun Wang
Benjamin I. P. Rubinstein
Trevor Cohn
SILM
73
14
0
25 May 2023
Prototype-Based Interpretability for Legal Citation Prediction
Prototype-Based Interpretability for Legal Citation Prediction
Chunyan Luo
R. Bhambhoria
Samuel Dahan
Xiao-Dan Zhu
ELMAILaw
104
7
0
25 May 2023
Don't Retrain, Just Rewrite: Countering Adversarial Perturbations by
  Rewriting Text
Don't Retrain, Just Rewrite: Countering Adversarial Perturbations by Rewriting Text
Ashim Gupta
Carter Blum
Temma Choji
Yingjie Fei
Shalin S Shah
Alakananda Vempala
Vivek Srikumar
AAML
62
9
0
25 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
113
37
0
25 May 2023
Previous
123...99100101...214215216
Next