RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,814 papers shown
Show me your NFT and I tell you how it will perform: Multimodal representation learning for NFT selling price prediction
Davide Costa
Lucio La Cava
Andrea Tagarelli
64
23
0
03 Feb 2023
Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval
Shunyu Zhang
Yaobo Liang
Ming Gong
Daxin Jiang
Nan Duan
83
4
0
03 Feb 2023
Bioformer: an efficient transformer language model for biomedical text mining
Li Fang
Qingyu Chen
Chih-Hsuan Wei
Zhiyong Lu
Kai Wang
MedIm, AI4CE
65
22
0
03 Feb 2023
Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN
Ziyi Chen
Ren Yang
S. Fu
Nansu Zong
Hongfang Liu
Ming Huang
AI4MH
46
14
0
03 Feb 2023
Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective
Jongwoo Ko
Seungjoon Park
Minchan Jeong
S. Hong
Euijai Ahn
Duhyeuk Chang
Se-Young Yun
67
6
0
03 Feb 2023
Self-Supervised Relation Alignment for Scene Graph Generation
Bicheng Xu
Renjie Liao
Leonid Sigal
75
0
0
02 Feb 2023
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense
Zunzhi You
Daochang Liu
Bohyung Han
Chang Xu
AAML, VLM
119
5
0
02 Feb 2023
Curriculum-Guided Abstractive Summarization
Sajad Sotudeh
Hanieh Deilamsalehy
Franck Dernoncourt
Nazli Goharian
88
2
0
02 Feb 2023
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment
Hao Liu
Wilson Yan
Pieter Abbeel
99
25
0
02 Feb 2023
How to choose "Good" Samples for Text Data Augmentation
Xiaotian Lin
Nankai Lin
Yingwen Fu
Ziyu Yang
Shengyi Jiang
86
2
0
02 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViT, MedIm, AI4TS, AI4CE
112
10
0
01 Feb 2023
Analyzing Leakage of Personally Identifiable Information in Language Models
Nils Lukas
A. Salem
Robert Sim
Shruti Tople
Lukas Wutschitz
Santiago Zanella Béguelin
PILM
201
235
0
01 Feb 2023
Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps
Goro Kobayashi
Tatsuki Kuribayashi
Sho Yokoi
Kentaro Inui
114
16
0
01 Feb 2023
Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection
Chenglong Wang
Yi Lu
Yongyu Mu
Yimin Hu
Tong Xiao
Jingbo Zhu
97
9
0
01 Feb 2023
On the Role of Morphological Information for Contextual Lemmatization
Olia Toporkov
Rodrigo Agerri
61
9
0
01 Feb 2023
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Haiyang Xu
Qinghao Ye
Mingshi Yan
Yaya Shi
Jiabo Ye
...
Guohai Xu
Ji Zhang
Songfang Huang
Feiran Huang
Jingren Zhou
MLLM, VLM, MoE
123
171
0
01 Feb 2023
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
104
32
0
01 Feb 2023
An Evaluation of Persian-English Machine Translation Datasets with Transformers
A. Sartipi
Meghdad Dehghan
A. Fatemi
69
3
0
01 Feb 2023
Filtering Context Mitigates Scarcity and Selection Bias in Political Ideology Prediction
Chen Chen
D. Walker
Venkatesh Saligrama
32
0
0
01 Feb 2023
The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models
Son Quoc Tran
Phong Nguyen-Thuan Do
Uyen Le
Matt Kretchmar
ELM, AAML
86
8
0
31 Jan 2023
In-Context Retrieval-Augmented Language Models
Ori Ram
Yoav Levine
Itay Dalmedigos
Dor Muhlgay
Amnon Shashua
Kevin Leyton-Brown
Y. Shoham
KELM, RALM, LRM
119
616
0
31 Jan 2023
PADL: Language-Directed Physics-Based Character Control
Jordan Juravsky
Yunrong Guo
Sanja Fidler
Xue Bin Peng
87
45
0
31 Jan 2023
Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation
Xiang Lin
Prathyusha Jwalapuram
Shafiq Joty
DiffM
63
0
0
31 Jan 2023
Zero-shot cross-lingual transfer language selection using linguistic similarity
J. Eronen
M. Ptaszynski
Fumito Masui
100
38
0
31 Jan 2023
Recursive Neural Networks with Bottlenecks Diagnose (Non-)Compositionality
Verna Dankers
Ivan Titov
88
2
0
31 Jan 2023
What Makes Good Examples for Visual In-Context Learning?
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
MLLM, VPVLM, VLM, LRM
106
117
0
31 Jan 2023
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Dongchao Yang
Songxiang Liu
Rongjie Huang
Chao Weng
Helen Meng
DiffM, VLM
89
102
0
31 Jan 2023
Sentence Identification with BOS and EOS Label Combinations
Takuma Udagawa
H. Kanayama
Issei Yoshida
52
2
0
31 Jan 2023
Differentiable Entailment for Parameter Efficient Few Shot Learning
Ethan Kim
Jerry Yang
55
0
0
31 Jan 2023
MILO: Model-Agnostic Subset Selection Framework for Efficient Model Training and Tuning
Krishnateja Killamsetty
A. Evfimievski
Tejaswini Pedapati
K. Kate
Lucian Popa
Rishabh K. Iyer
67
8
0
30 Jan 2023
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
Joel Niklaus
Veton Matoshi
Pooja Rani
Andrea Galassi
Matthias Sturmer
Ilias Chalkidis
ELM, AILaw
100
60
0
30 Jan 2023
Representation biases in sentence transformers
Dmitry Nikolaev
Sebastian Padó
65
8
0
30 Jan 2023
ContCommRTD: A Distributed Content-based Misinformation-aware Community Detection System for Real-Time Disaster Reporting
Elena Simona Apostol
Ciprian-Octavian Truică
Adrian Paschke
85
20
0
30 Jan 2023
Quantifying Context Mixing in Transformers
Hosein Mohebbi
Willem H. Zuidema
Grzegorz Chrupała
Afra Alishahi
228
28
0
30 Jan 2023
On student-teacher deviations in distillation: does it pay to disobey?
Vaishnavh Nagarajan
A. Menon
Srinadh Bhojanapalli
H. Mobahi
Surinder Kumar
137
10
0
30 Jan 2023
Active Learning for Multilingual Semantic Parser
Zhuang Li
Gholamreza Haffari
69
6
0
30 Jan 2023
On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex
Terry Yue Zhuo
Zhuang Li
Yujin Huang
Fatemeh Shiri
Weiqing Wang
Gholamreza Haffari
Yuan-Fang Li
AAML
109
57
0
30 Jan 2023
GE-Blender: Graph-Based Knowledge Enhancement for Blender
Xiaolei Lian
Xunzhu Tang
Yue Wang
76
2
0
30 Jan 2023
EDSA-Ensemble: an Event Detection Sentiment Analysis Ensemble Architecture
A. Petrescu
Ciprian-Octavian Truică
Elena Simona Apostol
Adrian Paschke
77
12
0
30 Jan 2023
Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features
Sishuo Chen
Wenkai Yang
Xiaohan Bi
Xu Sun
OODD
66
15
0
30 Jan 2023
Evaluating Neuron Interpretation Methods of NLP Models
Yimin Fan
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
82
8
0
30 Jan 2023
Schema-Guided Semantic Accuracy: Faithfulness in Task-Oriented Dialogue Response Generation
Jinghong Chen
Weizhe Lin
Bill Byrne
46
1
0
29 Jan 2023
EMP-EVAL: A Framework for Measuring Empathy in Open Domain Dialogues
Bushra Amjad
M. Zeeshan
M. O. Beg
60
1
0
29 Jan 2023
Multi-video Moment Ranking with Multimodal Clue
Danyang Hou
Liang Pang
Yanyan Lan
Huawei Shen
Xueqi Cheng
55
1
0
29 Jan 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Haohe Liu
Zehua Chen
Yiitan Yuan
Xinhao Mei
Xubo Liu
Danilo Mandic
Wenwu Wang
Mark D. Plumbley
DiffM
187
510
0
29 Jan 2023
Large Language Models for Biomedical Knowledge Graph Construction: Information extraction from EMR notes
Vahan Arsenyan
Spartak Bughdaryan
Fadi Shaya
Kent Small
Davit Shahnazaryan
81
13
0
29 Jan 2023
Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data
Jang-Hyun Kim
Sangdoo Yun
Hyun Oh Song
85
19
0
29 Jan 2023
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization
Potsawee Manakul
Adian Liusie
Mark Gales
HILM
87
36
0
28 Jan 2023
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets
Tosin Adewumi
Isabella Sodergren
Lama Alkhaled
Sana Sabah Sabry
F. Liwicki
Marcus Liwicki
73
4
0
28 Jan 2023
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning
Han Zhou
Xingchen Wan
Ivan Vulić
Anna Korhonen
85
48
0
28 Jan 2023