RoBERTa: A Robustly Optimized BERT Pretraining Approach


26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
arXiv: 1907.11692

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,810 papers shown
The Life Cycle of Knowledge in Big Language Models: A Survey
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
95
29
0
14 Mar 2023
Input-length-shortening and text generation via attention values
Necset Ozkan Tan
A. Peng
Joshua Bensemann
Qiming Bao
Tim Hartill
M. Gahegan
Michael Witbrock
84
1
0
14 Mar 2023
Architext: Language-Driven Generative Architecture Design
Theodoros Galanos
Antonios Liapis
Georgios N. Yannakakis
VLM AI4CE
73
6
0
13 Mar 2023
AMOM: Adaptive Masking over Masking for Conditional Masked Language Model
Yisheng Xiao
Ruiyang Xu
Lijun Wu
Juntao Li
Tao Qin
Yan-Tie Liu
Hao Fei
44
9
0
13 Mar 2023
Model-tuning Via Prompts Makes NLP Models Adversarially Robust
Mrigank Raman
Pratyush Maini
J. Zico Kolter
Zachary Chase Lipton
Danish Pruthi
AAML
71
17
0
13 Mar 2023
Meet in the Middle: A New Pre-training Paradigm
A. Nguyen
Nikos Karampatziakis
Weizhu Chen
62
21
0
13 Mar 2023
Transformer-based approaches to Sentiment Detection
O. E. Ojo
Hoang Thang Ta
Alexander Gelbukh
Hiram Calvo
O. O. Adebanji
Grigori Sidorov
31
7
0
13 Mar 2023
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
Bo He
Jun Wang
Jielin Qiu
Trung Bui
Abhinav Shrivastava
Zhaowen Wang
91
71
0
13 Mar 2023
DPPMask: Masked Image Modeling with Determinantal Point Processes
Junde Xu
Zikai Lin
Donghao Zhou
Yao-Cheng Yang
Xiangyun Liao
Bian Wu
Guangyong Chen
Pheng-Ann Heng
86
1
0
13 Mar 2023
Addressing Biases in the Texts using an End-to-End Pipeline Approach
Shaina Raza
Syed Raza Bashir
Sneha
Urooj Qamar
57
0
0
13 Mar 2023
A Human Subject Study of Named Entity Recognition (NER) in Conversational Music Recommendation Queries
Elena V. Epure
Romain Hennequin
48
5
0
13 Mar 2023
LUKE-Graph: A Transformer-based Approach with Gated Relational Graph Attention for Cloze-style Reading Comprehension
Shima Foolad
Kourosh Kiani
39
3
0
12 Mar 2023
Improve Retrieval-based Dialogue System via Syntax-Informed Attention
Tengtao Song
Nuo Chen
Ji Jiang
Zhihong Zhu
Yuexian Zou
51
6
0
12 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A.B. Siddique
79
6
0
12 Mar 2023
Diffusion Models for Non-autoregressive Text Generation: A Survey
Yifan Li
Kun Zhou
Wayne Xin Zhao
Ji-Rong Wen
MedIm DiffM
116
36
0
12 Mar 2023
Compressed Heterogeneous Graph for Abstractive Multi-Document Summarization
Miao Li
Jianzhong Qi
Jey Han Lau
70
11
0
12 Mar 2023
Multimodal Data Integration for Oncology in the Era of Deep Neural Networks: A Review
Asim Waqas
Aakash Tripathi
Ravichandran Ramachandran
Paul Stewart
Ghulam Rasool
AI4CE
121
37
0
11 Mar 2023
Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation
Bing He
M. Ahamad
Srijan Kumar
OffRL
60
46
0
11 Mar 2023
Consistency Analysis of ChatGPT
Myeongjun Jang
Thomas Lukasiewicz
95
56
0
11 Mar 2023
Do large language models resemble humans in language use?
Zhenguang G. Cai
Xufeng Duan
David A. Haslett
Shuqi Wang
M. Pickering
ALM
127
41
0
10 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
95
47
0
10 Mar 2023
Detection of Abuse in Financial Transaction Descriptions Using Machine Learning
A. Leontjeva
Genevieve Richards
Kaavya Sriskandaraja
Jessica Perchman
L. Pizzato
13
0
0
10 Mar 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo
James R. Glass
NAI
59
7
0
10 Mar 2023
Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors
Mesut Erhan Unal
Adriana Kovashka
VLM
75
5
0
09 Mar 2023
Dynamic Stashing Quantization for Efficient Transformer Training
Guofu Yang
Daniel Lo
Robert D. Mullins
Yiren Zhao
MQ
88
8
0
09 Mar 2023
Detecting Images Generated by Diffusers
D. Coccomini
Andrea Esuli
Fabrizio Falchi
Claudio Gennaro
Giuseppe Amato
DiffM
88
15
0
09 Mar 2023
Can a Frozen Pretrained Language Model be used for Zero-shot Neural Retrieval on Entity-centric Questions?
Yasuto Hoshi
Daisuke Miyashita
Yasuhiro Morioka
Youyang Ng
Osamu Torii
J. Deguchi
58
0
0
09 Mar 2023
Multi-Stage Coarse-to-Fine Contrastive Learning for Conversation Intent Induction
Caiyuan Chu
Ya Li
Yifan Liu
Jia-Chen Gu
Quan Liu
Yongxin Ge
Guoping Hu
97
0
0
09 Mar 2023
Lexical Complexity Prediction: An Overview
Kai North
Marcos Zampieri
Matthew Shardlow
63
26
0
08 Mar 2023
RAF: Holistic Compilation for Deep Learning Model Training
Cody Hao Yu
Haozheng Fan
Guangtai Huang
Zhen Jia
Yizhi Liu
...
Yuan Zhou
Haichen Shen
Junru Shao
Mu Li
Yida Wang
72
3
0
08 Mar 2023
Extrapolative Controlled Sequence Generation via Iterative Refinement
Vishakh Padmakumar
Richard Yuanzhe Pang
He He
Ankur P. Parikh
82
10
0
08 Mar 2023
Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search
Guanshuo Wang
Fufu Yu
Jianing Li
Qiong Jia
Shouhong Ding
66
18
0
08 Mar 2023
Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Ruixiang Tang
Xiaotian Han
Xiaoqian Jiang
Helen Zhou
LM&MA AI4MH SyDa
104
186
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
120
554
0
07 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
81
122
0
07 Mar 2023
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification
Taja Kuzman
I. Mozetič
Nikola Ljubesic
114
94
0
07 Mar 2023
A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course Summarization
Griffin Adams
Jason Zucker
Noémie Elhadad
93
23
0
07 Mar 2023
A Challenging Benchmark for Low-Resource Learning
Yudong Wang
Chang Ma
Qingxiu Dong
Lingpeng Kong
Jingjing Xu
88
4
0
07 Mar 2023
German BERT Model for Legal Named Entity Recognition
Harsh Darji
Jelena Mitrović
Michael Granitzer
AILaw
31
14
0
07 Mar 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
53
28
0
07 Mar 2023
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models
Jinjie Ni
Yukun Ma
Wen Wang
Qian Chen
Dianwen Ng
Han Lei
Trung Hieu Nguyen
Chong Zhang
B. Ma
Min Zhang
43
2
0
07 Mar 2023
ADELT: Transpilation Between Deep Learning Frameworks
Linyuan Gong
Jiayi Wang
Alvin Cheung
59
3
0
07 Mar 2023
Two-stage Pipeline for Multilingual Dialect Detection
Ankit Vaidya
Aditya Kane
80
5
0
06 Mar 2023
Depression Detection Using Digital Traces on Social Media: A Knowledge-aware Deep Learning Approach
Wenli Zhang
Jiaheng Xie
Zhuocheng Zhang
Xiang Liu
76
10
0
06 Mar 2023
Referring Multi-Object Tracking
Dongming Wu
Wencheng Han
Tiancai Wang
Xingping Dong
Xiangyu Zhang
Jianbing Shen
114
80
0
06 Mar 2023
AmQA: Amharic Question Answering Dataset
Tilahun Abedissa
Ricardo Usbeck
Yaregal Assabie
71
1
0
06 Mar 2023
SC-Block: Supervised Contrastive Blocking within Entity Resolution Pipelines
Alexander Brinkmann
Roee Shraga
Christian Bizer
91
10
0
06 Mar 2023
IFAN: An Explainability-Focused Interaction Framework for Humans and NLP Models
Edoardo Mosca
Daryna Dementieva
Tohid Ebrahim Ajdari
Maximilian Kummeth
Kirill Gringauz
Yutong Zhou
Georg Groh
100
8
0
06 Mar 2023
Dynamic Prompting: A Unified Framework for Prompt Tuning
Xianjun Yang
Wei Cheng
Xujiang Zhao
Wenchao Yu
Linda R. Petzold
Haifeng Chen
VLM
119
16
0
06 Mar 2023
A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification
Xiang Hu
Xinyu Kong
Kewei Tu
MILM BDL
67
5
0
06 Mar 2023