RoBERTa: A Robustly Optimized BERT Pretraining Approach


26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,811 papers shown
IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research
Arpandeep Khatua
Vikram Sharma Mailthody
Bhagyashree Taleka
Tengfei Ma
Xiang Song
Wen-mei W. Hwu
AI4CE
111
39
0
27 Feb 2023
Prompt-based Learning for Text Readability Assessment
Bruce W. Lee
J. Lee
VLM
67
13
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Quanzheng Li
Dinggang Shen
Tianming Liu
Xiang Li
139
161
0
25 Feb 2023
Pre-Finetuning for Few-Shot Emotional Speech Recognition
Maximillian Chen
Zhou Yu
75
4
0
24 Feb 2023
Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Kashun Shum
Shizhe Diao
Tong Zhang
ReLM LRM
119
138
0
24 Feb 2023
Modelling Temporal Document Sequences for Clinical ICD Coding
Clarence Boon Liang Ng
Diogo Santos
Marek Rei
64
8
0
24 Feb 2023
In-Depth Look at Word Filling Societal Bias Measures
Matúš Pikuliak
Ivana Benová
Viktor Bachratý
99
9
0
24 Feb 2023
VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining
Ramon Ruiz-Dolz
Javier Iranzo-Sánchez
36
3
0
24 Feb 2023
Deep Learning for Video-Text Retrieval: a Review
Cunjuan Zhu
Qi Jia
Wei Chen
Yanming Guo
Yu Liu
77
18
0
24 Feb 2023
Dual Path Modeling for Semantic Matching by Perceiving Subtle Conflicts
Chao Xue
Di Liang
Sirui Wang
Wei Wu
Jing Zhang
70
9
0
24 Feb 2023
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views
Katerina Margatina
Shuai Wang
Yogarshi Vyas
Neha Ann John
Yassine Benajiba
Miguel Ballesteros
68
17
0
23 Feb 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework
Paul Pu Liang
Yun Cheng
Xiang Fan
Chun Kai Ling
Suzanne Nie
...
Nicholas B. Allen
Randy P. Auerbach
Faisal Mahmood
Ruslan Salakhutdinov
Louis-Philippe Morency
116
37
0
23 Feb 2023
MCWDST: a Minimum-Cost Weighted Directed Spanning Tree Algorithm for Real-Time Fake News Mitigation in Social Media
Ciprian-Octavian Truică
Elena Simona Apostol
Radu-Cătălin Nicolescu
Panagiotis Karras
GNN
59
27
0
23 Feb 2023
KHAN: Knowledge-Aware Hierarchical Attention Networks for Accurate Political Stance Prediction
Yunyong Ko
Seongeun Ryu
Soeun Han
Youngseung Jeon
Jaehoon Kim
Sohyun Park
Kyungsik Han
Hanghang Tong
Sang-Wook Kim
117
15
0
23 Feb 2023
Data leakage in cross-modal retrieval training: A case study
Benno Weck
Xavier Serra
61
7
0
23 Feb 2023
Coarse-to-Fine Knowledge Selection for Document Grounded Dialogs
Yeqin Zhang
Haomin Fu
Cheng Fu
Haiyang Yu
Yongbin Li
Cam-Tu Nguyen
116
9
0
23 Feb 2023
FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering
Qichen Ye
Bowen Cao
Nuo Chen
Weiyuan Xu
Yuexian Zou
73
18
0
23 Feb 2023
Guiding Large Language Models via Directional Stimulus Prompting
Zekun Li
Baolin Peng
Pengcheng He
Michel Galley
Jianfeng Gao
Xi Yan
LLMAG LRM LM&Ro
136
101
0
22 Feb 2023
Topic-switch adapted Japanese Dialogue System based on PLATO-2
Donghuo Zeng
Jianming Wu
Yanan Wang
Kazunori Matsumoto
Gen Hattori
K. Ikeda
81
0
0
22 Feb 2023
Connecting Vision and Language with Video Localized Narratives
P. Voigtlaender
Soravit Changpinyo
Jordi Pont-Tuset
Radu Soricut
V. Ferrari
VGen
143
23
0
22 Feb 2023
DNG: Taxonomy Expansion by Exploring the Intrinsic Directed Structure on Non-gaussian Space
Songlin Zhai
Weiqing Wang
Yuanfa Li
Yuan Meng
96
6
0
22 Feb 2023
Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks
Bowen Jin
Yu Zhang
Yu Meng
Jiawei Han
97
31
0
21 Feb 2023
In-context Example Selection with Influences
Nguyen Tai
Eric Wong
94
54
0
21 Feb 2023
Learning to Retrieve Engaging Follow-Up Queries
Christopher Richardson
Sudipta Kar
Anjishnu Kumar
Anand Ramachandran
O. Khan
Zeynab Raeesy
A. Sethy
46
2
0
21 Feb 2023
Generic Dependency Modeling for Multi-Party Conversation
Weizhou Shen
Xiaojun Quan
Ke Yang
71
4
0
21 Feb 2023
Playing the Werewolf game with artificial intelligence for language understanding
Hisaichi Shibata
S. Miki
Yuta Nakamura
LLMAG
60
12
0
21 Feb 2023
Exploring the Limits of Transfer Learning with Unified Model in the Cybersecurity Domain
Kuntal Kumar Pal
Kazuaki Kashihara
Ujjwala Anantheswaran
Kirby Kuznia
S. Jagtap
Chitta Baral
AAML
34
3
0
20 Feb 2023
Hashtag-Guided Low-Resource Tweet Classification
Shizhe Diao
Sedrick Scott Keh
Liangming Pan
Zhiliang Tian
Yan Song
Tong Zhang
VLM AI4TS
65
7
0
20 Feb 2023
Can discrete information extraction prompts generalize across language models?
Nathanaël Carraz Rakotonirina
Roberto Dessì
Fabio Petroni
Sebastian Riedel
Marco Baroni
72
8
0
20 Feb 2023
Knowledge-aware Bayesian Co-attention for Multimodal Emotion Recognition
Zihan Zhao
Yu Wang
Yanfeng Wang
63
18
0
20 Feb 2023
Unsupervised Layer-wise Score Aggregation for Textual OOD Detection
Maxime Darrin
Guillaume Staerman
Eduardo Dadalto Camara Gomes
Jackie CK Cheung
Pablo Piantanida
Pierre Colombo
OODD
451
12
0
20 Feb 2023
What happens before and after: Multi-Event Commonsense in Event Coreference Resolution
Sahithya Ravi
Christy Tanner
R. Ng
Vered Shwartz
76
19
0
20 Feb 2023
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization
Kayhan Behdin
Qingquan Song
Aman Gupta
S. Keerthi
Ayan Acharya
Borja Ocejo
Gregory Dexter
Rajiv Khanna
D. Durfee
Rahul Mazumder
AAML
71
7
0
19 Feb 2023
Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages
Ankan Mullick
Ishani Mondal
Sourjyadip Ray
R. Raghav
G. Chaitanya
Pawan Goyal
63
13
0
19 Feb 2023
Evaluating the Effectiveness of Pre-trained Language Models in Predicting the Helpfulness of Online Product Reviews
Ali Boluki
Javad Pourmostafa Roshan Sharami
D. Shterionov
54
1
0
19 Feb 2023
Multilingual Content Moderation: A Case Study on Reddit
Meng Ye
Karan Sikka
Katherine Atwell
Sabit Hassan
Ajay Divakaran
Malihe Alikhani
AI4MH
42
7
0
19 Feb 2023
Language-Specific Representation of Emotion-Concept Knowledge Causally Supports Emotion Inference
Ming Li
Yusheng Su
Hsiu-Yuan Huang
Jiali Cheng
Xin Hu
...
Yujia Qin
Xiaozhi Wang
Kristen A. Lindquist
Zhi-Yun Liu
Dan Zhang
74
6
0
19 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
136
245
0
19 Feb 2023
Few-shot Multimodal Multitask Multilingual Learning
Aman Chadha
Vinija Jain
125
0
0
19 Feb 2023
Learning Language Representations with Logical Inductive Bias
Jianshu Chen
NAI AI4CE LRM
53
3
0
19 Feb 2023
BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark
Dakuan Lu
Hengkui Wu
Jiaqing Liang
Yipei Xu
Qi He
Yipeng Geng
Mengkun Han
Ying Xin
Yanghua Xiao
94
62
0
18 Feb 2023
Improving the Out-Of-Distribution Generalization Capability of Language Models: Counterfactually-Augmented Data is not Enough
Caoyun Fan
Wenqing Chen
Jidong Tian
Yitian Li
Hao He
Yaohui Jin
CML OODD
54
3
0
18 Feb 2023
Optimising Human-Machine Collaboration for Efficient High-Precision Information Extraction from Text Documents
Bradley Butcher
Miri Zilka
Darren Cook
Jiri Hron
Adrian Weller
73
4
0
18 Feb 2023
Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements
Jiawen Deng
Jiale Cheng
Hao Sun
Zhexin Zhang
Minlie Huang
LM&MA ELM
95
17
0
18 Feb 2023
Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE
Qihuang Zhong
Liang Ding
Keqin Peng
Juhua Liu
Bo Du
Li Shen
Yibing Zhan
Dacheng Tao
VLM
81
13
0
18 Feb 2023
A Federated Approach for Hate Speech Detection
Jay Gala
Deep Gandhi
Jash Mehta
Zeerak Talat
58
4
0
18 Feb 2023
Scalable Prompt Generation for Semi-supervised Learning with Language Models
Yuhang Zhou
Suraj Maharjan
Bei Liu
VLM
94
14
0
18 Feb 2023
Transformadores: Fundamentos teoricos y Aplicaciones
J. D. L. Torre
173
0
0
18 Feb 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
214
16
0
17 Feb 2023
Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Philippe Laban
Chien-Sheng Wu
Lidiya Murakhovs'ka
Xiang 'Anthony' Chen
Caiming Xiong
45
8
0
17 Feb 2023