ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,803 papers shown
Title
Convex Dual Theory Analysis of Two-Layer Convolutional Neural Networks
  with Soft-Thresholding
Convex Dual Theory Analysis of Two-Layer Convolutional Neural Networks with Soft-Thresholding
Chunyan Xiong
Meng Lu
Xiaotong Yu
JIAN-PENG Cao
Zhong Chen
D. Guo
X. Qu
MLT
124
0
0
14 Apr 2023
HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion
  Recognition
HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion Recognition
Soumya Dutta
Sriram Ganapathy
117
18
0
14 Apr 2023
Evaluation of Social Biases in Recent Large Pre-Trained Models
Evaluation of Social Biases in Recent Large Pre-Trained Models
Swapnil Sharma
Nikita Anand
V. KranthiKiranG.
Alind Jain
52
0
0
13 Apr 2023
Vax-Culture: A Dataset for Studying Vaccine Discourse on Twitter
Vax-Culture: A Dataset for Studying Vaccine Discourse on Twitter
M. Zarei
M. Christensen
S. Everts
Majid Komeili
39
1
0
13 Apr 2023
SemEval-2023 Task 12: Sentiment Analysis for African Languages
  (AfriSenti-SemEval)
SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
Shamsuddeen Hassan Muhammad
Idris Abdulmumin
Seid Muhie Yimam
David Ifeoluwa Adelani
Ibrahim Said Ahmad
N. Ousidhoum
Abinew Ali Ayele
Saif M. Mohammad
Meriem Beloucif
Sebastian Ruder
80
70
0
13 Apr 2023
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Ashraf Haddad
N. Aaraj
Preslav Nakov
Septimiu Fabian Mare
48
6
0
13 Apr 2023
Graph2topic: an opensource topic modeling framework based on sentence
  embedding and community detection
Graph2topic: an opensource topic modeling framework based on sentence embedding and community detection
Leihan Zhang
Jiapeng Liu
Qiang Yan
80
1
0
13 Apr 2023
PGTask: Introducing the Task of Profile Generation from Dialogues
PGTask: Introducing the Task of Profile Generation from Dialogues
Rui Ribeiro
Joao Paulo Carvalho
Luísa Coheur
37
1
0
13 Apr 2023
A-CAP: Anticipation Captioning with Commonsense Knowledge
A-CAP: Anticipation Captioning with Commonsense Knowledge
D. Vo
Quoc-An Luong
Akihiro Sugimoto
Hideki Nakayama
70
1
0
13 Apr 2023
Computational modeling of semantic change
Computational modeling of semantic change
Nina Tahmasebi
Haim Dubossarsky
108
6
0
13 Apr 2023
Can Large Language Models Transform Computational Social Science?
Can Large Language Models Transform Computational Social Science?
Caleb Ziems
William B. Held
Omar Shaikh
Jiaao Chen
Zhehao Zhang
Diyi Yang
LLMAG
133
322
0
12 Apr 2023
Learning Homographic Disambiguation Representation for Neural Machine
  Translation
Learning Homographic Disambiguation Representation for Neural Machine Translation
Weixuan Wang
Wei Peng
Qun Liu
52
0
0
12 Apr 2023
Measuring Normative and Descriptive Biases in Language Models Using
  Census Data
Measuring Normative and Descriptive Biases in Language Models Using Census Data
Samia Touileb
Lilja Ovrelid
Erik Velldal
88
4
0
12 Apr 2023
Global Prompt Cell: A Portable Control Module for Effective Prompt
  Tuning
Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Chi-Liang Liu
Hao Wang
Nuwa Xi
Sendong Zhao
Bing Qin
VLM
79
1
0
12 Apr 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large
  Language Models in Multilingual Learning
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELMLM&MA
69
291
0
12 Apr 2023
MoMo: A shared encoder Model for text, image and multi-Modal
  representations
MoMo: A shared encoder Model for text, image and multi-Modal representations
Rakesh Chada
Zhao-Heng Zheng
P. Natarajan
ViT
64
4
0
11 Apr 2023
L3MVN: Leveraging Large Language Models for Visual Target Navigation
L3MVN: Leveraging Large Language Models for Visual Target Navigation
Bangguo Yu
Hamidreza Kasaei
M. Cao
LM&Ro
118
101
0
11 Apr 2023
Zero-shot Temporal Relation Extraction with ChatGPT
Zero-shot Temporal Relation Extraction with ChatGPT
Chenhan Yuan
Qianqian Xie
Sophia Ananiadou
101
85
0
11 Apr 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image
  Models
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
84
79
0
11 Apr 2023
Prompt Learning for News Recommendation
Prompt Learning for News Recommendation
Zizhuo Zhang
Bang-wei Wang
AI4TS
73
66
0
11 Apr 2023
Towards Efficient Fine-tuning of Pre-trained Code Models: An
  Experimental Study and Beyond
Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
Ensheng Shi
Yanlin Wang
Hongyu Zhang
Lun Du
Shi Han
Dongmei Zhang
Hongbin Sun
89
45
0
11 Apr 2023
Improving Vision-and-Language Navigation by Generating Future-View Image
  Semantics
Improving Vision-and-Language Navigation by Generating Future-View Image Semantics
Jialu Li
Joey Tianyi Zhou
97
37
0
11 Apr 2023
DISTO: Evaluating Textual Distractors for Multi-Choice Questions using
  Negative Sampling based Approach
DISTO: Evaluating Textual Distractors for Multi-Choice Questions using Negative Sampling based Approach
Bilal Ghanem
Alona Fyshe
61
4
0
10 Apr 2023
Scallop: A Language for Neurosymbolic Programming
Scallop: A Language for Neurosymbolic Programming
Ziyang Li
Jiani Huang
Mayur Naik
ReLMLRMNAI
100
34
0
10 Apr 2023
Exploring Effective Factors for Improving Visual In-Context Learning
Exploring Effective Factors for Improving Visual In-Context Learning
Yanpeng Sun
Qiang Chen
Jian Wang
Jingdong Wang
Zechao Li
LRMVLM
106
26
0
10 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
88
21
0
10 Apr 2023
Uncertainty-Aware Natural Language Inference with Stochastic Weight
  Averaging
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging
Aarne Talman
H. Çelikkanat
Sami Virpioja
Markus Heinonen
Jörg Tiedemann
BDLUQCV
84
8
0
10 Apr 2023
SELFormer: Molecular Representation Learning via SELFIES Language Models
SELFormer: Molecular Representation Learning via SELFIES Language Models
Atakan Yüksel
Erva Ulusoy
Atabey Ünlü
Tunca Dogan
101
61
0
10 Apr 2023
UATTA-EB: Uncertainty-Aware Test-Time Augmented Ensemble of BERTs for
  Classifying Common Mental Illnesses on Social Media Posts
UATTA-EB: Uncertainty-Aware Test-Time Augmented Ensemble of BERTs for Classifying Common Mental Illnesses on Social Media Posts
Pratinav Seth
Mihir Agarwal
AI4MH
64
1
0
10 Apr 2023
OpenAGI: When LLM Meets Domain Experts
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge
Wenyue Hua
Kai Mei
Jianchao Ji
Juntao Tan
Shuyuan Xu
Zelong Li
Yongfeng Zhang
VLMLRM
122
232
0
10 Apr 2023
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for
  Medical domain
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain
Yanis Labrak
Adrien Bazoge
Richard Dufour
Mickael Rouvier
Emmanuel Morin
B. Daille
P. Gourraud
50
31
0
09 Apr 2023
Are Large Language Models Ready for Healthcare? A Comparative Study on
  Clinical Language Understanding
Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language Understanding
Yuqing Wang
Yun Zhao
Linda R. Petzold
AI4MHLM&MAELM
95
53
0
09 Apr 2023
Team QUST at SemEval-2023 Task 3: A Comprehensive Study of Monolingual
  and Multilingual Approaches for Detecting Online News Genre, Framing and
  Persuasion Techniques
Team QUST at SemEval-2023 Task 3: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting Online News Genre, Framing and Persuasion Techniques
Ye Jiang
71
10
0
09 Apr 2023
Continual Graph Convolutional Network for Text Classification
Continual Graph Convolutional Network for Text Classification
Tiandeng Wu
Qijiong Liu
Yinhao Cao
yao. huang
Xiao-Ming Wu
Jiandong Ding
GNN
74
10
0
09 Apr 2023
Multi-class Categorization of Reasons behind Mental Disturbance in Long
  Texts
Multi-class Categorization of Reasons behind Mental Disturbance in Long Texts
Muskan Garg
AI4MH
47
2
0
08 Apr 2023
Unsupervised Story Discovery from Continuous News Streams via Scalable
  Thematic Embedding
Unsupervised Story Discovery from Continuous News Streams via Scalable Thematic Embedding
Susik Yoon
Dongha Lee
Yunyi Zhang
Jiawei Han
AI4TSAIFin
147
8
0
08 Apr 2023
tmn at SemEval-2023 Task 9: Multilingual Tweet Intimacy Detection using
  XLM-T, Google Translate, and Ensemble Learning
tmn at SemEval-2023 Task 9: Multilingual Tweet Intimacy Detection using XLM-T, Google Translate, and Ensemble Learning
Anna Glazkova
49
1
0
08 Apr 2023
Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for
  NLP
Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP
Lama Alkhaled
Tosin Adewumi
Sana Sabah Sabry
86
8
0
08 Apr 2023
SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers
SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers
Alberto Marchisio
David Durà
Maurizio Capra
Maurizio Martina
Guido Masera
Mohamed Bennai
94
22
0
08 Apr 2023
ASPEST: Bridging the Gap Between Active Learning and Selective
  Prediction
ASPEST: Bridging the Gap Between Active Learning and Selective Prediction
Jiefeng Chen
Jinsung Yoon
Sayna Ebrahimi
Sercan O. Arik
S. Jha
Tomas Pfister
115
1
0
07 Apr 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for
  High-Fidelity Text-to-Image Synthesis
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe Lin
Yang Zhang
Shiyu Chang
DiffM
97
46
0
07 Apr 2023
Language Models are Causal Knowledge Extractors for Zero-shot Video
  Question Answering
Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering
Hung-Ting Su
Yulei Niu
Xudong Lin
Winston H. Hsu
Shih-Fu Chang
VGenELM
112
6
0
07 Apr 2023
Probing Conceptual Understanding of Large Visual-Language Models
Probing Conceptual Understanding of Large Visual-Language Models
Madeline Chantry Schiappa
Raiyaan Abdullah
Shehreen Azad
Jared Claypoole
Michael Cogswell
Ajay Divakaran
Yogesh S Rawat
81
16
0
07 Apr 2023
Revisiting Automated Prompting: Are We Actually Doing Better?
Revisiting Automated Prompting: Are We Actually Doing Better?
Yulin Zhou
Yiren Zhao
Ilia Shumailov
Robert D. Mullins
Y. Gal
117
8
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature
  Review
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
105
43
0
07 Apr 2023
SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism
  using Majority Voted Fine-Tuned Transformers
SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism using Majority Voted Fine-Tuned Transformers
Sriya Rallabandi
Sanchit Singhal
Pratinav Seth
21
3
0
07 Apr 2023
Architecture-Preserving Provable Repair of Deep Neural Networks
Architecture-Preserving Provable Repair of Deep Neural Networks
Zhe Tao
Stephanie Nawas
Jacqueline Mitchell
Aditya V. Thakur
AAML
64
11
0
07 Apr 2023
Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
Hanmeng Liu
Ruoxi Ning
Zhiyang Teng
Jian Liu
Qiji Zhou
Yuexin Zhang
ELMReLMLRM
125
258
0
07 Apr 2023
Towards Corpus-Scale Discovery of Selection Biases in News Coverage:
  Comparing What Sources Say About Entities as a Start
Towards Corpus-Scale Discovery of Selection Biases in News Coverage: Comparing What Sources Say About Entities as a Start
Sihao Chen
William F. Bruno
Dan Roth
68
1
0
06 Apr 2023
Deep Learning for Opinion Mining and Topic Classification of Course
  Reviews
Deep Learning for Opinion Mining and Topic Classification of Course Reviews
Anna Koufakou
69
20
0
06 Apr 2023
Previous
123...111112113...215216217
Next