ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,802 papers shown
Title
UNTER: A Unified Knowledge Interface for Enhancing Pre-trained Language
  Models
UNTER: A Unified Knowledge Interface for Enhancing Pre-trained Language Models
Deming Ye
Yankai Lin
Zhengyan Zhang
Maosong Sun
KELM
58
0
0
02 May 2023
FreeLM: Fine-Tuning-Free Language Model
FreeLM: Fine-Tuning-Free Language Model
Xiang Li
Xin Jiang
Xuying Meng
Aixin Sun
Yequan Wang
84
3
0
02 May 2023
How to Unleash the Power of Large Language Models for Few-shot Relation
  Extraction?
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?
Xin Xu
Yuqi Zhu
Xiaohan Wang
Ningyu Zhang
KELMLRM
131
55
0
02 May 2023
Huatuo-26M, a Large-scale Chinese Medical QA Dataset
Huatuo-26M, a Large-scale Chinese Medical QA Dataset
Jianquan Li
Xidong Wang
Xiangbo Wu
Zhiyi Zhang
Xiaolong Xu
Jie Fu
Prayag Tiwari
Xiang Wan
Benyou Wang
LM&MA
162
51
0
02 May 2023
BrainNPT: Pre-training of Transformer networks for brain network
  classification
BrainNPT: Pre-training of Transformer networks for brain network classification
Jinlong Hu
Ya-Lin Huang
Nan Wang
Shoubin Dong
ViTMedIm
107
8
0
02 May 2023
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in
  Language Models
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Shuai Zhao
Jinming Wen
Anh Tuan Luu
Jiaqi Zhao
Jie Fu
SILM
176
99
0
02 May 2023
Read it Twice: Towards Faithfully Interpretable Fact Verification by
  Revisiting Evidence
Read it Twice: Towards Faithfully Interpretable Fact Verification by Revisiting Evidence
Xuming Hu
Zhaochen Hong
Zhijiang Guo
Lijie Wen
Philip S. Yu
HILM
128
17
0
02 May 2023
Lessons Learned in ATCO2: 5000 hours of Air Traffic Control
  Communications for Robust Automatic Speech Recognition and Understanding
Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Juan Pablo Zuluaga
Iuliia Nigmatulina
Amrutha Prasad
P. Motlícek
Driss Khalil
...
Allan Tart
Igor Szöke
Vincent Lenders
M. Rigault
K. Choukri
61
2
0
02 May 2023
ADVISE: AI-accelerated Design of Evidence Synthesis for Global
  Development
ADVISE: AI-accelerated Design of Evidence Synthesis for Global Development
Kristen M. Edwards
Binyang Song
J. Porciello
M. Engelbert
Carolyn Huang
Faez Ahmed
17
2
0
02 May 2023
ArK: Augmented Reality with Knowledge Interactive Emergent Ability
ArK: Augmented Reality with Knowledge Interactive Emergent Ability
Qiuyuan Huang
Jinho Park
Abhinav Gupta
Paul N. Bennett
Ran Gong
...
Baolin Peng
O. Mohammed
C. Pal
Yejin Choi
Jianfeng Gao
122
6
0
01 May 2023
Redundancy and Concept Analysis for Code-trained Language Models
Redundancy and Concept Analysis for Code-trained Language Models
Arushi Sharma
Zefu Hu
Christopher Quinn
Ali Jannesari
149
2
0
01 May 2023
Multimodal Graph Transformer for Multimodal Question Answering
Multimodal Graph Transformer for Multimodal Question Answering
Xuehai He
Xin Eric Wang
88
9
0
30 Apr 2023
NewsPanda: Media Monitoring for Timely Conservation Action
NewsPanda: Media Monitoring for Timely Conservation Action
Sedrick Scott Keh
Z. Shi
David J. Patterson
N. Bhagabati
Karun Dewan
...
Pablo R. Izquierdo
D. Mallick
Ambika Sharma
Pooja Shrestha
Fei Fang
86
6
0
30 Apr 2023
POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained
  models
POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained models
Korawat Tanwisuth
Shujian Zhang
Huangjie Zheng
Pengcheng He
Mingyuan Zhou
VLMVPVLM
188
28
0
29 Apr 2023
A Review of ChatGPT Applications in Education, Marketing, Software
  Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions
A Review of ChatGPT Applications in Education, Marketing, Software Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions
Mohammad Fraiwan
Natheer Khasawneh
106
48
0
29 Apr 2023
Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques
  for LLMs
Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs
George Pu
Anirudh Jain
Jihan Yin
Russell Kaplan
75
43
0
28 Apr 2023
CCpdf: Building a High Quality Corpus for Visually Rich Documents from
  Web Crawl Data
CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data
M. Turski
Tomasz Stanislawek
Karol Kaczmarek
Pawel Dyda
Filip Graliñski
97
12
0
28 Apr 2023
Information Redundancy and Biases in Public Document Information
  Extraction Benchmarks
Information Redundancy and Biases in Public Document Information Extraction Benchmarks
S. Laatiri
Pirashanth Ratnamogan
Joel Tang
Laurent Lam
William Vanhuffel
Fabien Caspani
45
1
0
28 Apr 2023
HQP: A Human-Annotated Dataset for Detecting Online Propaganda
HQP: A Human-Annotated Dataset for Detecting Online Propaganda
Abdurahman Maarouf
Dominik Bär
Dominique Geissler
Stefan Feuerriegel
80
10
0
28 Apr 2023
ChatGPT Evaluation on Sentence Level Relations: A Focus on Temporal,
  Causal, and Discourse Relations
ChatGPT Evaluation on Sentence Level Relations: A Focus on Temporal, Causal, and Discourse Relations
Chunkit Chan
Cheng Jiayang
Weiqi Wang
Yuxin Jiang
Tianqing Fang
Xin Liu
Yangqiu Song
LRM
170
62
0
28 Apr 2023
ResiDual: Transformer with Dual Residual Connections
ResiDual: Transformer with Dual Residual Connections
Shufang Xie
Huishuai Zhang
Junliang Guo
Xu Tan
Jiang Bian
Hany Awadalla
Arul Menezes
Tao Qin
Rui Yan
103
20
0
28 Apr 2023
RexUIE: A Recursive Method with Explicit Schema Instructor for Universal
  Information Extraction
RexUIE: A Recursive Method with Explicit Schema Instructor for Universal Information Extraction
Chengyuan Liu
Fubang Zhao
Yangyang Kang
Jingyuan Zhang
Xiang Zhou
Changlong Sun
Kun Kuang
Leilei Gan
96
11
0
28 Apr 2023
Made of Steel? Learning Plausible Materials for Components in the
  Vehicle Repair Domain
Made of Steel? Learning Plausible Materials for Components in the Vehicle Repair Domain
Annerose Eichel
Helena Schlipf
Sabine Schulte im Walde
67
2
0
28 Apr 2023
DIAMANT: Dual Image-Attention Map Encoders For Medical Image
  Segmentation
DIAMANT: Dual Image-Attention Map Encoders For Medical Image Segmentation
Yousef Yeganeh
Azade Farshad
Peter Weinberger
Seyed-Ahmad Ahmadi
Ehsan Adeli
Nassir Navab
ViTMedIm
58
0
0
28 Apr 2023
Analyzing Vietnamese Legal Questions Using Deep Neural Networks with
  Biaffine Classifiers
Analyzing Vietnamese Legal Questions Using Deep Neural Networks with Biaffine Classifiers
Nguyen Anh Tu
Hoang Thi Thu Uyen
Tu Minh Phuong
Ngo Xuan Bach
AILaw
72
1
0
27 Apr 2023
Energy-based Models are Zero-Shot Planners for Compositional Scene
  Rearrangement
Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement
N. Gkanatsios
Ayush Jain
Zhou Xian
Yunchu Zhang
C. Atkeson
Katerina Fragkiadaki
LM&Ro
160
33
0
27 Apr 2023
Controlled Text Generation with Natural Language Instructions
Controlled Text Generation with Natural Language Instructions
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ethan Gotlieb Wilcox
Ryan Cotterell
Mrinmaya Sachan
218
92
0
27 Apr 2023
A Modular Approach for Multilingual Timex Detection and Normalization
  using Deep Learning and Grammar-based methods
A Modular Approach for Multilingual Timex Detection and Normalization using Deep Learning and Grammar-based methods
Nayla Escribano
German Rigau
Rodrigo Agerri
55
4
0
27 Apr 2023
ChatLog: Carefully Evaluating the Evolution of ChatGPT Across Time
ChatLog: Carefully Evaluating the Evolution of ChatGPT Across Time
Shangqing Tu
Chunyang Li
Jifan Yu
Xiaozhi Wang
Lei Hou
Juanzi Li
LLMAGAI4MH
158
10
0
27 Apr 2023
Origin Tracing and Detecting of LLMs
Origin Tracing and Detecting of LLMs
Linyang Li
Pengyu Wang
Kerong Ren
Tianxiang Sun
Xipeng Qiu
LLMAG
144
35
0
27 Apr 2023
SweCTRL-Mini: a data-transparent Transformer-based large language model
  for controllable text generation in Swedish
SweCTRL-Mini: a data-transparent Transformer-based large language model for controllable text generation in Swedish
Dmytro Kalpakchi
Johan Boye
SyDa
49
3
0
27 Apr 2023
Contour Completion by Transformers and Its Application to Vector Font
  Data
Contour Completion by Transformers and Its Application to Vector Font Data
Yusuke Nagata
Brian Kenji Iwana
S. Uchida
90
1
0
27 Apr 2023
Learning and Reasoning Multifaceted and Longitudinal Data for Poverty
  Estimates and Livelihood Capabilities of Lagged Regions in Rural India
Learning and Reasoning Multifaceted and Longitudinal Data for Poverty Estimates and Livelihood Capabilities of Lagged Regions in Rural India
Atharva Kulkarni
Raya Das
R. Srivastava
Tanmoy Chakraborty
39
2
0
27 Apr 2023
Neural Keyphrase Generation: Analysis and Evaluation
Neural Keyphrase Generation: Analysis and Evaluation
Tuhin Kundu
Jishnu Ray Chowdhury
Cornelia Caragea
60
0
0
27 Apr 2023
MasonNLP+ at SemEval-2023 Task 8: Extracting Medical Questions,
  Experiences and Claims from Social Media using Knowledge-Augmented
  Pre-trained Language Models
MasonNLP+ at SemEval-2023 Task 8: Extracting Medical Questions, Experiences and Claims from Social Media using Knowledge-Augmented Pre-trained Language Models
Giridhar Kaushik Ramachandran
Haritha Gangavarapu
K. Lybarger
Özlem Uzuner
57
1
0
26 Apr 2023
Transferring Procedural Knowledge across Commonsense Tasks
Transferring Procedural Knowledge across Commonsense Tasks
Yifan Jiang
Filip Ilievski
Kaixin Ma
59
3
0
26 Apr 2023
Towards ethical multimodal systems
Towards ethical multimodal systems
Alexis Roger
Esma Aïmeur
Irina Rish
56
3
0
26 Apr 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
214
687
0
26 Apr 2023
Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables
Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables
Matthias Urban
Carsten Binnig
79
5
0
26 Apr 2023
Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate
  Representation
Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation
Krishnam Hasija
Shrishti Pradhan
Manasi Patwardhan
Raveendra Kumar Medicherla
Lovekesh Vig
Ravindra Naik
56
2
0
26 Apr 2023
Introducing MBIB -- the first Media Bias Identification Benchmark Task
  and Dataset Collection
Introducing MBIB -- the first Media Bias Identification Benchmark Task and Dataset Collection
Martin Wessel
Tomávs Horych
Terry Ruas
Akiko Aizawa
Bela Gipp
Timo Spinde
83
25
0
25 Apr 2023
Intent Induction from Conversations for Task-Oriented Dialogue Track at
  DSTC 11
Intent Induction from Conversations for Task-Oriented Dialogue Track at DSTC 11
James Gung
Raphael Shu
Emily Moeng
Wesley Rose
Salvatore Romeo
Yassine Benajiba
Arshit Gupta
Saab Mansour
Yi Zhang
101
8
0
25 Apr 2023
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based
  Adapters
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters
Md Mahfuz Ibn Alam
Ruoyu Xie
Fahim Faisal
Antonios Anastasopoulos
76
3
0
25 Apr 2023
NLP-LTU at SemEval-2023 Task 10: The Impact of Data Augmentation and
  Semi-Supervised Learning Techniques on Text Classification Performance on an
  Imbalanced Dataset
NLP-LTU at SemEval-2023 Task 10: The Impact of Data Augmentation and Semi-Supervised Learning Techniques on Text Classification Performance on an Imbalanced Dataset
Sana Al-Azzawi
Gyorgy Kovács
Filip Nilsson
Tosin Adewumi
Marcus Liwicki
42
7
0
25 Apr 2023
Test-Time Adaptation with Perturbation Consistency Learning
Test-Time Adaptation with Perturbation Consistency Learning
Yi Su
Yixin Ji
Juntao Li
Hai Ye
Hao Fei
VLM
68
2
0
25 Apr 2023
PUNR: Pre-training with User Behavior Modeling for News Recommendation
PUNR: Pre-training with User Behavior Modeling for News Recommendation
Guangyuan Ma
Hongtao Liu
Xing Wu
Wanhui Qian
Zhepeng Lv
Q. Yang
Songlin Hu
SSL
53
3
0
25 Apr 2023
KINLP at SemEval-2023 Task 12: Kinyarwanda Tweet Sentiment Analysis
KINLP at SemEval-2023 Task 12: Kinyarwanda Tweet Sentiment Analysis
Antoine Nzeyimana
44
3
0
25 Apr 2023
DocParser: End-to-end OCR-free Information Extraction from Visually Rich
  Documents
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
M. Dhouib
G. Bettaieb
A. Shabou
73
22
0
24 Apr 2023
Extreme Classification for Answer Type Prediction in Question Answering
Extreme Classification for Answer Type Prediction in Question Answering
Vinay Setty
89
1
0
24 Apr 2023
Enriching Source Code with Contextual Data for Code Completion Models:
  An Empirical Study
Enriching Source Code with Contextual Data for Code Completion Models: An Empirical Study
Tim van Dam
Maliheh Izadi
Arie van Deursen
44
15
0
24 Apr 2023
Previous
123...109110111...215216217
Next