RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
arXiv: 1907.11692

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,763 papers shown
Enhancing Low Resource NER Using Assisting Language And Transfer Learning
Maithili Sabane
Aparna Ranade
Onkar Litake
Parth Patil
Raviraj Joshi
Dipali M. Kadam
64
5
0
10 Jun 2023
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
Fuxiao Liu
Hao Tan
Chris Tensmeyer
CLIP, VLM
99
18
0
09 Jun 2023
Measuring and Modifying Factual Knowledge in Large Language Models
Pouya Pezeshkpour
KELM
68
18
0
09 Jun 2023
Morphosyntactic probing of multilingual BERT models
Judit Ács
Endre Hamerlik
Roy Schwartz
Noah A. Smith
András Kornai
91
10
0
09 Jun 2023
Prodigy: An Expeditiously Adaptive Parameter-Free Learner
Konstantin Mishchenko
Aaron Defazio
ODL
110
65
0
09 Jun 2023
Implementing BERT and fine-tuned RobertA to detect AI generated news by ChatGPT
Zecong Wang
Jiaxi Cheng
Chen Cui
Chenhao Yu
DeLMO
70
17
0
09 Jun 2023
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Wissam Antoun
Virginie Mouilleron
Benoît Sagot
Djamé Seddah
DeLMO
85
33
0
09 Jun 2023
Can Large Language Models Infer Causation from Correlation?
Zhijing Jin
Jiarui Liu
Zhiheng Lyu
Spencer Poff
Mrinmaya Sachan
Rada Mihalcea
Mona T. Diab
Bernhard Schölkopf
LRM
100
129
0
09 Jun 2023
How Can Recommender Systems Benefit from Large Language Models: A Survey
Jianghao Lin
Xinyi Dai
Yunjia Xi
Weiwen Liu
Bo Chen
...
Chenxu Zhu
Huifeng Guo
Yong Yu
Ruiming Tang
Weinan Zhang
LRM
187
224
0
09 Jun 2023
Causality between Sentiment and Cryptocurrency Prices
Lubdhak Mondal
Udeshya Raj
S. Abinandhan
S. BeganGowsik
P. Sarwesh
Abhijeet Chandra
27
0
0
09 Jun 2023
Exploring Effective Mask Sampling Modeling for Neural Image Compression
Lin Liu
Mingming Zhao
Shanxin Yuan
Wenlong Lyu
Wen-gang Zhou
Houqiang Li
Yanfeng Wang
Qi Tian
69
3
0
09 Jun 2023
Embodied Executable Policy Learning with Language-based Scene Summarization
Jielin Qiu
Mengdi Xu
William Jongwon Han
Seungwhan Moon
Ding Zhao
LM&Ro
86
8
0
09 Jun 2023
COVER: A Heuristic Greedy Adversarial Attack on Prompt-based Learning in Language Models
Zihao Tan
Qingliang Chen
Wenbin Zhu
Yongjian Huang
AAML, SILM
91
3
0
09 Jun 2023
WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction
Qiyu Wu
Masaaki Nagata
Yoshimasa Tsuruoka
61
5
0
09 Jun 2023
Privacy- and Utility-Preserving NLP with Anonymized Data: A case study of Pseudonymization
Oleksandr Yermilov
Vipul Raheja
Artem Chernodub
55
10
0
08 Jun 2023
Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks
Katelyn Mei
Sonia Fereidooni
Aylin Caliskan
87
56
0
08 Jun 2023
Artificial General Intelligence for Medical Imaging
Xiang Li
Lu Zhang
Zihao Wu
Zheng Liu
Lin Zhao
...
Pingkuan Yan
Quanzheng Li
Wen Liu
Tianming Liu
Dinggang Shen
LM&MA, AI4CE
140
42
0
08 Jun 2023
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models Memories
Shizhe Diao
Tianyang Xu
Ruijia Xu
Jiawei Wang
Tong Zhang
MoE, AI4CE
55
41
0
08 Jun 2023
Utterance Emotion Dynamics in Children's Poems: Emotional Changes Across Age
Daniela Teodorescu
Alona Fyshe
Saif M. Mohammad
54
2
0
08 Jun 2023
Revisit Few-shot Intent Classification with PLMs: Direct Fine-tuning vs. Continual Pre-training
Haode Zhang
Haowen Liang
Li-Ming Zhan
Xiao-Ming Wu
Albert Y. S. Lam
VLM
83
8
0
08 Jun 2023
Extensive Evaluation of Transformer-based Architectures for Adverse Drug Events Extraction
Simone Scaboro
Beatrice Portelli
Emmanuele Chersoni
Enrico Santus
Giuseppe Serra
64
9
0
08 Jun 2023
Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation Framework
Mingqi Gao
Xiaojun Wan
Jia Su
Zhefeng Wang
Baoxing Huai
HILM
63
9
0
08 Jun 2023
Enhancing Robustness of AI Offensive Code Generators via Data Augmentation
Cristina Improta
Pietro Liguori
R. Natella
B. Cukic
Domenico Cotroneo
AAML
93
5
0
08 Jun 2023
LCT-1 at SemEval-2023 Task 10: Pre-training and Multi-task Learning for Sexism Detection and Classification
K. Chernyshev
E. Garanina
Duygu Bayram
Qiankun Zheng
Lukas Edman
20
0
0
08 Jun 2023
Improving Visual Prompt Tuning for Self-supervised Vision Transformers
S. Yoo
Eunji Kim
Dahuin Jung
Jungbeom Lee
Sung-Hoon Yoon
VLM
128
44
0
08 Jun 2023
Interpretable Medical Diagnostics with Structured Data Extraction by Large Language Models
Aleksa Bisercic
Mladen Nikolic
M. Schaar
Boris Delibasic
Pietro Lio
Andrija Petrović
94
17
0
08 Jun 2023
Assessing Phrase Break of ESL Speech with Pre-trained Language Models and Large Language Models
Zhiyi Wang
Shaoguang Mao
Wenshan Wu
Yan Xia
Yan Deng
Jonathan Tien
71
4
0
08 Jun 2023
Leveraging Language Identification to Enhance Code-Mixed Text Classification
Gauri Takawane
Abhishek Phaltankar
Varad Patwardhan
Aryan Patil
Raviraj Joshi
Mukta S. Takalikar
72
4
0
08 Jun 2023
RE-Matching: A Fine-Grained Semantic Matching Method for Zero-Shot Relation Extraction
Jun Zhao
Wenyu Zhan
Xin Zhao
Qi Zhang
Tao Gui
Zhongyu Wei
Junzhe Wang
Minlong Peng
Mingming Sun
82
24
0
08 Jun 2023
InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding
Junda Wu
Tong Yu
Rui Wang
Zhao Song
Ruiyi Zhang
Handong Zhao
Chaochao Lu
Shuai Li
Ricardo Henao
VLM
94
25
0
08 Jun 2023
Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning
Jaehyung Kim
Jinwoo Shin
Dongyeop Kang
64
2
0
08 Jun 2023
Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Ganesh Jawahar
Haichuan Yang
Yunyang Xiong
Zechun Liu
Dilin Wang
...
Barlas Oğuz
Muhammad Abdul-Mageed
L. Lakshmanan
Raghuraman Krishnamoorthi
Vikas Chandra
84
4
0
08 Jun 2023
Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts
Eduard Tulchinskii
Kristian Kuznetsov
Laida Kushnareva
D. Cherniavskii
S. Barannikov
Irina Piontkovskaya
Sergey I. Nikolenko
Evgeny Burnaev
DeLMO
107
89
0
07 Jun 2023
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Lifan Yuan
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Fangyuan Zou
Xingyi Cheng
Heng Ji
Zhiyuan Liu
Maosong Sun
146
84
0
07 Jun 2023
Proximity-Informed Calibration for Deep Neural Networks
Miao Xiong
Ailin Deng
Pang Wei Koh
Jiaying Wu
Shen Li
Jianqing Xu
Bryan Hooi
UQCV
163
15
0
07 Jun 2023
Contrastive Bootstrapping for Label Refinement
Shudi Hou
Yu Xia
Muhao Chen
Sujian Li
49
0
0
07 Jun 2023
Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers
Israt Jahan
Md Tahmid Rahman Laskar
Chun Peng
J. Huang
LM&MA, MedIm, AI4MH
106
32
0
07 Jun 2023
STEPS: A Benchmark for Order Reasoning in Sequential Tasks
Weizhi Wang
Hong Wang
Xi Yan
LRM
76
1
0
07 Jun 2023
On the Detectability of ChatGPT Content: Benchmarking, Methodology, and Evaluation through the Lens of Academic Writing
Zeyan Liu
Zijun Yao
Fengjun Li
Bo Luo
DeLMO
89
23
0
07 Jun 2023
Multilingual Clinical NER: Translation or Cross-lingual Transfer?
X. Fontaine
Félix Gaschi
Parisa Rastin
Y. Toussaint
99
10
0
07 Jun 2023
From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation
Zhibin Chen
Yansong Feng
Dongyan Zhao
53
0
0
07 Jun 2023
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation
Xiusi Chen
Yu Zhang
Jinliang Deng
Jyun-Yu Jiang
Wei Wang
64
12
0
07 Jun 2023
XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations
Yusen Zhang
Jun Wang
Zhiguo Wang
Rui Zhang
VLM
122
9
0
07 Jun 2023
TKDP: Threefold Knowledge-enriched Deep Prompt Tuning for Few-shot Named Entity Recognition
Jiang-Dong Liu
Hao Fei
Fei Li
Jingye Li
Bobo Li
Liang Zhao
Chong Teng
Donghong Ji
VLM
48
11
0
06 Jun 2023
How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation?
Shafiuddin Rehan Ahmed
Abhijnan Nath
Michael Regan
Adam Pollins
Nikhil Krishnaswamy
James H. Martin
40
6
0
06 Jun 2023
Causal interventions expose implicit situation models for commonsense language understanding
Takateru Yamakoshi
James L. McClelland
A. Goldberg
Robert D. Hawkins
104
6
0
06 Jun 2023
From Key Points to Key Point Hierarchy: Structured and Expressive Opinion Summarization
Arie Cattan
Lilach Eden
Yoav Kantor
Roy Bar-Haim
63
10
0
06 Jun 2023
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter
Ajay Jaiswal
Shiwei Liu
Tianlong Chen
Zhangyang Wang
VLM
71
34
0
06 Jun 2023
A Novel Approach To User Agent String Parsing For Vulnerability Analysis Using Mutli-Headed Attention
Dhruv Nandakumar
Sathvik Murli
Ankur Khosla
K. Choi
Abdul Rahman
Drew Walsh
Scott Riede
Eric Dull
Edward Bowen
28
1
0
06 Jun 2023
On the Difference of BERT-style and CLIP-style Text Encoders
Zhihong Chen
Guiming Hardy Chen
Shizhe Diao
Xiang Wan
Benyou Wang
VLM
67
19
0
06 Jun 2023