ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach


26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,793 papers shown
Title
Do large language models solve verbal analogies like children do?
Claire E. Stevenson
Mathilde ter Veen
Rochelle Choenni
Han L. J. van der Maas
Ekaterina Shutova
LRM
33
8
0
31 Oct 2023
Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating Chess Moves based on Sentiment Analysis
Haifa Alrdahi
Riza Batista-Navarro
62
2
0
31 Oct 2023
EELBERT: Tiny Models through Dynamic Embeddings
Gabrielle Cohn
Rishika Agarwal
Deepanshu Gupta
Siddharth Patwardhan
38
2
0
31 Oct 2023
Improving Prompt Tuning with Learned Prompting Layers
Wei Zhu
Ming Tan
VLM
117
1
0
31 Oct 2023
Ling-CL: Understanding NLP Models through Linguistic Curricula
Mohamed Elgaar
Hadi Amiri
77
2
0
31 Oct 2023
Making Large Language Models Better Data Creators
Dong-Ho Lee
Jay Pujara
Mohit Sewak
Ryen W. White
S. Jauhar
ALM SyDa
44
26
0
31 Oct 2023
Efficient Classification of Student Help Requests in Programming Courses Using Large Language Models
Jaromír Šavelka
Paul Denny
Mark H. Liffiton
Brad Sheese
AI4Ed
75
7
0
31 Oct 2023
Evaluating Neural Language Models as Cognitive Models of Language Acquisition
Héctor Javier Vázquez Martínez
Annika Lea Heuser
Charles D. Yang
Jordan Kodner
102
10
0
31 Oct 2023
Partial Tensorized Transformers for Natural Language Processing
Subhadra Vadlamannati
Ryan Solgi
55
0
0
30 Oct 2023
Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection
Costas Mavromatis
Balasubramaniam Srinivasan
Zhengyuan Shen
Jiani Zhang
Huzefa Rangwala
Christos Faloutsos
George Karypis
60
26
0
30 Oct 2023
Faithful and Robust Local Interpretability for Textual Predictions
Gianluigi Lopardo
F. Precioso
Damien Garreau
OOD
64
4
0
30 Oct 2023
Split-NER: Named Entity Recognition via Two Question-Answering-based Classifications
Jatin Arora
Youngja Park
69
8
0
30 Oct 2023
Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents
Michael Gunther
Jackmin Ong
Isabelle Mohr
Alaeddine Abdessalem
Tanguy Abel
...
Saba Sturua
Bo Wang
Maximilian Werk
Nan Wang
Han Xiao
RALM
229
67
0
30 Oct 2023
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone
Zeyinzi Jiang
Chaojie Mao
Ziyuan Huang
Ao Ma
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
88
16
0
30 Oct 2023
Chain-of-Thought Embeddings for Stance Detection on Social Media
Joseph Gatto
Omar Sharif
S. Preum
LRM
75
14
0
30 Oct 2023
Integrating Pre-trained Language Model into Neural Machine Translation
Soon-Jae Hwang
Chang-Sung Jeong
56
0
0
30 Oct 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
84
40
0
30 Oct 2023
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
Zhenpeng Su
Xing Wu
Xue Bai
Zijia Lin
Hui Chen
Guiguang Ding
Wei Zhou
Songlin Hu
138
5
0
30 Oct 2023
Are Natural Domain Foundation Models Useful for Medical Image Classification?
Joana Palés Huix
Adithya Raju Ganeshan
Johan Fredin Haslum
Magnus P Soderberg
Christos Matsoukas
Kevin Smith
OOD MedIm VLM
87
35
0
30 Oct 2023
Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings
David Samuel
54
4
0
30 Oct 2023
A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
75
1
0
30 Oct 2023
Overview of the CLAIMSCAN-2023: Uncovering Truth in Social Media through Claim Detection and Identification of Claim Spans
Megha Sundriyal
Md. Shad Akhtar
Tanmoy Chakraborty
63
3
0
30 Oct 2023
Adapter Pruning using Tropical Characterization
Rishabh Bhardwaj
Tushar Vaidya
Soujanya Poria
28
0
0
30 Oct 2023
On the accuracy and efficiency of group-wise clipping in differentially private optimization
Zhiqi Bu
Ruixuan Liu
Yu Wang
Sheng Zha
George Karypis
VLM
68
4
0
30 Oct 2023
From Chatbots to PhishBots? -- Preventing Phishing scams created using ChatGPT, Google Bard and Claude
Sayak Saha Roy
Poojitha Thota
Krishna Vamsi Naragam
Shirin Nilizadeh
SILM
105
19
0
29 Oct 2023
Robustifying Language Models with Test-Time Adaptation
Noah T. McDermott
Junfeng Yang
Chengzhi Mao
102
2
0
29 Oct 2023
A Unique Training Strategy to Enhance Language Models Capabilities for Health Mention Detection from Social Media Content
Pervaiz Iqbal Khan
Muhammad Nabeel Asim
Andreas Dengel
Sheraz Ahmed
35
1
0
29 Oct 2023
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition and Relation Classification Methods
S. Alqaaidi
Elika Bozorgi
Afsaneh Shams
Krzysztof J. Kochut
DRL
77
0
0
29 Oct 2023
Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders
Qianren Mao
Shaobo Zhao
Jiarui Li
Xiaolei Gu
Shizhu He
Bo Li
Jianxin Li
SSL
52
2
0
29 Oct 2023
Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning
Sapan Shah
Sreedhar Reddy
Pushpak Bhattacharyya
51
0
0
29 Oct 2023
Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text Detection
Duke Nguyen
Khaing Myat Noe Naing
Aditya Joshi
65
7
0
29 Oct 2023
All Things Considered: Detecting Partisan Events from News Media with Cross-Article Comparison
Yujian Liu
Xinliang Frederick Zhang
Kaijian Zou
Ruihong Huang
Nick Beauchamp
Lu Wang
63
5
0
28 Oct 2023
Rethinking Semi-Supervised Federated Learning: How to co-train fully-labeled and fully-unlabeled client imaging data
Pramit Saha
Divyanshu Mishra
J. A. Noble
FedML
135
8
0
28 Oct 2023
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation
Yixin Wan
Fanyou Wu
Weijie Xu
Srinivasan H. Sengamedu
HILM
73
5
0
28 Oct 2023
Crossing the Aisle: Unveiling Partisan and Counter-Partisan Events in News Reporting
Kaijian Zou
Xinliang Frederick Zhang
Winston Wu
Nick Beauchamp
Lu Wang
76
3
0
28 Oct 2023
TLM: Token-Level Masking for Transformers
Yangjun Wu
Kebin Fang
Dongxian Zhang
Han Wang
Hao Zhang
Gang Chen
56
1
0
28 Oct 2023
Probing LLMs for Joint Encoding of Linguistic Categories
Giulio Starace
Konstantinos Papakostas
Rochelle Choenni
Apostolos Panagiotopoulos
Matteo Rosati
Alina Leidinger
Ekaterina Shutova
78
7
0
28 Oct 2023
Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision
Bobby Azad
Reza Azad
Sania Eskandari
Afshin Bozorgpour
Amirhossein Kazerouni
I. Rekik
Dorit Merhof
VLM MedIm
146
68
0
28 Oct 2023
When Reviewers Lock Horn: Finding Disagreement in Scientific Peer Reviews
Sandeep Kumar
Tirthankar Ghosal
Asif Ekbal
101
1
0
28 Oct 2023
Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots
Ruixiang Tang
Jiayi Yuan
Yiming Li
Zirui Liu
Rui Chen
Helen Zhou
AAML
94
14
0
28 Oct 2023
Anaphor Assisted Document-Level Relation Extraction
Chonggang Lu
Richong Zhang
Kai Sun
Jaein Kim
Cunwang Zhang
Yongyi Mao
89
10
0
28 Oct 2023
Large Language Models Are Better Adversaries: Exploring Generative Clean-Label Backdoor Attacks Against Text Classifiers
Wencong You
Zayd Hammoudeh
Daniel Lowd
AAML
49
15
0
28 Oct 2023
SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes
Á. Lelkes
Eric Loreaux
Tal Schuster
Ming-Jun Chen
Alvin Rajkomar
80
2
0
27 Oct 2023
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
111
45
0
27 Oct 2023
Elevating Code-mixed Text Handling through Auditory Information of Words
Mamta Mamta
Zishan Ahmad
Asif Ekbal
36
6
0
27 Oct 2023
A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports
Xinyu Wang
Lin Gui
Yulan He
LMTD
58
2
0
27 Oct 2023
Multi-grained Evidence Inference for Multi-choice Reading Comprehension
Yilin Zhao
Hai Zhao
Sufeng Duan
64
2
0
27 Oct 2023
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language Identification
Dhiman Goswami
Md. Nishat Raihan
Antara Mahmud
Antonios Anastasopoulos
Marcos Zampieri
36
5
0
27 Oct 2023
SentMix-3L: A Bangla-English-Hindi Code-Mixed Dataset for Sentiment Analysis
Md. Nishat Raihan
Dhiman Goswami
Antara Mahmud
Antonios Anastasopoulos
Marcos Zampieri
58
12
0
27 Oct 2023
SOUL: Towards Sentiment and Opinion Understanding of Language
Yue Deng
Wenxuan Zhang
Sinno Jialin Pan
Lidong Bing
LRM
36
1
0
27 Oct 2023