ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXivPDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 9,299 papers shown
Title
A Unique Training Strategy to Enhance Language Models Capabilities for
  Health Mention Detection from Social Media Content
A Unique Training Strategy to Enhance Language Models Capabilities for Health Mention Detection from Social Media Content
Pervaiz Iqbal Khan
Muhammad Nabeel Asim
Andreas Dengel
Sheraz Ahmed
21
1
0
29 Oct 2023
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition
  and Relation Classification Methods
A Few-Shot Learning Focused Survey on Recent Named Entity Recognition and Relation Classification Methods
S. Alqaaidi
Elika Bozorgi
Afsaneh Shams
Krzysztof J. Kochut
DRL
40
0
0
29 Oct 2023
Bipartite Graph Pre-training for Unsupervised Extractive Summarization
  with Graph Convolutional Auto-Encoders
Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders
Qianren Mao
Shaobo Zhao
Jiarui Li
Xiaolei Gu
Shizhu He
Bo Li
Jianxin Li
SSL
22
2
0
29 Oct 2023
Retrofitting Light-weight Language Models for Emotions using Supervised
  Contrastive Learning
Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning
Sapan Shah
Sreedhar Reddy
Pushpak Bhattacharyya
32
0
0
29 Oct 2023
Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text
  Detection
Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text Detection
Duke Nguyen
Khaing Myat Noe Naing
Aditya Joshi
37
6
0
29 Oct 2023
All Things Considered: Detecting Partisan Events from News Media with
  Cross-Article Comparison
All Things Considered: Detecting Partisan Events from News Media with Cross-Article Comparison
Yujian Liu
Xinliang Frederick Zhang
Kaijian Zou
Ruihong Huang
Nick Beauchamp
Lu Wang
43
4
0
28 Oct 2023
Rethinking Semi-Supervised Federated Learning: How to co-train
  fully-labeled and fully-unlabeled client imaging data
Rethinking Semi-Supervised Federated Learning: How to co-train fully-labeled and fully-unlabeled client imaging data
Pramit Saha
Divyanshu Mishra
J. A. Noble
FedML
84
8
0
28 Oct 2023
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded
  Dialogue Generation
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation
Yixin Wan
Fanyou Wu
Weijie Xu
Srinivasan H. Sengamedu
HILM
34
5
0
28 Oct 2023
Crossing the Aisle: Unveiling Partisan and Counter-Partisan Events in
  News Reporting
Crossing the Aisle: Unveiling Partisan and Counter-Partisan Events in News Reporting
Kaijian Zou
Xinliang Frederick Zhang
Winston Wu
Nick Beauchamp
Lu Wang
51
3
0
28 Oct 2023
TLM: Token-Level Masking for Transformers
TLM: Token-Level Masking for Transformers
Yangjun Wu
Kebin Fang
Dongxian Zhang
Han Wang
Hao Zhang
Gang Chen
36
1
0
28 Oct 2023
Probing LLMs for Joint Encoding of Linguistic Categories
Probing LLMs for Joint Encoding of Linguistic Categories
Giulio Starace
Konstantinos Papakostas
Rochelle Choenni
Apostolos Panagiotopoulos
Matteo Rosati
Alina Leidinger
Ekaterina Shutova
47
7
0
28 Oct 2023
Foundational Models in Medical Imaging: A Comprehensive Survey and
  Future Vision
Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision
Bobby Azad
Reza Azad
Sania Eskandari
Afshin Bozorgpour
Amirhossein Kazerouni
I. Rekik
Dorit Merhof
VLM
MedIm
105
61
0
28 Oct 2023
When Reviewers Lock Horn: Finding Disagreement in Scientific Peer
  Reviews
When Reviewers Lock Horn: Finding Disagreement in Scientific Peer Reviews
Sandeep Kumar
Tirthankar Ghosal
Asif Ekbal
27
1
0
28 Oct 2023
Setting the Trap: Capturing and Defeating Backdoors in Pretrained
  Language Models through Honeypots
Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots
Ruixiang Tang
Jiayi Yuan
Yiming Li
Zirui Liu
Rui Chen
Xia Hu
AAML
53
13
0
28 Oct 2023
Anaphor Assisted Document-Level Relation Extraction
Anaphor Assisted Document-Level Relation Extraction
Chonggang Lu
Richong Zhang
Kai Sun
Jaein Kim
Cunwang Zhang
Yongyi Mao
55
8
0
28 Oct 2023
Large Language Models Are Better Adversaries: Exploring Generative
  Clean-Label Backdoor Attacks Against Text Classifiers
Large Language Models Are Better Adversaries: Exploring Generative Clean-Label Backdoor Attacks Against Text Classifiers
Wencong You
Zayd Hammoudeh
Daniel Lowd
AAML
37
13
0
28 Oct 2023
SDOH-NLI: a Dataset for Inferring Social Determinants of Health from
  Clinical Notes
SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes
Á. Lelkes
Eric Loreaux
Tal Schuster
Ming-Jun Chen
Alvin Rajkomar
57
2
0
27 Oct 2023
FP8-LM: Training FP8 Large Language Models
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
59
42
0
27 Oct 2023
Elevating Code-mixed Text Handling through Auditory Information of Words
Elevating Code-mixed Text Handling through Auditory Information of Words
Mamta Mamta
Zishan Ahmad
Asif Ekbal
17
6
0
27 Oct 2023
A Scalable Framework for Table of Contents Extraction from Complex ESG
  Annual Reports
A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports
Xinyu Wang
Lin Gui
Yulan He
LMTD
36
2
0
27 Oct 2023
Multi-grained Evidence Inference for Multi-choice Reading Comprehension
Multi-grained Evidence Inference for Multi-choice Reading Comprehension
Yilin Zhao
Hai Zhao
Sufeng Duan
33
2
0
27 Oct 2023
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for
  Offensive Language Identification
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language Identification
Dhiman Goswami
Md. Nishat Raihan
Antara Mahmud
Antonios Anstasopoulos
Marcos Zampieri
11
5
0
27 Oct 2023
SentMix-3L: A Bangla-English-Hindi Code-Mixed Dataset for Sentiment
  Analysis
SentMix-3L: A Bangla-English-Hindi Code-Mixed Dataset for Sentiment Analysis
Md. Nishat Raihan
Dhiman Goswami
Antara Mahmud
Antonios Anstasopoulos
Marcos Zampieri
37
10
0
27 Oct 2023
SOUL: Towards Sentiment and Opinion Understanding of Language
SOUL: Towards Sentiment and Opinion Understanding of Language
Yue Deng
Wenxuan Zhang
Sinno Jialin Pan
Lidong Bing
LRM
17
1
0
27 Oct 2023
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection
  Method
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
Yukun Zhao
Lingyong Yan
Weiwei Sun
Guoliang Xing
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
Dawei Yin
46
37
0
27 Oct 2023
Natural Language Interfaces for Tabular Data Querying and Visualization:
  A Survey
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey
Weixu Zhang
Yifei Wang
Yuanfeng Song
Victor Junqiu Wei
Yuxing Tian
Yiyan Qi
Jonathan H. Chan
Raymond Chi-Wing Wong
Haiqin Yang
LMTD
51
17
0
27 Oct 2023
TarGEN: Targeted Data Generation with Large Language Models
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta
Kevin Scaria
Ujjwala Anantheswaran
Shreyas Verma
Mihir Parmar
Saurabh Arjun Sawant
Chitta Baral
Swaroop Mishra
SyDa
51
4
0
27 Oct 2023
SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL
  Translation
SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation
A. Bazaga
Pietro Lio
G. Micklem
31
3
0
27 Oct 2023
Outlier Dimensions Encode Task-Specific Knowledge
Outlier Dimensions Encode Task-Specific Knowledge
William Rudman
Catherine Chen
Carsten Eickhoff
29
5
0
26 Oct 2023
InstOptima: Evolutionary Multi-objective Instruction Optimization via
  Large Language Model-based Instruction Operators
InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators
Heng Yang
Ke Li
42
18
0
26 Oct 2023
LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset
LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset
Haitao Li
Yunqiu Shao
Yueyue Wu
Qingyao Ai
Yixiao Ma
Yiqun Liu
AILaw
38
25
0
26 Oct 2023
The Expressive Power of Low-Rank Adaptation
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
60
55
0
26 Oct 2023
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open
  Environments
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments
Mengxue Qu
Yu-Huan Wu
Wu Liu
Xiaodan Liang
Jingkuan Song
Yao-Min Zhao
Yunchao Wei
27
16
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How
  Does Information Loss Affect Performance?
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
40
1
0
26 Oct 2023
Topic Segmentation of Semi-Structured and Unstructured Conversational
  Datasets using Language Models
Topic Segmentation of Semi-Structured and Unstructured Conversational Datasets using Language Models
Reshmi Ghosh
Harjeet Singh Kajal
Sharanya Kamath
Dhuri Shrivastava
Samyadeep Basu
Hansi Zeng
Soundararajan Srinivasan
37
0
0
26 Oct 2023
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Daniela Ben-David
Tzuf Paz-Argaman
Reut Tsarfaty
MoE
39
0
0
25 Oct 2023
Data Augmentation for Emotion Detection in Small Imbalanced Text Data
Data Augmentation for Emotion Detection in Small Imbalanced Text Data
Anna Koufakou
Diego Grisales
Ragy Costa de jesus
Oscar Fox
25
2
0
25 Oct 2023
How well can machine-generated texts be identified and can language
  models be trained to avoid identification?
How well can machine-generated texts be identified and can language models be trained to avoid identification?
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
31
1
0
25 Oct 2023
Privately Aligning Language Models with Reinforcement Learning
Privately Aligning Language Models with Reinforcement Learning
Fan Wu
Huseyin A. Inan
A. Backurs
Varun Chandrasekaran
Janardhan Kulkarni
Robert Sim
46
7
0
25 Oct 2023
Improving a Named Entity Recognizer Trained on Noisy Data with a Few
  Clean Instances
Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances
Zhendong Chu
Ruiyi Zhang
Tong Yu
R. Jain
Vlad I. Morariu
Jiuxiang Gu
A. Nenkova
NoLa
68
2
0
25 Oct 2023
IntenDD: A Unified Contrastive Learning Approach for Intent Detection
  and Discovery
IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery
Bhavuk Singhal
Ashim Gupta
P. ShivasankaranV
Amrith Krishna
35
1
0
25 Oct 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
44
9
0
25 Oct 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email
  Response Prediction
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
Yuqing Wang
Prashanth Vijayaraghavan
Ehsan Degan
49
4
0
25 Oct 2023
On the Interplay between Fairness and Explainability
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
34
4
0
25 Oct 2023
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box
  Models
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models
Oren Barkan
Yuval Asher
Amit Eshel
Yehonatan Elisha
Noam Koenigstein
48
5
0
25 Oct 2023
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained
  Language Models
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
Paul Youssef
Osman Alperen Koracs
Meijie Li
Jorg Schlotterer
Christin Seifert
KELM
33
17
0
25 Oct 2023
FedTherapist: Mental Health Monitoring with User-Generated Linguistic
  Expressions on Smartphones via Federated Learning
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning
Jaemin Shin
Hyungjun Yoon
Seungjoo Lee
Sungjoon Park
Yunxin Liu
Jinho D. Choi
Sung-Ju Lee
45
5
0
25 Oct 2023
CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task
  Information Retrieval
CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task Information Retrieval
Jindvrich Helcl
Jindvrich Libovický
LRM
31
0
0
25 Oct 2023
Enhancing Document Information Analysis with Multi-Task Pre-training: A
  Robust Approach for Information Extraction in Visually-Rich Documents
Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents
Tofik Ali
Partha Pratim Roy
26
0
0
25 Oct 2023
Enhanced Simultaneous Machine Translation with Word-level Policies
Enhanced Simultaneous Machine Translation with Word-level Policies
Kang Kim
Hankyu Cho
78
3
0
25 Oct 2023
Previous
123...697071...184185186
Next