ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,783 papers shown
Title
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection
  Method
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
Yukun Zhao
Lingyong Yan
Weiwei Sun
Guoliang Xing
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
D. Yin
89
42
0
27 Oct 2023
Natural Language Interfaces for Tabular Data Querying and Visualization:
  A Survey
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey
Weixu Zhang
Yifei Wang
Yuanfeng Song
Victor Junqiu Wei
Yuxing Tian
Yiyan Qi
Jonathan H. Chan
Raymond Chi-Wing Wong
Haiqin Yang
LMTD
81
20
0
27 Oct 2023
TarGEN: Targeted Data Generation with Large Language Models
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta
Kevin Scaria
Ujjwala Anantheswaran
Shreyas Verma
Mihir Parmar
Saurabh Arjun Sawant
Chitta Baral
Swaroop Mishra
SyDa
70
9
0
27 Oct 2023
SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL
  Translation
SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation
A. Bazaga
Pietro Lio
G. Micklem
95
3
0
27 Oct 2023
Outlier Dimensions Encode Task-Specific Knowledge
Outlier Dimensions Encode Task-Specific Knowledge
William Rudman
Catherine Chen
Carsten Eickhoff
65
5
0
26 Oct 2023
InstOptima: Evolutionary Multi-objective Instruction Optimization via
  Large Language Model-based Instruction Operators
InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators
Heng Yang
Ke Li
85
19
0
26 Oct 2023
LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset
LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset
Haitao Li
Yunqiu Shao
Yueyue Wu
Qingyao Ai
Yixiao Ma
Yiqun Liu
AILaw
89
26
0
26 Oct 2023
The Expressive Power of Low-Rank Adaptation
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
110
66
0
26 Oct 2023
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open
  Environments
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments
Mengxue Qu
Yu-Huan Wu
Wu Liu
Xiaodan Liang
Jingkuan Song
Yao-Min Zhao
Yunchao Wei
43
17
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How
  Does Information Loss Affect Performance?
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
65
1
0
26 Oct 2023
Topic Segmentation of Semi-Structured and Unstructured Conversational
  Datasets using Language Models
Topic Segmentation of Semi-Structured and Unstructured Conversational Datasets using Language Models
Reshmi Ghosh
Harjeet Singh Kajal
Sharanya Kamath
Dhuri Shrivastava
Samyadeep Basu
Hansi Zeng
Soundararajan Srinivasan
57
0
0
26 Oct 2023
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Daniela Ben-David
Tzuf Paz-Argaman
Reut Tsarfaty
MoE
73
0
0
25 Oct 2023
Data Augmentation for Emotion Detection in Small Imbalanced Text Data
Data Augmentation for Emotion Detection in Small Imbalanced Text Data
Anna Koufakou
Diego Grisales
Ragy Costa de jesus
Oscar Fox
63
3
0
25 Oct 2023
How well can machine-generated texts be identified and can language
  models be trained to avoid identification?
How well can machine-generated texts be identified and can language models be trained to avoid identification?
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
33
1
0
25 Oct 2023
Privately Aligning Language Models with Reinforcement Learning
Privately Aligning Language Models with Reinforcement Learning
Fan Wu
Huseyin A. Inan
A. Backurs
Varun Chandrasekaran
Janardhan Kulkarni
Robert Sim
99
7
0
25 Oct 2023
Improving a Named Entity Recognizer Trained on Noisy Data with a Few
  Clean Instances
Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances
Zhendong Chu
Ruiyi Zhang
Tong Yu
R. Jain
Vlad I. Morariu
Jiuxiang Gu
A. Nenkova
NoLa
118
2
0
25 Oct 2023
IntenDD: A Unified Contrastive Learning Approach for Intent Detection
  and Discovery
IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery
Bhavuk Singhal
Ashim Gupta
P. ShivasankaranV
Amrith Krishna
64
1
0
25 Oct 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
80
10
0
25 Oct 2023
PROMINET: Prototype-based Multi-View Network for Interpretable Email
  Response Prediction
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
Yuqing Wang
Prashanth Vijayaraghavan
Ehsan Degan
61
4
0
25 Oct 2023
On the Interplay between Fairness and Explainability
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
99
5
0
25 Oct 2023
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box
  Models
Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models
Oren Barkan
Yuval Asher
Amit Eshel
Yehonatan Elisha
Noam Koenigstein
69
5
0
25 Oct 2023
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained
  Language Models
Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
Paul Youssef
Osman Alperen Koracs
Meijie Li
Jorg Schlotterer
Christin Seifert
KELM
81
19
0
25 Oct 2023
FedTherapist: Mental Health Monitoring with User-Generated Linguistic
  Expressions on Smartphones via Federated Learning
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning
Jaemin Shin
Hyungjun Yoon
Seungjoo Lee
Sungjoon Park
Yunxin Liu
Jinho D. Choi
Sung-Ju Lee
70
6
0
25 Oct 2023
CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task
  Information Retrieval
CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task Information Retrieval
Jindvrich Helcl
Jindvrich Libovický
LRM
50
0
0
25 Oct 2023
Enhancing Document Information Analysis with Multi-Task Pre-training: A
  Robust Approach for Information Extraction in Visually-Rich Documents
Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents
Tofik Ali
Partha Pratim Roy
57
0
0
25 Oct 2023
Enhanced Simultaneous Machine Translation with Word-level Policies
Enhanced Simultaneous Machine Translation with Word-level Policies
Kang Kim
Hankyu Cho
102
3
0
25 Oct 2023
General Point Model with Autoencoding and Autoregressive
General Point Model with Autoencoding and Autoregressive
Zhe Li
Zhangyang Gao
Cheng Tan
Stan Z. Li
Laurence T. Yang
AI4CE3DPC
52
4
0
25 Oct 2023
Transformer-based Live Update Generation for Soccer Matches from
  Microblog Posts
Transformer-based Live Update Generation for Soccer Matches from Microblog Posts
Masashi Oshika
Kosuke Yamada
Ryohei Sasano
Koichi Takeda
39
0
0
25 Oct 2023
DiQAD: A Benchmark Dataset for End-to-End Open-domain Dialogue
  Assessment
DiQAD: A Benchmark Dataset for End-to-End Open-domain Dialogue Assessment
Yukun Zhao
Lingyong Yan
Weiwei Sun
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
D. Yin
ELM
52
0
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked
  Auto-Encoder
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
76
3
0
25 Oct 2023
URL-BERT: Training Webpage Representations via Social Media Engagements
URL-BERT: Training Webpage Representations via Social Media Engagements
A. Qamar
Chetan Verma
Ahmed El-Kishky
Sumit Binnani
Sneha Mehta
Taylor Berg-Kirkpatrick
60
0
0
25 Oct 2023
The Distributional Hypothesis Does Not Fully Explain the Benefits of
  Masked Language Model Pretraining
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining
Ting-Rui Chiang
Dani Yogatama
59
1
0
25 Oct 2023
Speakerly: A Voice-based Writing Assistant for Text Composition
Speakerly: A Voice-based Writing Assistant for Text Composition
Dhruv Kumar
Vipul Raheja
Alice Kaiser-Schatzlein
Robyn Perry
Apurva Joshi
Justin Hugues-Nuger
Samuel Lou
Navid Chowdhury
75
1
0
24 Oct 2023
Mixture-of-Linguistic-Experts Adapters for Improving and Interpreting
  Pre-trained Language Models
Mixture-of-Linguistic-Experts Adapters for Improving and Interpreting Pre-trained Language Models
Raymond Li
Gabriel Murray
Giuseppe Carenini
MoE
81
2
0
24 Oct 2023
Knowledge Editing for Large Language Models: A Survey
Knowledge Editing for Large Language Models: A Survey
Song Wang
Yaochen Zhu
Haochen Liu
Zaiyi Zheng
Chen Chen
Wenlin Yao
KELM
176
163
0
24 Oct 2023
BLP-2023 Task 2: Sentiment Analysis
BLP-2023 Task 2: Sentiment Analysis
Md. Arid Hasan
Firoj Alam
Anika Anjum
Shudipta Das
Afiyat Anjum
49
20
0
24 Oct 2023
PreWoMe: Exploiting Presuppositions as Working Memory for Long Form
  Question Answering
PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering
Wookje Han
Jinsol Park
Kyungjae Lee
73
4
0
24 Oct 2023
From Heuristic to Analytic: Cognitively Motivated Strategies for
  Coherent Physical Commonsense Reasoning
From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning
Zheyuan Zhang
Shane Storks
Fengyuan Hu
Sungryull Sohn
Moontae Lee
Honglak Lee
Joyce Chai
LRM
75
4
0
24 Oct 2023
GenKIE: Robust Generative Multimodal Document Key Information Extraction
GenKIE: Robust Generative Multimodal Document Key Information Extraction
Panfeng Cao
Ye Wang
Qiang Zhang
Zaiqiao Meng
SyDa
82
7
0
24 Oct 2023
Locally Differentially Private Document Generation Using Zero Shot
  Prompting
Locally Differentially Private Document Generation Using Zero Shot Prompting
Saiteja Utpala
Sara Hooker
Pin-Yu Chen
53
39
0
24 Oct 2023
Contrastive Learning-based Sentence Encoders Implicitly Weight
  Informative Words
Contrastive Learning-based Sentence Encoders Implicitly Weight Informative Words
Hiroto Kurita
Goro Kobayashi
Sho Yokoi
Kentaro Inui
62
4
0
24 Oct 2023
Is Probing All You Need? Indicator Tasks as an Alternative to Probing
  Embedding Spaces
Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces
Tal Levy
Omer Goldman
Reut Tsarfaty
111
3
0
24 Oct 2023
Density of States Prediction of Crystalline Materials via Prompt-guided
  Multi-Modal Transformer
Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer
Namkyeong Lee
Heewoong Noh
Sungwon Kim
Dongmin Hyun
Gyoung S. Na
Chanyoung Park
54
6
0
24 Oct 2023
Rosetta Stone at KSAA-RD Shared Task: A Hop From Language Modeling To
  Word--Definition Alignment
Rosetta Stone at KSAA-RD Shared Task: A Hop From Language Modeling To Word--Definition Alignment
Ahmed ElBakry
Mohamed Gabr
Muhammad N. ElNokrashy
Badr AlKhamissi
42
3
0
24 Oct 2023
Towards Automated Recipe Genre Classification using Semi-Supervised
  Learning
Towards Automated Recipe Genre Classification using Semi-Supervised Learning
Nazmus Sakib
G. M. Shahariar
Mohsinul Kabir
Md. Kamrul Hasan
H. Mahmud
25
1
0
24 Oct 2023
Expression Syntax Information Bottleneck for Math Word Problems
Expression Syntax Information Bottleneck for Math Word Problems
Jing Xiong
Chengming Li
Min Yang
Xiping Hu
Bin Hu
65
5
0
24 Oct 2023
Confounder Balancing in Adversarial Domain Adaptation for Pre-Trained
  Large Models Fine-Tuning
Confounder Balancing in Adversarial Domain Adaptation for Pre-Trained Large Models Fine-Tuning
Shuoran Jiang
Qingcai Chen
Yang Xiang
Youcheng Pan
Xiangping Wu
AI4CE
126
1
0
24 Oct 2023
A Survey on Detection of LLMs-Generated Content
A Survey on Detection of LLMs-Generated Content
Xianjun Yang
Liangming Pan
Xuandong Zhao
Haifeng Chen
Linda R. Petzold
William Y. Wang
Wei Cheng
DeLMO
102
55
0
24 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme
  Large Language Model Compression
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
70
4
0
24 Oct 2023
Improving Language Models Meaning Understanding and Consistency by
  Learning Conceptual Roles from Dictionary
Improving Language Models Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary
Myeongjun Jang
Thomas Lukasiewicz
62
5
0
24 Oct 2023
Previous
123...737475...214215216
Next