RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 9,167 papers shown
DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in Text
Shuaiyi Li
Yang Deng
Wai Lam
58
2
0
19 Oct 2023
Towards Anytime Fine-tuning: Continually Pre-trained Language Models with Hypernetwork Prompt
Gangwei Jiang
Caigao Jiang
Siqiao Xue
James Y. Zhang
Junqing Zhou
Defu Lian
Ying Wei
VLM
51
7
0
19 Oct 2023
Contrastive Learning for Inference in Dialogue
Etsuko Ishii
Yan Xu
Bryan Wilie
Ziwei Ji
Holy Lovenia
Willy Chung
Pascale Fung
40
0
0
19 Oct 2023
MTS-LOF: Medical Time-Series Representation Learning via Occlusion-Invariant Features
Huayu Li
Ana S. Carreon-Rascon
Xiwen Chen
Geng Yuan
Ao Li
AI4TS
19
5
0
19 Oct 2023
A Read-and-Select Framework for Zero-shot Entity Linking
Zhenran Xu
Yulin Chen
Baotian Hu
Min Zhang
44
5
0
19 Oct 2023
Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer
Qingru Zhang
Dhananjay Ram
Cole Hawkins
Sheng Zha
Tuo Zhao
60
15
0
19 Oct 2023
Automated Repair of Declarative Software Specifications in the Era of Large Language Models
Md Rashedul Hasan
Jiawei Li
Iftekhar Ahmed
Hamid Bagheri
55
2
0
19 Oct 2023
Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language Understanding
Jianing Wang
Qiushi Sun
Nuo Chen
Chengyu Wang
Jun Huang
Ming Gao
Xiang Li
UQLM
46
3
0
19 Oct 2023
Solving Hard Analogy Questions with Relation Embedding Chains
Nitesh Kumar
Steven Schockaert
39
1
0
18 Oct 2023
SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Mohammadreza Salehi
Sachin Mehta
Aditya Kusupati
Ali Farhadi
Hannaneh Hajishirzi
69
5
0
18 Oct 2023
CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation
Philipp Borchert
Jochen De Weerdt
Kristof Coussement
Arno De Caigny
Marie-Francine Moens
43
1
0
18 Oct 2023
DesignQuizzer: A Community-Powered Conversational Agent for Learning Visual Design
Zhenhui Peng
Qiaoyi Chen
Zhiyu Shen
Xiaojuan Ma
Antti Oulasvirta
29
5
0
18 Oct 2023
Gold: A Global and Local-aware Denoising Framework for Commonsense Knowledge Graph Noise Detection
Zheye Deng
Weiqi Wang
Zhaowei Wang
Xin Liu
Yangqiu Song
38
9
0
18 Oct 2023
Investigating semantic subspaces of Transformer sentence embeddings through linear structural probing
Dmitry Nikolaev
Sebastian Padó
56
5
0
18 Oct 2023
Rather a Nurse than a Physician -- Contrastive Explanations under Investigation
Oliver Eberle
Ilias Chalkidis
Laura Cabello
Stephanie Brandl
36
9
0
18 Oct 2023
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
Caoyun Fan
Jidong Tian
Yitian Li
Wenqing Chen
Hao He
Yaohui Jin
LRM
37
3
0
18 Oct 2023
Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets
Su ah Lee
Seokjin Oh
Woohwan Jung
41
3
0
18 Oct 2023
Learning Co-Speech Gesture for Multimodal Aphasia Type Detection
Daeun Lee
Sejung Son
Hyolim Jeon
Seungbae Kim
Jinyoung Han
29
3
0
18 Oct 2023
Learning under Label Proportions for Text Classification
Jatin Chauhan
Xiaoxuan Wang
Wei Wang
35
1
0
18 Oct 2023
Open-ended Commonsense Reasoning with Unrestricted Answer Scope
Chen Ling
Xuchao Zhang
Xujiang Zhao
Yanchi Liu
Wei Cheng
Mika Oishi
Takao Osaki
Katsushi Matsuda
Haifeng Chen
Liang Zhao
ReLM
LRM
34
1
0
18 Oct 2023
Field-testing items using artificial intelligence: Natural language processing with transformers
Hotaka Maeda
14
2
0
18 Oct 2023
VeRA: Vector-based Random Matrix Adaptation
D. J. Kopiczko
Tijmen Blankevoort
Yuki Markus Asano
VLM
41
138
0
17 Oct 2023
Neural Attention: Enhancing QKV Calculation in Self-Attention Mechanism with Neural Networks
Muhan Zhang
11
1
0
17 Oct 2023
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations
Yazhou Zhang
Mengyao Wang
Youxi Wu
Prayag Tiwari
Qiuchi Li
Benyou Wang
Jing Qin
63
23
0
17 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
53
0
0
17 Oct 2023
QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering
Haochen Shi
Weiqi Wang
Tianqing Fang
Baixuan Xu
Wenxuan Ding
Xin Liu
Yangqiu Song
74
7
0
17 Oct 2023
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation
Jaap Jumelet
Michael Hanna
Marianne de Heer Kloots
Anna Langedijk
Charlotte Pouw
Oskar van der Wal
34
3
0
17 Oct 2023
Entity Matching using Large Language Models
Ralph Peeters
Christian Bizer
43
13
0
17 Oct 2023
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Shiyuan Huang
Siddarth Mamidanna
Shreedhar Jangam
Yilun Zhou
Leilani H. Gilpin
LRM
MILM
ELM
66
68
0
17 Oct 2023
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing
Quoc-Nam Nguyen
Thang Chau Phan
Duc-Vu Nguyen
Kiet Van Nguyen
33
8
0
17 Oct 2023
H2O Open Ecosystem for State-of-the-art Large Language Models
Arno Candel
Jon McKinney
Philipp Singer
Pascal Pfeiffer
Maximilian Jeblick
Chun Ming Lee
Marcos V. Conde
VLM
30
4
0
17 Oct 2023
Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models
Yilmazcan Ozyurt
Stefan Feuerriegel
Ce Zhang
51
1
0
17 Oct 2023
Understanding writing style in social media with a supervised contrastively pre-trained transformer
Javier Huertas-Tato
Alejandro Martín
David Camacho
23
4
0
17 Oct 2023
Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation
S. Nigam
Aniket Deroy
Noel Shallum
Ayush Kumar Mishra
Anup Roy
Shubham Kumar Mishra
Arnab Bhattacharya
Saptarshi Ghosh
Kripabandhu Ghosh
AILaw
ELM
28
10
0
17 Oct 2023
Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation
Tomohito Kasahara
Daisuke Kawahara
51
2
0
17 Oct 2023
Correction Focused Language Model Training for Speech Recognition
Yingyi Ma
Zhe Liu
Ozlem Kalinli
KELM
53
3
0
17 Oct 2023
Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning
Huiming Wang
Zhaodonghui Li
Liying Cheng
De Wen Soh
Lidong Bing
53
2
0
17 Oct 2023
A State-Vector Framework for Dataset Effects
E. Sahak
Zining Zhu
Frank Rudzicz
38
1
0
17 Oct 2023
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks
Jiaying Wu
Bryan Hooi
48
56
0
16 Oct 2023
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
Ryan Shea
Zhou Yu
OffRL
47
7
0
16 Oct 2023
G-SPEED: General SParse Efficient Editing MoDel
Haoke Zhang
Yue Wang
Juntao Li
Xiabing Zhou
Min Zhang
SyDa
KELM
35
1
0
16 Oct 2023
Privacy in Large Language Models: Attacks, Defenses and Future Directions
Haoran Li
Yulin Chen
Jinglong Luo
Yan Kang
Xiaojin Zhang
Qi Hu
Chunkit Chan
Yangqiu Song
PILM
55
42
0
16 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
43
5
0
16 Oct 2023
Decomposed Prompt Tuning via Low-Rank Reparameterization
Yao Xiao
Lu Xu
Jiaxi Li
Wei Lu
Xiaoli Li
VLM
33
6
0
16 Oct 2023
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi
Raquel Fernández
Arianna Bisazza
KELM
HILM
53
64
0
16 Oct 2023
FiLM: Fill-in Language Models for Any-Order Generation
Tianxiao Shen
Hao-Chun Peng
Ruoqi Shen
Yao Fu
Zaïd Harchaoui
Yejin Choi
46
8
0
15 Oct 2023
Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia
Dimitris Gkoumas
Matthew Purver
Maria Liakata
36
2
0
15 Oct 2023
Rethinking Relation Classification with Graph Meaning Representations
Li Zhou
Wenyu Chen
DingYi Zeng
Hong Qu
Daniel Hershcovich
AI4CE
30
0
0
15 Oct 2023
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer
Boan Liu
Liang Ding
Li Shen
Keqin Peng
Yu Cao
Dazhao Cheng
Dacheng Tao
MoE
41
7
0
15 Oct 2023
EX-FEVER: A Dataset for Multi-hop Explainable Fact Verification
Huanhuan Ma
Weizhi Xu
Yifan Wei
Liuji Chen
Liang Wang
Qiang Liu
Shu Wu
Liang Wang
37
15
0
15 Oct 2023