ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,802 papers shown
Title
SocialDial: A Benchmark for Socially-Aware Dialogue Systems
SocialDial: A Benchmark for Socially-Aware Dialogue Systems
Haolan Zhan
Zhuang Li
Yufei Wang
Linhao Luo
Tao Feng
...
Lay-Ki Soon
Suraj Sharma
Ingrid Zukerman
Zhaleh Semnani Azad
Gholamreza Haffari
123
17
0
24 Apr 2023
CHEAT: A Large-scale Dataset for Detecting ChatGPT-writtEn AbsTracts
CHEAT: A Large-scale Dataset for Detecting ChatGPT-writtEn AbsTracts
Peipeng Yu
Jiahan Chen
Xuan Feng
Zhihua Xia
175
45
0
24 Apr 2023
KInITVeraAI at SemEval-2023 Task 3: Simple yet Powerful Multilingual
  Fine-Tuning for Persuasion Techniques Detection
KInITVeraAI at SemEval-2023 Task 3: Simple yet Powerful Multilingual Fine-Tuning for Persuasion Techniques Detection
Timo Hromadka
Timotej Smolen
T. Remiš
Branislav Pecher
Ivan Srba
48
11
0
24 Apr 2023
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis
  [Experiment, Analysis & Benchmark]
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis [Experiment, Analysis & Benchmark]
Alexandros Zeakis
G. Papadakis
Dimitrios Skoutas
Manolis Koubarakis
78
39
0
24 Apr 2023
Text-to-Audio Generation using Instruction-Tuned LLM and Latent
  Diffusion Model
Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Deepanway Ghosal
Navonil Majumder
Ambuj Mehrish
Soujanya Poria
234
152
0
24 Apr 2023
PARAGRAPH2GRAPH: A GNN-based framework for layout paragraph analysis
PARAGRAPH2GRAPH: A GNN-based framework for layout paragraph analysis
Shuyong Wei
Nuo Xu
51
5
0
24 Apr 2023
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment
  of Performance, Explainability, Calibration, and Faithfulness
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Bo Li
Gexiang Fang
Yang Yang
Quansen Wang
Wei Ye
Wen Zhao
Shikun Zhang
ELMAI4MH
138
168
0
23 Apr 2023
Differentiate ChatGPT-generated and Human-written Medical Texts
Differentiate ChatGPT-generated and Human-written Medical Texts
Wenxiong Liao
Zheng Liu
Haixing Dai
Shaochen Xu
Zihao Wu
...
Xiaoke Huang
Dajiang Zhu
Hongmin Cai
Tianming Liu
Xiang Li
LM&MADeLMOMedImAI4MH
62
60
0
23 Apr 2023
Graph Neural Networks for Text Classification: A Survey
Graph Neural Networks for Text Classification: A Survey
Kunze Wang
Yihao Ding
S. Han
FaMLGNN
99
29
0
23 Apr 2023
Detecting Spoilers in Movie Reviews with External Movie Knowledge and
  User Networks
Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks
Heng Wang
Wenqian Zhang
Yuyang Bai
Zhaoxuan Tan
Shangbin Feng
Qinghua Zheng
Minnan Luo
80
4
0
22 Apr 2023
SAILER: Structure-aware Pre-trained Language Model for Legal Case
  Retrieval
SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval
Haitao Li
Qingyao Ai
Jia Chen
Qian Dong
Yueyue Wu
Yu-an Liu
C. L. Philip Chen
Qi Tian
AILawELMRALM
80
79
0
22 Apr 2023
CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic
  Music Information Retrieval
CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval
Shangda Wu
Dingyao Yu
Xu Tan
Maosong Sun
CLIPVLM
76
15
0
21 Apr 2023
LEIA: Linguistic Embeddings for the Identification of Affect
LEIA: Linguistic Embeddings for the Identification of Affect
S. Aroyehun
Lukas Malik
Hannah Metzler
Nikolas Haimerl
Anna Flavia Di Natale
David Garcia
35
3
0
21 Apr 2023
Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs
Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs
Costas Mavromatis
V. Ioannidis
Shen Wang
Da Zheng
Soji Adeshina
Jun Ma
Han Zhao
Christos Faloutsos
George Karypis
79
31
0
20 Apr 2023
Word Sense Induction with Knowledge Distillation from BERT
Word Sense Induction with Knowledge Distillation from BERT
Anik Saha
Alex Gittens
B. Yener
51
1
0
20 Apr 2023
MarsEclipse at SemEval-2023 Task 3: Multi-Lingual and Multi-Label
  Framing Detection with Contrastive Learning
MarsEclipse at SemEval-2023 Task 3: Multi-Lingual and Multi-Label Framing Detection with Contrastive Learning
Qisheng Liao
Meiting Lai
Preslav Nakov
VLM
43
10
0
20 Apr 2023
Domain-specific Continued Pretraining of Language Models for Capturing
  Long Context in Mental Health
Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Shaoxiong Ji
Tianlin Zhang
Kailai Yang
Sophia Ananiadou
Min Zhang
Jörg Tiedemann
AI4MHALM
86
29
0
20 Apr 2023
GPT-NER: Named Entity Recognition via Large Language Models
GPT-NER: Named Entity Recognition via Large Language Models
Shuhe Wang
Xiaofei Sun
Xiaoya Li
Rongbin Ouyang
Leilei Gan
Tianwei Zhang
Jiwei Li
Guoyin Wang
108
202
0
20 Apr 2023
Interventional Probing in High Dimensions: An NLI Case Study
Interventional Probing in High Dimensions: An NLI Case Study
Julia Rozanova
Marco Valentino
Lucas C. Cordeiro
André Freitas
45
7
0
20 Apr 2023
Is Cross-modal Information Retrieval Possible without Training?
Is Cross-modal Information Retrieval Possible without Training?
Hyunjin Choi
HyunJae Lee
Seongho Joe
Youngjune Gwon
49
1
0
20 Apr 2023
SemEval 2023 Task 6: LegalEval - Understanding Legal Texts
SemEval 2023 Task 6: LegalEval - Understanding Legal Texts
Ashutosh Modi
Prathamesh Kalamkar
S. Karn
Aman Tiwari
Abhinav Joshi
Sai Kiran Tanikella
S. Guha
Sachin Malhan
Vivek Raghavan
ELMAILaw
55
42
0
19 Apr 2023
EC^2: Emergent Communication for Embodied Control
EC^2: Emergent Communication for Embodied Control
Yao Mu
Shunyu Yao
Mingyu Ding
Ping Luo
Chuang Gan
LM&Ro
79
20
0
19 Apr 2023
Hyperbolic Image-Text Representations
Hyperbolic Image-Text Representations
Karan Desai
Maximilian Nickel
Tanmay Rajpurohit
Justin Johnson
Ramakrishna Vedantam
VLM
109
67
0
18 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by
  equivalent and optimal shifting and scaling
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Xiuying Wei
Yunchen Zhang
Yuhang Li
Xiangguo Zhang
Ruihao Gong
Jian Ren
Zhengang Li
MQ
78
36
0
18 Apr 2023
Exploring the Trade-Offs: Unified Large Language Models vs Local
  Fine-Tuned Models for Highly-Specific Radiology NLI Task
Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task
Zihao Wu
Lu Zhang
Chao-Yang Cao
Xiao-Xing Yu
Haixing Dai
...
Quanzheng Li
Dinggang Shen
Xiang Li
Dajiang Zhu
Tianming Liu
LM&MA
66
39
0
18 Apr 2023
Revisiting k-NN for Fine-tuning Pre-trained Language Models
Revisiting k-NN for Fine-tuning Pre-trained Language Models
Lei Li
Jing Chen
Bo Tian
Ning Zhang
63
1
0
18 Apr 2023
D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence
  Embeddings
D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence Embeddings
HyunJae Lee
VLM
52
0
0
18 Apr 2023
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised
  Learning
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning
Zheng Lian
Haiyang Sun
Guoying Zhao
Kang Chen
Mingyu Xu
...
Meng Wang
Min Zhang
Guoying Zhao
Björn W. Schuller
Jianhua Tao
96
51
0
18 Apr 2023
Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to
  Fine-Tune and Hard to Detect with other LLMs
Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs
Da Silva Gameiro Henrique
Andrei Kucharavy
R. Guerraoui
DeLMO
83
8
0
18 Apr 2023
From Words to Music: A Study of Subword Tokenization Techniques in
  Symbolic Music Generation
From Words to Music: A Study of Subword Tokenization Techniques in Symbolic Music Generation
Adarsh Kumar
Pedro Sarmento
73
4
0
18 Apr 2023
Transfer to a Low-Resource Language via Close Relatives: The Case Study
  on Faroese
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese
Vésteinn Snaebjarnarson
A. Simonsen
Goran Glavaš
Ivan Vulić
84
23
0
18 Apr 2023
A Survey for Biomedical Text Summarization: From Pre-trained to Large
  Language Models
A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models
Qianqian Xie
Zheheng Luo
Benyou Wang
Sophia Ananiadou
LM&MAVLM
70
11
0
18 Apr 2023
A Two-Stage Framework with Self-Supervised Distillation For Cross-Domain
  Text Classification
A Two-Stage Framework with Self-Supervised Distillation For Cross-Domain Text Classification
Yunlong Feng
Bohan Li
Libo Qin
Xiao Xu
Wanxiang Che
46
3
0
18 Apr 2023
HeRo: RoBERTa and Longformer Hebrew Language Models
HeRo: RoBERTa and Longformer Hebrew Language Models
Vitaly Shalumov
Harel Haskey
VLM
96
7
0
18 Apr 2023
Classification of US Supreme Court Cases using BERT-Based Techniques
Classification of US Supreme Court Cases using BERT-Based Techniques
Shubham Vatsal
Adam Meyers
J. Ortega
ELMAILaw
53
3
0
17 Apr 2023
An Unbiased Transformer Source Code Learning with Semantic Vulnerability
  Graph
An Unbiased Transformer Source Code Learning with Semantic Vulnerability Graph
Nafis Tanveer Islam
G. Parra
Dylan Manuel
E. Bou-Harb
Peyman Najafirad
85
10
0
17 Apr 2023
Improving Autoregressive NLP Tasks via Modular Linearized Attention
Improving Autoregressive NLP Tasks via Modular Linearized Attention
Victor Agostinelli
Lizhong Chen
60
1
0
17 Apr 2023
LED: A Dataset for Life Event Extraction from Dialogs
LED: A Dataset for Life Event Extraction from Dialogs
Yi-Pei Chen
An-Zi Yen
Hen-Hsen Huang
Hideki Nakayama
Hsin-Hsi Chen
8
4
0
17 Apr 2023
Context-Dependent Embedding Utterance Representations for Emotion
  Recognition in Conversations
Context-Dependent Embedding Utterance Representations for Emotion Recognition in Conversations
Patrícia Pereira
Helena Moniz
Isabel Dias
Joao Paulo Carvalho
79
9
0
17 Apr 2023
VECO 2.0: Cross-lingual Language Model Pre-training with
  Multi-granularity Contrastive Learning
VECO 2.0: Cross-lingual Language Model Pre-training with Multi-granularity Contrastive Learning
Zhen-Ru Zhang
Chuanqi Tan
Songfang Huang
Fei Huang
VLM
64
5
0
17 Apr 2023
SkillGPT: a RESTful API service for skill extraction and standardization
  using a Large Language Model
SkillGPT: a RESTful API service for skill extraction and standardization using a Large Language Model
Nan Li
Bo Kang
T. D. Bie
64
15
0
17 Apr 2023
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Juan Pablo Zuluaga
Amrutha Prasad
Iuliia Nigmatulina
P. Motlícek
Matthias Kleinert
67
23
0
16 Apr 2023
It's All in the Embedding! Fake News Detection Using Document Embeddings
It's All in the Embedding! Fake News Detection Using Document Embeddings
Ciprian-Octavian Truică
Elena Simona Apostol
85
51
0
16 Apr 2023
MisRoBÆRTa: Transformers versus Misinformation
MisRoBÆRTa: Transformers versus Misinformation
Ciprian-Octavian Truică
Elena Simona Apostol
66
39
0
16 Apr 2023
Permutation Equivariance of Transformers and Its Applications
Permutation Equivariance of Transformers and Its Applications
Hengyuan Xu
Liyao Xiang
Hang Ye
Dixi Yao
Pengzhi Chu
Baochun Li
56
15
0
16 Apr 2023
ArguGPT: evaluating, understanding and identifying argumentative essays
  generated by GPT models
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models
Yikang Liu
Ziyin Zhang
Wanyang Zhang
Shisen Yue
Xiaojing Zhao
Xinyuan Cheng
Yiwen Zhang
Hai Hu
DeLMO
103
55
0
16 Apr 2023
The Self-Perception and Political Biases of ChatGPT
The Self-Perception and Political Biases of ChatGPT
Jérôme Rutinowski
Sven Franke
Jan Endendyk
Ina Dormuth
Markus Pauly
106
104
0
14 Apr 2023
OPI at SemEval 2023 Task 9: A Simple But Effective Approach to
  Multilingual Tweet Intimacy Analysis
OPI at SemEval 2023 Task 9: A Simple But Effective Approach to Multilingual Tweet Intimacy Analysis
Slawomir Dadas
57
2
0
14 Apr 2023
Keeping the Questions Conversational: Using Structured Representations
  to Resolve Dependency in Conversational Question Answering
Keeping the Questions Conversational: Using Structured Representations to Resolve Dependency in Conversational Question Answering
Munazza Zaib
Quan Z. Sheng
W. Zhang
A. Mahmood
80
2
0
14 Apr 2023
Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9
  and DSTC10
Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10
David Thulke
Nico Daheim
Christian Dugast
Hermann Ney
83
7
0
14 Apr 2023
Previous
123...110111112...215216217
Next