ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,764 papers shown
Title
AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented
  Generative Approach
AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented Generative Approach
Jianwen Guo
Liying Cheng
Wenxuan Zhang
Stanley Kok
Xin Li
Lidong Bing
48
9
0
31 May 2023
Guiding Computational Stance Detection with Expanded Stance Triangle
  Framework
Guiding Computational Stance Detection with Expanded Stance Triangle Framework
Zhengyuan Liu
Yong Keong Yap
Hai Leong Chieu
Nancy F. Chen
55
6
0
31 May 2023
Knowledge Base Question Answering for Space Debris Queries
Knowledge Base Question Answering for Space Debris Queries
Paul Darm
Antonio Valerio Miceli Barone
Shay B. Cohen
A. Riccardi
70
1
0
31 May 2023
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations
  for Text-to-Speech
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech
L. T. Nguyen
Thinh-Le-Gia Pham
Dat Quoc Nguyen
98
14
0
31 May 2023
What does the Failure to Reason with "Respectively" in Zero/Few-Shot
  Settings Tell Us about Language Models?
What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?
Ruixiang Cui
Seolhwa Lee
Daniel Hershcovich
Anders Søgaard
50
2
0
31 May 2023
SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with
  BERT
SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT
Aditya Yadavalli
Alekhya Yadavalli
Vera Tobin
103
7
0
31 May 2023
Large Language Models Are Not Strong Abstract Reasoners
Large Language Models Are Not Strong Abstract Reasoners
Gaël Gendron
Qiming Bao
Michael Witbrock
Gillian Dobbie
ELMLRM
127
37
0
31 May 2023
Exploring Lottery Prompts for Pre-trained Language Models
Exploring Lottery Prompts for Pre-trained Language Models
Yulin Chen
Ning Ding
Xiaobin Wang
Shengding Hu
Haitao Zheng
Zhiyuan Liu
Pengjun Xie
VLMLRM
52
7
0
31 May 2023
Towards Flow Graph Prediction of Open-Domain Procedural Texts
Towards Flow Graph Prediction of Open-Domain Procedural Texts
Keisuke Shirai
Hirotaka Kameko
Shinsuke Mori
67
1
0
31 May 2023
BotArtist: Generic approach for bot detection in Twitter via semi-automatic machine learning pipeline
BotArtist: Generic approach for bot detection in Twitter via semi-automatic machine learning pipeline
Alexander Shevtsov
D. Antonakaki
Ioannis Lamprou
Polyvios Pratikakis
Sotiris Ioannidis
172
0
0
31 May 2023
ScoNe: Benchmarking Negation Reasoning in Language Models With
  Fine-Tuning and In-Context Learning
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning
Jingyuan Selena She
Christopher Potts
Sam Bowman
Atticus Geiger
94
15
0
30 May 2023
Examining risks of racial biases in NLP tools for child protective
  services
Examining risks of racial biases in NLP tools for child protective services
Anjalie Field
Amanda Coston
Nupoor Gandhi
Alexandra Chouldechova
Emily Putnam-Hornstein
David Steier
Yulia Tsvetkov
92
14
0
30 May 2023
DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative
  Modeling
DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling
Yuchen Zhuang
Yue Yu
Lingkai Kong
Xiang Chen
Chao Zhang
NoLaSyDaAI4CE
110
13
0
30 May 2023
Blockwise Parallel Transformer for Large Context Models
Blockwise Parallel Transformer for Large Context Models
Hao Liu
Pieter Abbeel
77
11
0
30 May 2023
infoVerse: A Universal Framework for Dataset Characterization with
  Multidimensional Meta-information
infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information
Jaehyung Kim
Yekyung Kim
Karin de Langis
Jinwoo Shin
Dongyeop Kang
52
1
0
30 May 2023
Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private
  Tuning
Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private Tuning
Umang Gupta
Aram Galstyan
Greg Ver Steeg
53
2
0
30 May 2023
Preserving Pre-trained Features Helps Calibrate Fine-tuned Language
  Models
Preserving Pre-trained Features Helps Calibrate Fine-tuned Language Models
Guande He
Jianfei Chen
Jun Zhu
94
22
0
30 May 2023
Controlled Text Generation with Hidden Representation Transformations
Controlled Text Generation with Hidden Representation Transformations
Vaibhav Kumar
H. Koorehdavoudi
Masud Moshtaghi
Amita Misra
Ankit Chadha
Emilio Ferrara
62
3
0
30 May 2023
Together We Make Sense -- Learning Meta-Sense Embeddings from Pretrained
  Static Sense Embeddings
Together We Make Sense -- Learning Meta-Sense Embeddings from Pretrained Static Sense Embeddings
Haochen Luo
Yi Zhou
Danushka Bollegala
SSL
69
1
0
30 May 2023
Event-Centric Query Expansion in Web Search
Event-Centric Query Expansion in Web Search
Yanan Zhang
Weijie Cui
Yangfan Zhang
Xiaoling Bai
Zhe Zhang
Jin Ma
Xinyu Chen
Tianhua Zhou
69
2
0
30 May 2023
Document-Level Multi-Event Extraction with Event Proxy Nodes and
  Hausdorff Distance Minimization
Document-Level Multi-Event Extraction with Event Proxy Nodes and Hausdorff Distance Minimization
Xinyu Wang
Lin Gui
Yulan He
51
8
0
30 May 2023
Fighting Bias with Bias: Promoting Model Robustness by Amplifying
  Dataset Biases
Fighting Bias with Bias: Promoting Model Robustness by Amplifying Dataset Biases
Yuval Reif
Roy Schwartz
78
7
0
30 May 2023
PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language
  Models
PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models
Zhuocheng Gong
Jiahao Liu
Qifan Wang
Yang Yang
Jingang Wang
Wei Wu
Yunsen Xian
Dongyan Zhao
Rui Yan
MQ
72
5
0
30 May 2023
Shuo Wen Jie Zi: Rethinking Dictionaries and Glyphs for Chinese Language
  Pre-training
Shuo Wen Jie Zi: Rethinking Dictionaries and Glyphs for Chinese Language Pre-training
Yuxuan Wang
Jianghui Wang
Dongyan Zhao
Zilong Zheng
61
5
0
30 May 2023
Graph Reasoning for Question Answering with Triplet Retrieval
Graph Reasoning for Question Answering with Triplet Retrieval
Shiyang Li
Yifan Gao
Hao Jiang
Qingyu Yin
Zheng Li
Xifeng Yan
Chao Zhang
Bing Yin
RALMReLM
93
35
0
30 May 2023
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training
  for Document Understanding
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Yi Tu
Ya Guo
Huan Chen
Jinyang Tang
64
15
0
30 May 2023
Representation Of Lexical Stylistic Features In Language Models'
  Embedding Space
Representation Of Lexical Stylistic Features In Language Models' Embedding Space
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
85
7
0
29 May 2023
Check-COVID: Fact-Checking COVID-19 News Claims with Scientific Evidence
Check-COVID: Fact-Checking COVID-19 News Claims with Scientific Evidence
Gengyu Wang
Kate Harwood
Lawrence Chillrud
Amith Ananthram
Melanie Subbiah
Kathleen McKeown
HILM
79
24
0
29 May 2023
Beyond Confidence: Reliable Models Should Also Consider Atypicality
Beyond Confidence: Reliable Models Should Also Consider Atypicality
Mert Yuksekgonul
Linjun Zhang
James Zou
Carlos Guestrin
101
22
0
29 May 2023
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive
  Prompt-Based Few-Shot Fine-Tuning
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning
Amirhossein Abaskohi
S. Rothe
Yadollah Yaghoobzadeh
VLM
97
18
0
29 May 2023
Multiscale Positive-Unlabeled Detection of AI-Generated Texts
Multiscale Positive-Unlabeled Detection of AI-Generated Texts
Yuchuan Tian
Hanting Chen
Xutao Wang
Zheyuan Bai
Qinghua Zhang
Ruifeng Li
Chaoxi Xu
Yunhe Wang
DeLMO
116
47
0
29 May 2023
From Adversarial Arms Race to Model-centric Evaluation: Motivating a
  Unified Automatic Robustness Evaluation Framework
From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Lifan Yuan
Dehan Kong
...
Longtao Huang
H. Xue
Zhiyuan Liu
Maosong Sun
Heng Ji
AAMLELM
101
6
0
29 May 2023
The Utility of Large Language Models and Generative AI for Education
  Research
The Utility of Large Language Models and Generative AI for Education Research
Andrew Katz
Umair Shakir
B. Chambers
AI4CE
68
6
0
29 May 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark
  Datasets
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Shafiq Joty
J. Huang
LM&MAELMALM
125
193
0
29 May 2023
Abstractive Summarization as Augmentation for Document-Level Event
  Detection
Abstractive Summarization as Augmentation for Document-Level Event Detection
Janko Vidaković
Filip Karlo Dosilovic
Domagoj Pluscec
28
0
0
29 May 2023
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Jia-Bin Huang
Yi Ren
Rongjie Huang
Dongchao Yang
Zhenhui Ye
Chen Zhang
Jinglin Liu
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
120
64
0
29 May 2023
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
Amirhossein Layegh
A. H. Payberah
A. Soylu
Dumitru Roman
M. Matskin
VLM
80
8
0
29 May 2023
Test-Time Training on Nearest Neighbors for Large Language Models
Test-Time Training on Nearest Neighbors for Large Language Models
Moritz Hardt
Yu Sun
VLMRALM
128
30
0
29 May 2023
Efficient Storage of Fine-Tuned Models via Low-Rank Approximation of
  Weight Residuals
Efficient Storage of Fine-Tuned Models via Low-Rank Approximation of Weight Residuals
Simo Ryu
S. Seo
Jaejun Yoo
87
8
0
28 May 2023
A Quantitative Review on Language Model Efficiency Research
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
76
0
0
28 May 2023
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule
  Zero-Shot Learning
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning
Haiteng Zhao
Shengchao Liu
Chang Ma
Hannan Xu
Jie Fu
Zhihong Deng
Lingpeng Kong
Qi Liu
94
65
0
28 May 2023
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Kathleen McKeown
Noémie Elhadad
84
11
0
28 May 2023
Mitigating Label Biases for In-context Learning
Mitigating Label Biases for In-context Learning
Yu Fei
Buse Giledereli
Zeming Chen
Antoine Bosselut
103
76
0
28 May 2023
Whitening-based Contrastive Learning of Sentence Embeddings
Whitening-based Contrastive Learning of Sentence Embeddings
Wenjie Zhuo
Yifan Sun
Xiaohan Wang
Linchao Zhu
Yezhou Yang
59
21
0
28 May 2023
Learning a Structural Causal Model for Intuition Reasoning in
  Conversation
Learning a Structural Causal Model for Intuition Reasoning in Conversation
Hang Chen
Bingyu Liao
Jing Luo
Wenjing Zhu
Xinyu Yang
LRM
119
13
0
28 May 2023
Rethinking Masked Language Modeling for Chinese Spelling Correction
Rethinking Masked Language Modeling for Chinese Spelling Correction
Hongqiu Wu
Shaohua Zhang
Yuchen Zhang
Hai Zhao
82
30
0
28 May 2023
LLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly
  Transformers
LLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly Transformers
Xuanqing Liu
Zhuotao Liu
97
23
0
28 May 2023
Plug-and-Play Knowledge Injection for Pre-trained Language Models
Plug-and-Play Knowledge Injection for Pre-trained Language Models
Zhengyan Zhang
Zhiyuan Zeng
Yankai Lin
Huadong Wang
Deming Ye
...
Xu Han
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
KELM
94
12
0
28 May 2023
One Network, Many Masks: Towards More Parameter-Efficient Transfer
  Learning
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning
Guangtao Zeng
Peiyuan Zhang
Wei Lu
95
22
0
28 May 2023
Decoding the Underlying Meaning of Multimodal Hateful Memes
Decoding the Underlying Meaning of Multimodal Hateful Memes
Ming Shan Hee
Wen-Haw Chong
Roy Ka-wei Lee
89
43
0
28 May 2023
Previous
123...9899100...214215216
Next