ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXivPDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 4,753 papers shown
Title
Dependency Parsing as MRC-based Span-Span Prediction
Dependency Parsing as MRC-based Span-Span Prediction
Leilei Gan
Yuxian Meng
Kun Kuang
Xiaofei Sun
Chun Fan
Fei Wu
Jiwei Li
40
21
0
17 May 2021
Sentence Similarity Based on Contexts
Sentence Similarity Based on Contexts
Xiaofei Sun
Yuxian Meng
Xiang Ao
Fei Wu
Tianwei Zhang
Jiwei Li
Chun Fan
28
29
0
17 May 2021
Doc2Dict: Information Extraction as Text Generation
Doc2Dict: Information Extraction as Text Generation
Benjamin Townsend
Eamon Ito-Fisher
Lily Zhang
Madison May
28
7
0
16 May 2021
How is BERT surprised? Layerwise detection of linguistic anomalies
How is BERT surprised? Layerwise detection of linguistic anomalies
Bai Li
Zining Zhu
Guillaume Thomas
Yang Xu
Frank Rudzicz
27
31
0
16 May 2021
BERT Busters: Outlier Dimensions that Disrupt Transformers
BERT Busters: Outlier Dimensions that Disrupt Transformers
Olga Kovaleva
Saurabh Kulshreshtha
Anna Rogers
Anna Rumshisky
27
85
0
14 May 2021
Out-of-Manifold Regularization in Contextual Embedding Space for Text
  Classification
Out-of-Manifold Regularization in Contextual Embedding Space for Text Classification
Seonghyeon Lee
Dongha Lee
Hwanjo Yu
24
4
0
14 May 2021
Video Corpus Moment Retrieval with Contrastive Learning
Video Corpus Moment Retrieval with Contrastive Learning
Hao Zhang
Aixin Sun
Wei Jing
Guoshun Nan
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
46
81
0
13 May 2021
VSR: A Unified Framework for Document Layout Analysis combining Vision,
  Semantics and Relations
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations
Peng Zhang
Can Li
Liang Qiao
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Fei Wu
31
57
0
13 May 2021
Designing Multimodal Datasets for NLP Challenges
Designing Multimodal Datasets for NLP Challenges
James Pustejovsky
E. Holderness
Jingxuan Tu
Parker Glenn
Kyeongmin Rim
Kelley Lynch
R. Brutti
31
5
0
12 May 2021
Analysing The Impact Of Linguistic Features On Cross-Lingual Transfer
Analysing The Impact Of Linguistic Features On Cross-Lingual Transfer
B. Dolički
Gerasimos Spanakis
26
16
0
12 May 2021
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation
Ahmad Rashid
Vasileios Lioutas
Mehdi Rezagholizadeh
AAML
26
36
0
12 May 2021
Kleister: Key Information Extraction Datasets Involving Long Documents
  with Complex Layouts
Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts
Tomasz Stanislawek
Filip Graliñski
Anna Wróblewska
Dawid Lipiñski
Agnieszka Kaliska
Paulina Rosalska
Bartosz Topolski
P. Biecek
44
92
0
12 May 2021
BertGCN: Transductive Text Classification by Combining GCN and BERT
BertGCN: Transductive Text Classification by Combining GCN and BERT
Yuxiao Lin
Yuxian Meng
Xiaofei Sun
Qinghong Han
Kun Kuang
Jiwei Li
Fei Wu
18
225
0
12 May 2021
Evaluating Gender Bias in Natural Language Inference
Evaluating Gender Bias in Natural Language Inference
Shanya Sharma
Manan Dey
Koustuv Sinha
28
41
0
12 May 2021
Addressing "Documentation Debt" in Machine Learning Research: A
  Retrospective Datasheet for BookCorpus
Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus
Jack Bandy
Nicholas Vincent
29
57
0
11 May 2021
Reinforcement Learning from Reformulations in Conversational Question
  Answering over Knowledge Graphs
Reinforcement Learning from Reformulations in Conversational Question Answering over Knowledge Graphs
Magdalena Kaiser
Rishiraj Saha Roy
Gerhard Weikum
29
52
0
11 May 2021
Poolingformer: Long Document Modeling with Pooling Attention
Poolingformer: Long Document Modeling with Pooling Attention
Hang Zhang
Yeyun Gong
Yelong Shen
Weisheng Li
Jiancheng Lv
Nan Duan
Weizhu Chen
43
98
0
10 May 2021
DefSent: Sentence Embeddings using Definition Sentences
DefSent: Sentence Embeddings using Definition Sentences
Hayato Tsukagoshi
Ryohei Sasano
Koichi Takeda
19
23
0
10 May 2021
How could Neural Networks understand Programs?
How could Neural Networks understand Programs?
Dinglan Peng
Shuxin Zheng
Yatao Li
Guolin Ke
Di He
Tie-Yan Liu
NAI
23
62
0
10 May 2021
REPT: Bridging Language Models and Machine Reading Comprehension via
  Retrieval-Based Pre-training
REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training
Fangkai Jiao
Yangyang Guo
Yilin Niu
Feng Ji
Feng-Lin Li
Liqiang Nie
LRM
34
12
0
10 May 2021
Understanding the Role of Affect Dimensions in Detecting Emotions from
  Tweets: A Multi-task Approach
Understanding the Role of Affect Dimensions in Detecting Emotions from Tweets: A Multi-task Approach
Rajdeep Mukherjee
Atharva Naik
S. Poddar
Soham Dasgupta
Niloy Ganguly
23
12
0
09 May 2021
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents
Chaojun Xiao
Xueyu Hu
Zhiyuan Liu
Cunchao Tu
Maosong Sun
AILaw
ELM
50
230
0
09 May 2021
e-ViL: A Dataset and Benchmark for Natural Language Explanations in
  Vision-Language Tasks
e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks
Maxime Kayser
Oana-Maria Camburu
Leonard Salewski
Cornelius Emde
Virginie Do
Zeynep Akata
Thomas Lukasiewicz
VLM
31
100
0
08 May 2021
Logic-Driven Context Extension and Data Augmentation for Logical
  Reasoning of Text
Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text
Siyuan Wang
Wanjun Zhong
Duyu Tang
Zhongyu Wei
Zhihao Fan
Daxin Jiang
Ming Zhou
Nan Duan
NAI
33
70
0
08 May 2021
Improving Named Entity Recognition by External Context Retrieving and
  Cooperative Learning
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
Xinyu Wang
Yong-jia Jiang
Nguyen Bach
Tao Wang
Zhongqiang Huang
Fei Huang
Kewei Tu
43
144
0
08 May 2021
Improving Document Representations by Generating Pseudo Query Embeddings
  for Dense Retrieval
Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval
Hongyin Tang
Xingwu Sun
Beihong Jin
Jingang Wang
Fuzheng Zhang
Wei Wu
RALM
36
36
0
08 May 2021
Understanding by Understanding Not: Modeling Negation in Language Models
Understanding by Understanding Not: Modeling Negation in Language Models
Arian Hosseini
Siva Reddy
Dzmitry Bahdanau
R. Devon Hjelm
Alessandro Sordoni
Rameswar Panda
22
87
0
07 May 2021
Empirical Evaluation of Pre-trained Transformers for Human-Level NLP:
  The Role of Sample Size and Dimensionality
Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality
Adithya Ganesan
Matthew Matero
Aravind Reddy Ravula
Huy-Hien Vu
H. Andrew Schwartz
30
35
0
07 May 2021
TABBIE: Pretrained Representations of Tabular Data
TABBIE: Pretrained Representations of Tabular Data
H. Iida
Dung Ngoc Thai
Varun Manjunatha
Mohit Iyyer
LMTD
SSL
VLM
25
170
0
06 May 2021
Rethinking Search: Making Domain Experts out of Dilettantes
Rethinking Search: Making Domain Experts out of Dilettantes
Donald Metzler
Yi Tay
Dara Bahri
Marc Najork
LRM
43
46
0
05 May 2021
HerBERT: Efficiently Pretrained Transformer-based Language Model for
  Polish
HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish
Robert Mroczkowski
Piotr Rybak
Alina Wróblewska
Ireneusz Gawlik
36
81
0
04 May 2021
When to Foldém: How to answer Unanswerable questions
When to Foldém: How to answer Unanswerable questions
Marshall Ho
Zhipeng Zhou
J. He
36
2
0
01 May 2021
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark
Nouha Dziri
Hannah Rashkin
Tal Linzen
David Reitter
ALM
208
79
0
30 Apr 2021
Explanation-Based Human Debugging of NLP Models: A Survey
Explanation-Based Human Debugging of NLP Models: A Survey
Piyawat Lertvittayakumjorn
Francesca Toni
LRM
47
79
0
30 Apr 2021
Entailment as Few-Shot Learner
Entailment as Few-Shot Learner
Sinong Wang
Han Fang
Madian Khabsa
Hanzi Mao
Hao Ma
35
183
0
29 Apr 2021
AMR Parsing with Action-Pointer Transformer
AMR Parsing with Action-Pointer Transformer
Jiawei Zhou
Tahira Naseem
Ramón Fernández Astudillo
Radu Florian
46
44
0
29 Apr 2021
SYNFIX: Automatically Fixing Syntax Errors using Compiler Diagnostics
SYNFIX: Automatically Fixing Syntax Errors using Compiler Diagnostics
Toufique Ahmed
Noah Rose Ledesma
Prem Devanbu
59
19
0
29 Apr 2021
MOROCCO: Model Resource Comparison Framework
MOROCCO: Model Resource Comparison Framework
Valentin Malykh
Alexander Kukushkin
Ekaterina Artemova
Vladislav Mikhailov
Maria Tikhonova
Tatiana Shavrina
24
0
0
29 Apr 2021
MelBERT: Metaphor Detection via Contextualized Late Interaction using
  Metaphorical Identification Theories
MelBERT: Metaphor Detection via Contextualized Late Interaction using Metaphorical Identification Theories
Minjin Choi
Sunkyung Lee
Eunseong Choi
Heesoo Park
Junhyuk Lee
Dongwon Lee
Jongwuk Lee
29
102
0
28 Apr 2021
Multi-class Text Classification using BERT-based Active Learning
Multi-class Text Classification using BERT-based Active Learning
Sumanth Prabhu
Moosa Mohamed
Hemant Misra
29
38
0
27 Apr 2021
Shellcode_IA32: A Dataset for Automatic Shellcode Generation
Shellcode_IA32: A Dataset for Automatic Shellcode Generation
Pietro Liguori
Erfan Al-Hossami
Domenico Cotroneo
R. Natella
B. Cukic
Samira Shaikh
36
27
0
27 Apr 2021
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
ObjD
VLM
93
864
0
26 Apr 2021
PanGu-$α$: Large-scale Autoregressive Pretrained Chinese Language
  Models with Auto-parallel Computation
PanGu-ααα: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation
Wei Zeng
Xiaozhe Ren
Teng Su
Hui Wang
Yi-Lun Liao
...
Gaojun Fan
Yaowei Wang
Xuefeng Jin
Qun Liu
Yonghong Tian
ALM
MoE
AI4CE
35
212
0
26 Apr 2021
Vietnamese Complaint Detection on E-Commerce Websites
Vietnamese Complaint Detection on E-Commerce Websites
N. Nguyen
Phuong Phan-Dieu Ha
Luan Thanh Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
33
6
0
24 Apr 2021
Incremental Few-shot Text Classification with Multi-round New Classes:
  Formulation, Dataset and System
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System
Congyin Xia
Wenpeng Yin
Yihao Feng
Philip Yu
CLL
VLM
27
51
0
24 Apr 2021
Weakly-supervised Multi-task Learning for Multimodal Affect Recognition
Weakly-supervised Multi-task Learning for Multimodal Affect Recognition
Wenliang Dai
Samuel Cahyawijaya
Yejin Bang
Pascale Fung
CVBM
41
11
0
23 Apr 2021
Analyzing Monotonic Linear Interpolation in Neural Network Loss
  Landscapes
Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
James Lucas
Juhan Bae
Michael Ruogu Zhang
Stanislav Fort
R. Zemel
Roger C. Grosse
MoMe
174
28
0
22 Apr 2021
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of
  Media Frames
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames
Shima Khanehzar
Trevor Cohn
Gosia Mikołajczak
A. Turpin
Lea Frermann
22
11
0
22 Apr 2021
Hybrid Encoder: Towards Efficient and Precise Native AdsRecommendation
  via Hybrid Transformer Encoding Networks
Hybrid Encoder: Towards Efficient and Precise Native AdsRecommendation via Hybrid Transformer Encoding Networks
Junhan Yang
Zheng Liu
Bowen Jin
Jianxun Lian
Defu Lian
Akshay Soni
Eun Yong Kang
Yajun Wang
Guangzhong Sun
Xing Xie
46
1
0
22 Apr 2021
All Tokens Matter: Token Labeling for Training Better Vision
  Transformers
All Tokens Matter: Token Labeling for Training Better Vision Transformers
Zihang Jiang
Qibin Hou
Li-xin Yuan
Daquan Zhou
Yujun Shi
Xiaojie Jin
Anran Wang
Jiashi Feng
ViT
38
203
0
22 Apr 2021
Previous
123...777879...949596
Next