ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,839 papers shown
Title
Modeling Fine-grained Information via Knowledge-aware Hierarchical Graph
  for Zero-shot Entity Retrieval
Modeling Fine-grained Information via Knowledge-aware Hierarchical Graph for Zero-shot Entity Retrieval
Taiqiang Wu
Xingyu Bai
Weigang Guo
Weijie Liu
Siheng Li
Yujiu Yang
89
16
0
20 Nov 2022
Artificial Interrogation for Attributing Language Models
Artificial Interrogation for Attributing Language Models
Farhan Dhanani
Muhammad Rafi
36
1
0
20 Nov 2022
SeDR: Segment Representation Learning for Long Documents Dense Retrieval
SeDR: Segment Representation Learning for Long Documents Dense Retrieval
Junying Chen
Qingcai Chen
Dongfang Li
Yutao Huang
67
6
0
20 Nov 2022
A survey on knowledge-enhanced multimodal learning
A survey on knowledge-enhanced multimodal learning
Maria Lymperaiou
Giorgos Stamou
174
15
0
19 Nov 2022
Entity-Assisted Language Models for Identifying Check-worthy Sentences
Entity-Assisted Language Models for Identifying Check-worthy Sentences
Ting-Han Su
Craig Macdonald
I. Ounis
47
0
0
19 Nov 2022
Bipartite-play Dialogue Collection for Practical Automatic Evaluation of
  Dialogue Systems
Bipartite-play Dialogue Collection for Practical Automatic Evaluation of Dialogue Systems
Shiki Sato
Yosuke Kishinami
Hiroaki Sugiyama
Reina Akama
Ryoko Tokuhisa
Jun Suzuki
99
2
0
19 Nov 2022
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for
  Efficient and Effective Multi-Vector Retrieval
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval
Minghan Li
Sheng-Chieh Lin
Barlas Oğuz
Asish Ghoshal
Jimmy J. Lin
Yashar Mehdad
Wen-tau Yih
Xilun Chen
80
26
0
18 Nov 2022
GENIUS: Sketch-based Language Model Pre-training via Extreme and
  Selective Masking for Text Generation and Augmentation
GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation
Biyang Guo
Yeyun Gong
Yelong Shen
Songqiao Han
Hailiang Huang
Nan Duan
Weizhu Chen
VLM
102
19
0
18 Nov 2022
Task Residual for Tuning Vision-Language Models
Task Residual for Tuning Vision-Language Models
Tao Yu
Zhihe Lu
Xin Jin
Zhibo Chen
Xinchao Wang
VLMCLIP
111
92
0
18 Nov 2022
Context Variance Evaluation of Pretrained Language Models for
  Prompt-based Biomedical Knowledge Probing
Context Variance Evaluation of Pretrained Language Models for Prompt-based Biomedical Knowledge Probing
Zonghai Yao
Yi Cao
Zhichao Yang
Hong-ye Yu
100
17
0
18 Nov 2022
Vision Transformers in Medical Imaging: A Review
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViTMedIm
101
36
0
18 Nov 2022
Where did you tweet from? Inferring the origin locations of tweets based
  on contextual information
Where did you tweet from? Inferring the origin locations of tweets based on contextual information
Rabindra Lamsal
Aaron Harwood
M. Read
64
11
0
18 Nov 2022
Towards Explaining Subjective Ground of Individuals on Social Media
Towards Explaining Subjective Ground of Individuals on Social Media
Younghun Lee
Dan Goldwasser
73
1
0
18 Nov 2022
CAPE: Corrective Actions from Precondition Errors using Large Language
  Models
CAPE: Corrective Actions from Precondition Errors using Large Language Models
S. S. Raman
Vanya Cohen
Ifrah Idrees
Eric Rosen
Ray Mooney
Stefanie Tellex
D. Paulius
LLMAGVLM
90
35
0
17 Nov 2022
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and
  Vision-Language Tasks
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li
Jinguo Zhu
Xiaohu Jiang
Xizhou Zhu
Hongsheng Li
...
Xiaohua Wang
Yu Qiao
Xiaogang Wang
Wenhai Wang
Jifeng Dai
MLLM
89
58
0
17 Nov 2022
Zero-Shot Dynamic Quantization for Transformer Inference
Zero-Shot Dynamic Quantization for Transformer Inference
Yousef El-Kurdi
Jerry Quinn
Avirup Sil
MQ
69
1
0
17 Nov 2022
UPTON: Preventing Authorship Leakage from Public Text Release via Data
  Poisoning
UPTON: Preventing Authorship Leakage from Public Text Release via Data Poisoning
Ziyao Wang
Thai Le
Dongwon Lee
91
1
0
17 Nov 2022
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global
  Association Approach
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach
Pha Nguyen
Kha Gia Quach
C. Duong
Son Lam Phung
Ngan Le
Khoa Luu
123
13
0
17 Nov 2022
LongFNT: Long-form Speech Recognition with Factorized Neural Transducer
LongFNT: Long-form Speech Recognition with Factorized Neural Transducer
Xun Gong
Yu-Huan Wu
Jinyu Li
Shujie Liu
Rui Zhao
Xie Chen
Y. Qian
RALM
69
11
0
17 Nov 2022
Self-Training with Purpose Preserving Augmentation Improves Few-shot
  Generative Dialogue State Tracking
Self-Training with Purpose Preserving Augmentation Improves Few-shot Generative Dialogue State Tracking
Jihyun Lee
C. Lee
Yunsu Kim
G. G. Lee
76
0
0
17 Nov 2022
Few-shot Learning for Multi-modal Social Media Event Filtering
Few-shot Learning for Multi-modal Social Media Event Filtering
José Nascimento
J. P. Cardenuto
J. Yang
Anderson de Rezende Rocha
52
3
0
16 Nov 2022
A Graph-Based Context-Aware Model to Understand Online Conversations
A Graph-Based Context-Aware Model to Understand Online Conversations
Vibhor Agarwal
A. P. Young
Sagar Joglekar
Nishanth R. Sastry
116
9
0
16 Nov 2022
Deep Emotion Recognition in Textual Conversations: A Survey
Deep Emotion Recognition in Textual Conversations: A Survey
Patrícia Pereira
Helena Moniz
Joao Paulo Carvalho
101
18
0
16 Nov 2022
Prompting PaLM for Translation: Assessing Strategies and Performance
Prompting PaLM for Translation: Assessing Strategies and Performance
David Vilar
Markus Freitag
Colin Cherry
Jiaming Luo
Viresh Ratnakar
George F. Foster
LRM
122
167
0
16 Nov 2022
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight
  BERT
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight BERT
Siyuan Lu
Chenchen Zhou
Keli Xie
Jun Lin
Zhongfeng Wang
49
1
0
16 Nov 2022
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed
  Representations
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Linlin Liu
Xingxuan Li
Megh Thakkar
Xin Li
Shafiq Joty
Luo Si
Lidong Bing
90
2
0
16 Nov 2022
ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual
  Pre-training
ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training
Henry Tang
Ameet Deshpande
Karthik Narasimhan
106
5
0
15 Nov 2022
Alzheimer's Dementia Detection through Spontaneous Dialogue with
  Proactive Robotic Listeners
Alzheimer's Dementia Detection through Spontaneous Dialogue with Proactive Robotic Listeners
Yuanchao Li
Catherine Lai
Divesh Lala
K. Inoue
Tatsuya Kawahara
60
12
0
15 Nov 2022
Introducing Semantics into Speech Encoders
Introducing Semantics into Speech Encoders
Derek Xu
Shuyan Dong
Changhan Wang
Suyoun Kim
Zhaojiang Lin
...
Alexei Baevski
Guan-Ting Lin
Hung-yi Lee
Yizhou Sun
Wei Wang
SSL
103
3
0
15 Nov 2022
A Universal Discriminator for Zero-Shot Generalization
A Universal Discriminator for Zero-Shot Generalization
Haike Xu
Zongyu Lin
Jing Zhou
Yanan Zheng
Zhilin Yang
AI4CE
66
16
0
15 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an
  Out-of-distribution Generalization Perspective
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
191
82
0
15 Nov 2022
YORO -- Lightweight End to End Visual Grounding
YORO -- Lightweight End to End Visual Grounding
Chih-Hui Ho
Srikar Appalaraju
Bhavan A. Jasani
R. Manmatha
Nuno Vasconcelos
ObjD
60
22
0
15 Nov 2022
A Survey for Efficient Open Domain Question Answering
A Survey for Efficient Open Domain Question Answering
Qin Zhang
Shan Chen
Dongkuan Xu
Qingqing Cao
Xiaojun Chen
Trevor Cohn
Meng Fang
90
36
0
15 Nov 2022
QueryForm: A Simple Zero-shot Form Entity Query Framework
QueryForm: A Simple Zero-shot Form Entity Query Framework
Zifeng Wang
Zizhao Zhang
Jacob Devlin
Chen-Yu Lee
Guolong Su
Hao Zhang
Jennifer Dy
Vincent Perot
Tomas Pfister
68
8
0
14 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at
  Scale
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLMCLIP
269
730
0
14 Nov 2022
Semantic Similarity Models for Depression Severity Estimation
Semantic Similarity Models for Depression Severity Estimation
Anxo Perez
Neha Warikoo
Kexin Wang
Javier Parapar
Iryna Gurevych
AI4MH
62
7
0
14 Nov 2022
Imagination is All You Need! Curved Contrastive Learning for Abstract
  Sequence Modeling Utilized on Long Short-Term Dialogue Planning
Imagination is All You Need! Curved Contrastive Learning for Abstract Sequence Modeling Utilized on Long Short-Term Dialogue Planning
Justus-Jonas Erker
Stefan Schaffer
Gerasimos Spanakis
91
1
0
14 Nov 2022
High-Resource Methodological Bias in Low-Resource Investigations
High-Resource Methodological Bias in Low-Resource Investigations
Maartje ter Hoeve
David Grangier
Natalie Schluter
78
2
0
14 Nov 2022
Calibrated Interpretation: Confidence Estimation in Semantic Parsing
Calibrated Interpretation: Confidence Estimation in Semantic Parsing
Elias Stengel-Eskin
Benjamin Van Durme
UQLM
166
25
0
14 Nov 2022
Composed Image Retrieval with Text Feedback via Multi-grained
  Uncertainty Regularization
Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
Yiyang Chen
Zhedong Zheng
Wei Ji
Leigang Qu
Tat-Seng Chua
164
45
0
14 Nov 2022
Language models are good pathologists: using attention-based sequence
  reduction and text-pretrained transformers for efficient WSI classification
Language models are good pathologists: using attention-based sequence reduction and text-pretrained transformers for efficient WSI classification
Juan Pisula
Katarzyna Bozek
VLMMedIm
89
3
0
14 Nov 2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang
Kaiyue Wen
Zhengyan Zhang
Lei Hou
Zhiyuan Liu
Juanzi Li
MILMMoE
88
52
0
14 Nov 2022
Replacing Language Model for Style Transfer
Replacing Language Model for Style Transfer
Peng Cheng
Rui Li
KELM
72
3
0
14 Nov 2022
MT4SSL: Boosting Self-Supervised Speech Representation Learning by
  Integrating Multiple Targets
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets
Ziyang Ma
Zhisheng Zheng
Changli Tang
Yujin Wang
Xie Chen
126
20
0
14 Nov 2022
SPE: Symmetrical Prompt Enhancement for Fact Probing
SPE: Symmetrical Prompt Enhancement for Fact Probing
Yiyuan Li
Tong Che
Yezhen Wang
Zhengbao Jiang
Caiming Xiong
Snigdha Chaturvedi
77
6
0
14 Nov 2022
Xu at SemEval-2022 Task 4: Pre-BERT Neural Network Methods vs Post-BERT
  RoBERTa Approach for Patronizing and Condescending Language Detection
Xu at SemEval-2022 Task 4: Pre-BERT Neural Network Methods vs Post-BERT RoBERTa Approach for Patronizing and Condescending Language Detection
Jinghua Xu
38
3
0
13 Nov 2022
FPT: Improving Prompt Tuning Efficiency via Progressive Training
FPT: Improving Prompt Tuning Efficiency via Progressive Training
Yufei Huang
Yujia Qin
Huadong Wang
Yichun Yin
Maosong Sun
Zhiyuan Liu
Qun Liu
VLMLRM
66
6
0
13 Nov 2022
Large-scale Contrastive Language-Audio Pretraining with Feature Fusion
  and Keyword-to-Caption Augmentation
Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation
Yusong Wu
Kai Chen
Tianyu Zhang
Yuchen Hui
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
CLIP
194
546
0
12 Nov 2022
NLPeer: A Unified Resource for the Computational Study of Peer Review
NLPeer: A Unified Resource for the Computational Study of Peer Review
Nils Dycke
Ilia Kuznetsov
Iryna Gurevych
82
39
0
12 Nov 2022
Few-shot Multimodal Sentiment Analysis based on Multimodal Probabilistic
  Fusion Prompts
Few-shot Multimodal Sentiment Analysis based on Multimodal Probabilistic Fusion Prompts
Xiaocui Yang
Shi Feng
Daling Wang
Pengfei Hong
Soujanya Poria
84
23
0
12 Nov 2022
Previous
123...129130131...215216217
Next