ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,764 papers shown
Title
CUE: An Uncertainty Interpretation Framework for Text Classifiers Built
  on Pre-Trained Language Models
CUE: An Uncertainty Interpretation Framework for Text Classifiers Built on Pre-Trained Language Models
Jiazheng Li
ZHAOYUE SUN
Bin Liang
Lin Gui
Yulan He
79
2
0
06 Jun 2023
Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search
Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search
Zhiyu Zoey Chen
J. Choi
B. Fetahu
Oleg Rokhlenko
S. Malmasi
RALM
41
6
0
06 Jun 2023
Deep neural networks architectures from the perspective of manifold
  learning
Deep neural networks architectures from the perspective of manifold learning
German Magai
AAMLAI4CE
61
6
0
06 Jun 2023
BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision,
  Language, and Graphs
BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs
Zhiyong Yang
Tinglin Huang
Ming Ding
Yuxiao Dong
Rex Ying
Yukuo Cen
Yangli-ao Geng
Jie Tang
SSLVLM
91
9
0
06 Jun 2023
Click: Controllable Text Generation with Sequence Likelihood Contrastive
  Learning
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning
Chujie Zheng
Pei Ke
Zheng Zhang
Minlie Huang
BDL
83
34
0
06 Jun 2023
Security Knowledge-Guided Fuzzing of Deep Learning Libraries
Security Knowledge-Guided Fuzzing of Deep Learning Libraries
Nima Shiri Harzevili
Mohammad Mahdi Mohajer
Moshi Wei
H. Pham
Song Wang
AAMLAI4CE
56
1
0
05 Jun 2023
Information Flow Control in Machine Learning through Modular Model
  Architecture
Information Flow Control in Machine Learning through Modular Model Architecture
Trishita Tiwari
Suchin Gururangan
Chuan Guo
Weizhe Hua
Sanjay Kariyappa
Udit Gupta
Wenjie Xiong
Kiwan Maeng
Hsien-Hsin S. Lee
G. E. Suh
75
6
0
05 Jun 2023
NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification
  Tasks
NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks
Jean-Michel Attendu
Jean-Philippe Corbeil
88
16
0
05 Jun 2023
How Can We Train Deep Learning Models Across Clouds and Continents? An
  Experimental Study
How Can We Train Deep Learning Models Across Clouds and Continents? An Experimental Study
Alexander Isenko
R. Mayer
Hans-Arno Jacobsen
78
8
0
05 Jun 2023
Machine Learning and Statistical Approaches to Measuring Similarity of
  Political Parties
Machine Learning and Statistical Approaches to Measuring Similarity of Political Parties
Daria Boratyn
Damian Brzyski
Beata Kosowska-Gkastol
Jan Rybicki
Wojciech Słomczyński
Dariusz Stolicki
15
0
0
05 Jun 2023
Structured Voronoi Sampling
Structured Voronoi Sampling
Afra Amini
Li Du
Ryan Cotterell
DiffM
97
2
0
05 Jun 2023
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese
  Medical Exam Dataset
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Junling Liu
Peilin Zhou
Yining Hua
Dading Chong
Zhongyu Tian
...
Helin Wang
Chenyu You
Zhenhua Guo
Lei Zhu
Michael Lingzhi Li
LM&MAELM
111
79
0
05 Jun 2023
Which Argumentative Aspects of Hate Speech in Social Media can be
  reliably identified?
Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified?
D. Furman
Pablo E. Torres
José Raúl Rodríguez Rodríguez
Diego Letzen
María Vanina Martínez
Laura Alonso Alemany
32
7
0
05 Jun 2023
A Simple and Flexible Modeling for Mental Disorder Detection by Learning
  from Clinical Questionnaires
A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires
Hoyun Song
Jisu Shin
Huije Lee
Jong C. Park
72
7
0
05 Jun 2023
DecompX: Explaining Transformers Decisions by Propagating Token
  Decomposition
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition
Ali Modarressi
Mohsen Fayyaz
Ehsan Aghazadeh
Yadollah Yaghoobzadeh
Mohammad Taher Pilehvar
100
28
0
05 Jun 2023
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model
  Pre-Training Research
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Alham Fikri Aji
Genta Indra Winata
Radityo Eko Prasojo
Phil Blunsom
A. Kuncoro
65
8
0
05 Jun 2023
Leveraging Large Language Models for Topic Classification in the Domain
  of Public Affairs
Leveraging Large Language Models for Topic Classification in the Domain of Public Affairs
Alejandro Peña
Aythami Morales
Julian Fierrez
Ignacio Serna
J. Ortega-Garcia
Iñigo Puente
Jorge Cordova
Gonzalo Cordova
87
20
0
05 Jun 2023
UNIDECOR: A Unified Deception Corpus for Cross-Corpus Deception
  Detection
UNIDECOR: A Unified Deception Corpus for Cross-Corpus Deception Detection
Aswathy Velutharambath
Roman Klinger
35
3
0
05 Jun 2023
Enhancing Language Representation with Constructional Information for
  Natural Language Understanding
Enhancing Language Representation with Constructional Information for Natural Language Understanding
Lvxiaowei Xu
Jian Wu
Jiawei Peng
Zhilin Gong
Ming Cai
Tianxiang Wang
61
3
0
05 Jun 2023
CELDA: Leveraging Black-box Language Model as Enhanced Classifier
  without Labels
CELDA: Leveraging Black-box Language Model as Enhanced Classifier without Labels
Hyunsoo Cho
Youna Kim
Sang-goo Lee
42
3
0
05 Jun 2023
Joint Pre-training and Local Re-training: Transferable Representation
  Learning on Multi-source Knowledge Graphs
Joint Pre-training and Local Re-training: Transferable Representation Learning on Multi-source Knowledge Graphs
Zequn Sun
Jiacheng Huang
Jing-Rong Lin
Xiaozhou Xu
Qijin Chen
Wei Hu
62
2
0
05 Jun 2023
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual
  Document Understanding Models
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models
Jiabang He
Yilang Hu
Lei Wang
Xingdong Xu
Ning Liu
Hui-juan Liu
Hengtao Shen
VLMOOD
68
3
0
05 Jun 2023
Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help
  Multiple Graph Applications
Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications
Han Xie
Da Zheng
Jun Ma
Houyu Zhang
V. Ioannidis
...
Sheng Wang
Carl Yang
Yi Xu
Belinda Zeng
Trishul Chilimbi
AI4CE
90
40
0
05 Jun 2023
Introduction to Latent Variable Energy-Based Models: A Path Towards
  Autonomous Machine Intelligence
Introduction to Latent Variable Energy-Based Models: A Path Towards Autonomous Machine Intelligence
Anna Dawid
Yann LeCun
DRL
104
31
0
05 Jun 2023
Prompt to be Consistent is Better than Self-Consistent? Few-Shot and
  Zero-Shot Fact Verification with Pre-trained Language Models
Prompt to be Consistent is Better than Self-Consistent? Few-Shot and Zero-Shot Fact Verification with Pre-trained Language Models
Fengzhu Zeng
Wei Gao
69
7
0
05 Jun 2023
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and
  Generative Fusion
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
Dongfu Jiang
Xiang Ren
Bill Yuchen Lin
ELM
144
334
0
05 Jun 2023
Learning to Relate to Previous Turns in Conversational Search
Learning to Relate to Previous Turns in Conversational Search
Fengran Mo
J. Nie
Kaiyu Huang
Kelong Mao
Yutao Zhu
Peng Li
Yang Liu
103
28
0
05 Jun 2023
RadLing: Towards Efficient Radiology Report Understanding
RadLing: Towards Efficient Radiology Report Understanding
Rikhiya Ghosh
Sanjeev Kumar Karn
Manuela Danu
Larisa Micu
Ramya Vunikili
Oladimeji Farri
MedIm
61
6
0
04 Jun 2023
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet
Omar Shaikh
Caleb Ziems
William B. Held
Aryan Pariani
Fred Morstatter
Diyi Yang
85
14
0
04 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELMFedML
79
4
0
04 Jun 2023
Leverage Points in Modality Shifts: Comparing Language-only and
  Multimodal Word Representations
Leverage Points in Modality Shifts: Comparing Language-only and Multimodal Word Representations
Aleksey Tikhonov
Lisa Bylinina
Denis Paperno
53
2
0
04 Jun 2023
Probing Physical Reasoning with Counter-Commonsense Context
Probing Physical Reasoning with Counter-Commonsense Context
Kazushi Kondo
Saku Sugawara
Akiko Aizawa
LRM
74
4
0
04 Jun 2023
A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean
  Language Models
A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models
H. Ko
Kichang Yang
Minho Ryu
Taekyoon Choi
Seungmu Yang
Jiwung Hyun
Sung-Yong Park
Kyubyong Park
93
30
0
04 Jun 2023
Sen2Pro: A Probabilistic Perspective to Sentence Embedding from
  Pre-trained Language Model
Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
Lingfeng Shen
Haiyun Jiang
Lemao Liu
Shuming Shi
61
2
0
04 Jun 2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Hui Yang
Sifu Yue
Yunzhong He
RALM
72
172
0
04 Jun 2023
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning
Jianghui Wang
Yuxuan Wang
Dongyan Zhao
Zilong Zheng
87
1
0
04 Jun 2023
Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in
  Sentiment Analysis
Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis
Daniela Teodorescu
Saif M. Mohammad
59
13
0
03 Jun 2023
Question-Context Alignment and Answer-Context Dependencies for Effective
  Answer Sentence Selection
Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection
Minh Le Nguyen
K. Kishan
Toan Q. Nguyen
Thien Huu Nguyen
Ankit Chadha
Thuy Vu
39
0
0
03 Jun 2023
Stubborn Lexical Bias in Data and Models
Stubborn Lexical Bias in Data and Models
Sofia Serrano
Jesse Dodge
Noah A. Smith
97
2
0
03 Jun 2023
Towards Coding Social Science Datasets with Language Models
Towards Coding Social Science Datasets with Language Models
Anonymous Acl
Taylor Sorensen
Lisa P. Argyle
Ethan C. Busby
Nancy Fulda
Joshua R Gubler
David Wingate
ALMSyDa
55
11
0
03 Jun 2023
Incorporating Deep Syntactic and Semantic Knowledge for Chinese Sequence
  Labeling with GCN
Incorporating Deep Syntactic and Semantic Knowledge for Chinese Sequence Labeling with GCN
Xuemei Tang
Jun Wang
Qi Su
GNN
50
0
0
03 Jun 2023
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Georgios Peikos
S. Symeonidis
Pranav Kasela
G. Pasi
LM&MA
50
13
0
03 Jun 2023
MultiLegalPile: A 689GB Multilingual Legal Corpus
MultiLegalPile: A 689GB Multilingual Legal Corpus
Joel Niklaus
Veton Matoshi
Matthias Sturmer
Ilias Chalkidis
Daniel E. Ho
AILawELM
120
44
0
03 Jun 2023
A Comprehensive Survey on Relation Extraction: Recent Advances and New
  Frontiers
A Comprehensive Survey on Relation Extraction: Recent Advances and New Frontiers
Xiaoyan Zhao
Yang Deng
Min Yang
Lingzhi Wang
Rui Zhang
Hong Cheng
W. Lam
Ying Shen
Ruifeng Xu
KELM
98
36
0
03 Jun 2023
Span Identification of Epistemic Stance-Taking in Academic Written
  English
Span Identification of Epistemic Stance-Taking in Academic Written English
Masaki Eguchi
K. Kyle
35
6
0
03 Jun 2023
LIC-GAN: Language Information Conditioned Graph Generative GAN Model
LIC-GAN: Language Information Conditioned Graph Generative GAN Model
Robert Lo
Arnhav Datar
Abishek Sridhar
GAN
73
2
0
02 Jun 2023
A Simple yet Effective Self-Debiasing Framework for Transformer Models
A Simple yet Effective Self-Debiasing Framework for Transformer Models
Xiaoyue Wang
Lijie Wang
Xin Liu
Suhang Wu
Jinsong Su
Huasen Wu
68
4
0
02 Jun 2023
Revisiting the Role of Language Priors in Vision-Language Models
Revisiting the Role of Language Priors in Vision-Language Models
Zhiqiu Lin
Xinyue Chen
Deepak Pathak
Pengchuan Zhang
Deva Ramanan
VLM
159
27
0
02 Jun 2023
Knowledge of cultural moral norms in large language models
Knowledge of cultural moral norms in large language models
Aida Ramezani
Yang Xu
ELMAILaw
72
51
0
02 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Multilingual Conceptual Coverage in Text-to-Image Models
Michael Stephen Saxon
William Yang Wang
EGVM
80
9
0
02 Jun 2023
Previous
123...969798...214215216
Next