ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,763 papers shown
On the Relationship between Skill Neurons and Robustness in Prompt Tuning
Leon Ackermann
Xenia Ohmer
AAML
42
0
0
21 Sep 2023
Bad Actor, Good Advisor: Exploring the Role of Large Language Models in Fake News Detection
Beizhe Hu
Qiang Sheng
Juan Cao
Yuhui Shi
Yang Li
Danding Wang
Peng Qi
136
99
0
21 Sep 2023
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches
Deepak Gupta
Kush Attal
Dina Demner-Fushman
LM&MA
54
1
0
21 Sep 2023
How-to Guides for Specific Audiences: A Corpus and Initial Findings
Nicola Fanton
Agnieszka Falenska
Michael Roth
21
0
0
21 Sep 2023
SemEval-2022 Task 7: Identifying Plausible Clarifications of Implicit and Underspecified Phrases in Instructional Texts
Michael Roth
Talita Anthonio
Anna Sauer
62
16
0
21 Sep 2023
BELT: Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision
Jinzhao Zhou
Yiqun Duan
Yu-Cheng Chang
Yu-Kai Wang
Chin-Teng Lin
76
6
0
21 Sep 2023
Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation
P. Li
Yu Zhang
L. Yuan
Xianghua Xu
VOS
53
9
0
21 Sep 2023
Evaluating Large Language Models for Document-grounded Response Generation in Information-Seeking Dialogues
N. Braunschweiler
R. Doddipatla
Simon Keizer
Svetlana Stoyanchev
LM&MA
56
10
0
21 Sep 2023
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs
Chengyuan Liu
Fubang Zhao
Lizhi Qing
Yangyang Kang
Changlong Sun
Kun Kuang
Leilei Gan
AAML
75
21
0
21 Sep 2023
Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech
Rui Liu
Bin Liu
Haizhou Li
53
3
0
21 Sep 2023
Semi-supervised News Discourse Profiling with Contrastive Learning
Ming Li
Ruihong Huang
61
2
0
20 Sep 2023
Large-scale Pretraining Improves Sample Efficiency of Active Learning based Molecule Virtual Screening
Zhonglin Cao
Simone Sciabola
Ye Wang
75
1
0
20 Sep 2023
Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation
Ali Mousavi
Xin Zhan
Richard He Bai
Peng Shi
Theo Rekatsinas
...
Jeff Pound
Josh Susskind
Natalie Schluter
Ihab F. Ilyas
Navdeep Jaitly
53
2
0
20 Sep 2023
Sentence Attention Blocks for Answer Grounding
Seyedalireza Khoshsirat
Chandra Kambhamettu
75
8
0
20 Sep 2023
A Large-scale Dataset for Audio-Language Representation Learning
Luoyi Sun
Xuenan Xu
Mengyue Wu
Weidi Xie
87
27
0
20 Sep 2023
Kosmos-2.5: A Multimodal Literate Model
Tengchao Lv
Yupan Huang
Jingye Chen
Lei Cui
Shuming Ma
...
Weiyao Luo
Shaoxiang Wu
Guoxin Wang
Cha Zhang
Furu Wei
VLM
MLLM
114
66
0
20 Sep 2023
CPLLM: Clinical Prediction with Large Language Models
Ofir Ben Shoham
Nadav Rappoport
LM&MA
81
28
0
20 Sep 2023
Overview of AuTexTification at IberLEF 2023: Detection and Attribution of Machine-Generated Text in Multiple Domains
A. Sarvazyan
José Ángel González
Marc Franco-Salvador
Francisco Rangel
Berta Chulvi
Paolo Rosso
DeLMO
93
64
0
20 Sep 2023
Sequence-to-Sequence Spanish Pre-trained Language Models
Vladimir Araujo
Maria Mihaela Truşcǎ
Rodrigo Tufino
Marie-Francine Moens
71
2
0
20 Sep 2023
Compilation as a Defense: Enhancing DL Model Attack Robustness via Tensor Optimization
Stefan Trawicki
William Hackett
Lewis Birch
M. Dascalu
Peter Garraghan
AAML
56
0
0
20 Sep 2023
Assessment of Pre-Trained Models Across Languages and Grammars
Alberto Muñoz-Ortiz
David Vilares
Carlos Gómez-Rodríguez
62
4
0
20 Sep 2023
CoT-BERT: Enhancing Unsupervised Sentence Representation through Chain-of-Thought
Bowen Zhang
Kehua Chang
Chunping Li
SSL
99
6
0
20 Sep 2023
Benchmarks for Pirá 2.0, a Reading Comprehension Dataset about the Ocean, the Brazilian Coast, and Climate Change
Paulo Pirozelli
M. M. José
I. Silveira
Flávio Nakasato
S. M. Peres
A. Brandão
Anna H. R. Costa
Fabio Gagliardi Cozman
RALM
75
4
0
19 Sep 2023
A Family of Pretrained Transformer Language Models for Russian
Dmitry Zmitrovich
Alexander Abramov
Andrey Kalmykov
Maria Tikhonova
Ekaterina Taktasheva
...
Vitalii Kadulin
Sergey Markov
Tatiana Shavrina
Vladislav Mikhailov
Alena Fenogenova
109
26
0
19 Sep 2023
Specializing Small Language Models towards Complex Style Transfer via Latent Attribute Pre-Training
Ruiqi Xu
Y. Huang
Xin Chen
Lin Zhang
31
3
0
19 Sep 2023
Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Toan Tien Nguyen
Minh Nhat Vu
Baoru Huang
Tuan V. Vo
Vy Truong
Ngan Le
T. Vo
Bac Le
Anh Nguyen
DiffM
84
18
0
19 Sep 2023
Artificial Intelligence-Enabled Intelligent Assistant for Personalized and Adaptive Learning in Higher Education
Ramteja Sajja
Y. Sermet
Muhammed Cikmaz
David M. Cwiertny
Ibrahim Demir
104
149
0
19 Sep 2023
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Tianhua Zhang
Jiaxin Ge
Hongyin Luo
Yung-Sung Chuang
Mingye Gao
Yuan Gong
Xixin Wu
Yoon Kim
Helen M. Meng
James R. Glass
LRM
ReLM
150
16
0
19 Sep 2023
Learning Tri-modal Embeddings for Zero-Shot Soundscape Mapping
Subash Khanal
Srikumar Sastry
Aayush Dhakal
Nathan Jacobs
107
10
0
19 Sep 2023
Unsupervised Deep Cross-Language Entity Alignment
Chuanyu Jiang
Yiming Qian
Lijun Chen
Yang Gu
Xia Xie
96
5
0
19 Sep 2023
A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings
Danushka Bollegala
Shuichi Otake
T. Machide
Ken-ichi Kawarabayashi
136
4
0
19 Sep 2023
Model Leeching: An Extraction Attack Targeting LLMs
Lewis Birch
William Hackett
Stefan Trawicki
N. Suri
Peter Garraghan
83
13
0
19 Sep 2023
PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems
Bryan Wilie
Yan Xu
Willy Chung
Samuel Cahyawijaya
Holy Lovenia
Pascale Fung
66
1
0
19 Sep 2023
KoBigBird-large: Transformation of Transformer for Korean Language Understanding
Kisu Yang
Yoonna Jang
Taewoo Lee
Jinwoo Seong
Hyungjin Lee
Hwanseok Jang
Heu-Jeoung Lim
VLM
68
0
0
19 Sep 2023
Mixed-Distil-BERT: Code-mixed Language Modeling for Bangla, English, and Hindi
Md. Nishat Raihan
Dhiman Goswami
Antara Mahmud
89
1
0
19 Sep 2023
GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
Jiahao Yu
Xingwei Lin
Zheng Yu
Xinyu Xing
SILM
230
353
0
19 Sep 2023
What is the Best Automated Metric for Text to Motion Generation?
Jordan Voas
Yili Wang
Qixing Huang
Raymond Mooney
EGVM
125
14
0
19 Sep 2023
Few-Shot Adaptation for Parsing Contextual Utterances with LLMs
Kevin Lin
Patrick Xia
Hao Fang
61
2
0
18 Sep 2023
RadOnc-GPT: A Large Language Model for Radiation Oncology
Zheng Liu
Peilong Wang
Yiwei Li
J. Holmes
Peng Shu
...
Quanzheng Li
Samir H. Patel
Terence T. Sio
Tianming Liu
Wen Liu
LM&MA
107
23
0
18 Sep 2023
Deep Prompt Tuning for Graph Transformers
Reza Shirkavand
Heng-Chiao Huang
57
7
0
18 Sep 2023
GAME: Generalized deep learning model towards multimodal data integration for early screening of adolescent mental disorders
Zhicheng Du
Chenyao Jiang
Xi Yuan
Shiyao Zhai
Zhengyang Lei
...
Chufan Xiao
Qiming Huang
Ming Xu
Dongmei Yu
Peiwu Qin
69
0
0
18 Sep 2023
Automatic Personalized Impression Generation for PET Reports Using Large Language Models
Xin Tie
Muheon Shin
Ali Pirasteh
Nevein Ibrahim
Zachary Huemann
...
K. M. Kelly
John W. Garrett
Junjie Hu
Steve Y. Cho
Tyler Bradshaw
LM&MA
114
10
0
18 Sep 2023
Not Enough Labeled Data? Just Add Semantics: A Data-Efficient Method for Inferring Online Health Texts
Joseph Gatto
S. Preum
AI4MH
57
1
0
18 Sep 2023
Watch the Speakers: A Hybrid Continuous Attribution Network for Emotion Recognition in Conversation With Emotion Disentanglement
Shanglin Lei
Xiaoping Wang
Guanting Dong
Jiang Li
Yingjian Liu
63
2
0
18 Sep 2023
Harnessing Collective Intelligence Under a Lack of Cultural Consensus
Necdet Gurkan
Jordan W. Suchow
62
2
0
18 Sep 2023
LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models
Nan Li
Bo Kang
T. D. Bie
89
2
0
18 Sep 2023
Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels
Panatchakorn Anantaprayoon
Masahiro Kaneko
Naoaki Okazaki
123
18
0
18 Sep 2023
Multi-turn Dialogue Comprehension from a Topic-aware Perspective
Xinbei Ma
Yi Xu
Hai Zhao
Zhuosheng Zhang
79
5
0
18 Sep 2023
Proposition from the Perspective of Chinese Language: A Chinese Proposition Classification Evaluation Benchmark
Conghui Niu
Mengyang Hu
Lin Bo
Xiaoli He
Dong Yu
Peng Liu
43
0
0
18 Sep 2023
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Jonas Golde
Patrick Haller
Felix Hamborg
Julian Risch
Alan Akbik
112
8
0
18 Sep 2023