ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,734 papers shown
Title
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only
  Quantization for LLMs
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs
Young Jin Kim
Rawn Henry
Raffy Fahim
Hany Awadalla
MQ
82
19
0
16 Aug 2023
Lightweight Adaptation of Neural Language Models via Subspace Embedding
Lightweight Adaptation of Neural Language Models via Subspace Embedding
Amit Kumar Jaiswal
Haiming Liu
57
2
0
16 Aug 2023
BIOptimus: Pre-training an Optimal Biomedical Language Model with
  Curriculum Learning for Named Entity Recognition
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition
Vera Pavlova
M. Makhlouf
58
3
0
16 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion
  Expressions
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
131
116
0
16 Aug 2023
SummHelper: Collaborative Human-Computer Summarization
SummHelper: Collaborative Human-Computer Summarization
Aviv Slobodkin
Niv Nachum
Shmuel Amar
Ori Shapira
Ido Dagan
88
1
0
16 Aug 2023
Visually-Aware Context Modeling for News Image Captioning
Visually-Aware Context Modeling for News Image Captioning
Tingyu Qu
Tinne Tuytelaars
Marie-Francine Moens
VLM
58
9
0
16 Aug 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task
  Learning in NLP Through ML Lifecycle: A Survey
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
72
5
0
16 Aug 2023
Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme
  Detection
Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection
Rui Cao
Ming Shan Hee
Adriel Kuek
Wen-Haw Chong
Roy Ka-wei Lee
Jing Jiang
VLMMLLM
56
43
0
16 Aug 2023
Using Artificial Populations to Study Psychological Phenomena in Neural
  Models
Using Artificial Populations to Study Psychological Phenomena in Neural Models
Jesse Roberts
Kyle Moore
Drew Wilenzick
Doug Fisher
60
6
0
15 Aug 2023
"Beware of deception": Detecting Half-Truth and Debunking it through
  Controlled Claim Editing
"Beware of deception": Detecting Half-Truth and Debunking it through Controlled Claim Editing
Sandeep Singamsetty
Nishtha Madaan
S. Mehta
Varad Bhatnagar
P. Bhattacharyya
HILM
34
0
0
15 Aug 2023
Robustness Over Time: Understanding Adversarial Examples' Effectiveness
  on Longitudinal Versions of Large Language Models
Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models
Yugeng Liu
Tianshuo Cong
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
AAML
90
8
0
15 Aug 2023
Enhancing Visually-Rich Document Understanding via Layout Structure
  Modeling
Enhancing Visually-Rich Document Understanding via Layout Structure Modeling
Qiwei Li
Z. Li
Xiantao Cai
Bo Du
Hai Zhao
64
8
0
15 Aug 2023
SPM: Structured Pretraining and Matching Architectures for Relevance
  Modeling in Meituan Search
SPM: Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search
Wen-xin Zan
Yaopeng Han
Xiaotian Jiang
Yao Xiao
Yang Yang
Dayao Chen
Sheng Chen
70
3
0
15 Aug 2023
Comparison between parameter-efficient techniques and full fine-tuning:
  A case study on multilingual news article classification
Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification
Olesya Razuvayevskaya
Ben Wu
João A. Leite
Freddy Heppell
Ivan Srba
Carolina Scarton
Kalina Bontcheva
Xingyi Song
62
10
0
14 Aug 2023
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt
  Generation for Few-shot Learning
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Generation for Few-shot Learning
Chengzhengxu Li
Xiaoming Liu
Yichen Wang
Duyi Li
Y. Lan
Chao Shen
83
6
0
14 Aug 2023
AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Zhaohui Li
Haitao Wang
Xinghua Jiang
113
1
0
14 Aug 2023
Language is All a Graph Needs
Language is All a Graph Needs
Ruosong Ye
Caiqi Zhang
Runhui Wang
Shuyuan Xu
Yongfeng Zhang
AI4CE
168
170
0
14 Aug 2023
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Anna Rogers
A. Luccioni
160
21
0
14 Aug 2023
DIVAS: An LLM-based End-to-End Framework for SoC Security Analysis and
  Policy-based Protection
DIVAS: An LLM-based End-to-End Framework for SoC Security Analysis and Policy-based Protection
Sudipta Paria
Aritra Dasgupta
Swarup Bhunia
70
23
0
14 Aug 2023
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models
  with Positional Embeddings
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings
H. Wen
Jie Wang
Xiaodong Qiao
55
0
0
14 Aug 2023
Improving Face Recognition from Caption Supervision with Multi-Granular
  Contextual Feature Aggregation
Improving Face Recognition from Caption Supervision with Multi-Granular Contextual Feature Aggregation
Md Golam Moula Mehedi Hasan
Nasser M. Nasrabadi
CVBM
47
2
0
13 Aug 2023
Building Trust in Conversational AI: A Comprehensive Review and Solution
  Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge
  Graph
Building Trust in Conversational AI: A Comprehensive Review and Solution Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge Graph
Ahtsham Zafar
V. Parthasarathy
Chan Le Van
Saad Shahid
A. khan
Arsalan Shahid
79
14
0
13 Aug 2023
An Ensemble Approach to Question Classification: Integrating Electra
  Transformer, GloVe, and LSTM
An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM
Sanad Aburass
O. Dorgham
Maha Abu Rumman
63
3
0
13 Aug 2023
Robust Infidelity: When Faithfulness Measures on Masked Language Models
  Are Misleading
Robust Infidelity: When Faithfulness Measures on Masked Language Models Are Misleading
Evan Crothers
H. Viktor
Nathalie Japkowicz
AAML
68
1
0
13 Aug 2023
VisIT-Bench: A Benchmark for Vision-Language Instruction Following
  Inspired by Real-World Use
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
Yonatan Bitton
Hritik Bansal
Jack Hessel
Rulin Shao
Wanrong Zhu
Anas Awadalla
Josh Gardner
Rohan Taori
L. Schimdt
VLM
129
82
0
12 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
121
285
0
12 Aug 2023
Large Language Models and Knowledge Graphs: Opportunities and Challenges
Large Language Models and Knowledge Graphs: Opportunities and Challenges
Jeff Z. Pan
Simon Razniewski
Jan-Christoph Kalo
Sneha Singhania
Jiaoyan Chen
...
Gerard de Melo
A. Bonifati
Edlira Vakaj
M. Dragoni
D. Graux
KELM
91
78
0
11 Aug 2023
Foundation Model is Efficient Multimodal Multitask Model Selector
Foundation Model is Efficient Multimodal Multitask Model Selector
Fanqing Meng
Wenqi Shao
Zhanglin Peng
Chong Jiang
Kaipeng Zhang
Yu Qiao
Ping Luo
67
16
0
11 Aug 2023
KETM:A Knowledge-Enhanced Text Matching method
KETM:A Knowledge-Enhanced Text Matching method
Kexin Jiang
Yahui Zhao
Guozhe Jin
Zhenguo Zhang
Rong-yi Cui
50
6
0
11 Aug 2023
Identification of the Relevance of Comments in Codes Using Bag of Words
  and Transformer Based Models
Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based Models
S. Sruthi
Tanmay Basu
39
1
0
11 Aug 2023
Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic
Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic
Terufumi Morishita
Gaku Morio
Atsuki Yamaguchi
Yasuhiro Sogawa
ReLMLRMAI4CEELM
100
26
0
11 Aug 2023
C5: Towards Better Conversation Comprehension and Contextual Continuity
  for ChatGPT
C5: Towards Better Conversation Comprehension and Contextual Continuity for ChatGPT
Pan Liang
Danwei Ye
Zihao Zhu
Yunchao Wang
Wang Xia
Ronghua Liang
Guodao Sun
65
4
0
10 Aug 2023
Cross-Domain Product Representation Learning for Rich-Content E-Commerce
Cross-Domain Product Representation Learning for Rich-Content E-Commerce
Xuehan Bai
Yan Li
Yong Cheng
Wenjie Yang
Quanming Chen
Han Li
61
4
0
10 Aug 2023
Classification of Human- and AI-Generated Texts: Investigating Features
  for ChatGPT
Classification of Human- and AI-Generated Texts: Investigating Features for ChatGPT
Lorenz Mindner
Tim Schlippe
Kristina Schaaff
DeLMO
62
46
0
10 Aug 2023
MetRoBERTa: Leveraging Traditional Customer Relationship Management Data
  to Develop a Transit-Topic-Aware Language Model
MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model
M. Leong
Awad Abdelhalim
Jude Ha
Dianne Patterson
Gabriel L. Pincus
Anthony B. Harris
Michael Eichler
Jinhua Zhao
57
7
0
09 Aug 2023
Transferable Models for Bioacoustics with Human Language Supervision
Transferable Models for Bioacoustics with Human Language Supervision
David Robinson
Adelaide Robinson
Lily Akrapongpisak
74
8
0
09 Aug 2023
Performance Analysis of Transformer Based Models (BERT, ALBERT and
  RoBERTa) in Fake News Detection
Performance Analysis of Transformer Based Models (BERT, ALBERT and RoBERTa) in Fake News Detection
Shafna Fitria Nur Azizah
Hasan Dwi Cahyono
S. W. Sihwi
Wisnu Widiarto
24
13
0
09 Aug 2023
Emotion-Conditioned Text Generation through Automatic Prompt
  Optimization
Emotion-Conditioned Text Generation through Automatic Prompt Optimization
Yarik Menchaca Resendiz
Roman Klinger
48
5
0
09 Aug 2023
Evaluating the Generation Capabilities of Large Chinese Language Models
Evaluating the Generation Capabilities of Large Chinese Language Models
Hui Zeng
Jingyuan Xue
Meng Hao
Chen Sun
Bin Ning
Na Zhang
ELM
82
12
0
09 Aug 2023
A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with
  Commonsense Knowledge
A Bipartite Graph is All We Need for Enhancing Emotional Reasoning with Commonsense Knowledge
Kailai Yang
Tianlin Zhang
Shaoxiong Ji
Sophia Ananiadou
67
5
0
09 Aug 2023
PETformer: Long-term Time Series Forecasting via Placeholder-enhanced
  Transformer
PETformer: Long-term Time Series Forecasting via Placeholder-enhanced Transformer
Shengsheng Lin
Weiwei Lin
Wentai Wu
Song Wang
Yongxiang Wang
AI4TS
77
21
0
09 Aug 2023
TBIN: Modeling Long Textual Behavior Data for CTR Prediction
TBIN: Modeling Long Textual Behavior Data for CTR Prediction
Shuwei Chen
Xiang Li
Jian Dong
Jin Zhang
Yongkang Wang
Xingxing Wang
74
3
0
09 Aug 2023
A Comparative Study of Sentence Embedding Models for Assessing Semantic
  Variation
A Comparative Study of Sentence Embedding Models for Assessing Semantic Variation
Deven M. Mistry
A. Minai
39
2
0
08 Aug 2023
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose
  Recommendation System?
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose Recommendation System?
Ali Pesaranghader
Touqir Sajed
35
1
0
08 Aug 2023
Single-Sentence Reader: A Novel Approach for Addressing Answer Position
  Bias
Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias
Son Quoc Tran
Matt Kretchmar
54
0
0
08 Aug 2023
Ahead of the Text: Leveraging Entity Preposition for Financial Relation
  Extraction
Ahead of the Text: Leveraging Entity Preposition for Financial Relation Extraction
Stefan Pasch
Dimitrios Petridis
39
3
0
08 Aug 2023
Revisiting Disentanglement and Fusion on Modality and Context in
  Conversational Multimodal Emotion Recognition
Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition
Bobo Li
Hao Fei
Lizi Liao
Yu Zhao
Chong Teng
Tat-Seng Chua
Donghong Ji
Fei Li
77
34
0
08 Aug 2023
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
Ziyu Zhu
Xiaojian Ma
Yixin Chen
Zhidong Deng
Siyuan Huang
Qing Li
LM&Ro
85
123
0
08 Aug 2023
Advancing Natural-Language Based Audio Retrieval with PaSST and Large
  Audio-Caption Data Sets
Advancing Natural-Language Based Audio Retrieval with PaSST and Large Audio-Caption Data Sets
Paul Primus
Khaled Koutini
Gerhard Widmer
71
13
0
08 Aug 2023
Portrayal: Leveraging NLP and Visualization for Analyzing Fictional
  Characters
Portrayal: Leveraging NLP and Visualization for Analyzing Fictional Characters
Md. Naimul Hoque
Bhavya Ghai
Kari Kraus
Niklas Elmqvist
59
16
0
08 Aug 2023
Previous
123...868788...213214215
Next