ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,734 papers shown
Title
Multilingual Text Representation
Multilingual Text Representation
Fahim Faisal
48
0
0
02 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on
  downstream tasks
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
58
0
0
02 Sep 2023
Towards Code Watermarking with Dual-Channel Transformations
Towards Code Watermarking with Dual-Channel Transformations
Borui Yang
Wei Li
Liyao Xiang
Yue Liu
78
10
0
02 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Bias and Fairness in Large Language Models: A Survey
Isabel O. Gallegos
Ryan Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
140
609
0
02 Sep 2023
Contextual Biasing of Named-Entities with Large Language Models
Contextual Biasing of Named-Entities with Large Language Models
Chuanneng Sun
Zeeshan Ahmed
Yingyi Ma
Zhe Liu
Lucas Kabela
Yutong Pang
Ozlem Kalinli
KELM
69
7
0
01 Sep 2023
FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large
  Language Models in Federated Learning
FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning
Weirui Kuang
Bingchen Qian
Zitao Li
Daoyuan Chen
Dawei Gao
Xuchen Pan
Yuexiang Xie
Yaliang Li
Bolin Ding
Jingren Zhou
FedML
124
136
0
01 Sep 2023
SortedNet: A Scalable and Generalized Framework for Training Modular
  Deep Neural Networks
SortedNet: A Scalable and Generalized Framework for Training Modular Deep Neural Networks
Mojtaba Valipour
Mehdi Rezagholizadeh
Hossein Rajabzadeh
Parsa Kavehzadeh
Marzieh S. Tahaei
Boxing Chen
Ali Ghodsi
43
1
0
01 Sep 2023
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122
  Language Variants
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants
Lucas Bandarkar
Davis Liang
Benjamin Muller
Mikel Artetxe
Satya Narayan Shukla
Don Husa
Naman Goyal
Abhinandan Krishnan
Luke Zettlemoyer
Madian Khabsa
126
157
0
31 Aug 2023
Enhancing PLM Performance on Labour Market Tasks via Instruction-based
  Finetuning and Prompt-tuning with Rules
Enhancing PLM Performance on Labour Market Tasks via Instruction-based Finetuning and Prompt-tuning with Rules
Jarno Vrolijk
David Graus
46
2
0
31 Aug 2023
Exploring Cross-Cultural Differences in English Hate Speech Annotations:
  From Dataset Construction to Analysis
Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis
Nayeon Lee
Chani Jung
Jun-Hee Myung
Jiho Jin
Jose Camacho-Collados
Juho Kim
Alice Oh
102
23
0
31 Aug 2023
ViLTA: Enhancing Vision-Language Pre-training through Textual
  Augmentation
ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation
Weihan Wang
Zhiyong Yang
Bin Xu
Juanzi Li
Yankui Sun
VLM
96
8
0
31 Aug 2023
DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
Shaltiel Shmidman
Avi Shmidman
Moshe Koppel
47
8
0
31 Aug 2023
Thesis Distillation: Investigating The Impact of Bias in NLP Models on
  Hate Speech Detection
Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Fatma Elsafoury
84
3
0
31 Aug 2023
Generalised Winograd Schema and its Contextuality
Generalised Winograd Schema and its Contextuality
K. Lo
M. Sadrzadeh
Shane Mansfield
68
7
0
31 Aug 2023
MaintainoMATE: A GitHub App for Intelligent Automation of Maintenance
  Activities
MaintainoMATE: A GitHub App for Intelligent Automation of Maintenance Activities
Anas Nadeem
Muhammad Usman Sarwar
Muhammad Zubair Malik
82
0
0
31 Aug 2023
Emoji Promotes Developer Participation and Issue Resolution on GitHub
Emoji Promotes Developer Participation and Issue Resolution on GitHub
Yuhang Zhou
Xuan Lu
Ge Gao
Qiaozhu Mei
Wei Ai
141
4
0
30 Aug 2023
Catalog Phrase Grounding (CPG): Grounding of Product Textual Attributes
  in Product Images for e-commerce Vision-Language Applications
Catalog Phrase Grounding (CPG): Grounding of Product Textual Attributes in Product Images for e-commerce Vision-Language Applications
Wenyi Wu
Karim Bouyarmane
Ismail B. Tutar
33
2
0
30 Aug 2023
Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning
  Based on Visually Grounded Conversations
Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations
Kilichbek Haydarov
Xiaoqian Shen
Avinash Madasu
Mahmoud Salem
Jia Li
Gamaleldin F. Elsayed
Mohamed Elhoseiny
67
4
0
30 Aug 2023
ToddlerBERTa: Exploiting BabyBERTa for Grammar Learning and Language
  Understanding
ToddlerBERTa: Exploiting BabyBERTa for Grammar Learning and Language Understanding
Omer Veysel Cagatan
74
2
0
30 Aug 2023
Can Prompt Learning Benefit Radiology Report Generation?
Can Prompt Learning Benefit Radiology Report Generation?
Jun Wang
Lixing Zhu
A. Bhalerao
Yulan He
MedIm
86
2
0
30 Aug 2023
Materials Informatics Transformer: A Language Model for Interpretable
  Materials Properties Prediction
Materials Informatics Transformer: A Language Model for Interpretable Materials Properties Prediction
Hongshuo Huang
Rishikesh Magar
Chang Xu
A. Farimani
AI4CE
77
4
0
30 Aug 2023
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open
  Generative Large Language Models
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Neha Sengupta
Sunil Kumar Sahu
Bokang Jia
Satheesh Katipomu
Haonan Li
...
A. Jackson
Hector Xuguang Ren
Preslav Nakov
Timothy Baldwin
Eric P. Xing
LRM
101
41
0
30 Aug 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
204
40
0
30 Aug 2023
FPTQ: Fine-grained Post-Training Quantization for Large Language Models
FPTQ: Fine-grained Post-Training Quantization for Large Language Models
Qingyuan Li
Yifan Zhang
Liang Li
Peng Yao
Bo Zhang
Xiangxiang Chu
Yerui Sun
Li-Qiang Du
Yuchen Xie
MQ
108
13
0
30 Aug 2023
MerA: Merging Pretrained Adapters For Few-Shot Learning
MerA: Merging Pretrained Adapters For Few-Shot Learning
Shwai He
Run-Ze Fan
Liang Ding
Li Shen
Dinesh Manocha
Dacheng Tao
MoMe
73
12
0
30 Aug 2023
Introducing Language Guidance in Prompt-based Continual Learning
Introducing Language Guidance in Prompt-based Continual Learning
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
Luc Van Gool
D. Stricker
F. Tombari
Muhammad Zeshan Afzal
VLMCLL
103
51
0
30 Aug 2023
HAlf-MAsked Model for Named Entity Sentiment analysis
HAlf-MAsked Model for Named Entity Sentiment analysis
A. Kabaev
P. Podberezko
A. Kaznacheev
Sabina Abdullayeva
18
4
0
30 Aug 2023
Radiology-Llama2: Best-in-Class Large Language Model for Radiology
Radiology-Llama2: Best-in-Class Large Language Model for Radiology
Zheng Liu
Yiwei Li
Peng Shu
Aoxiao Zhong
Longtao Yang
...
Wen Liu
Dinggang Shen
Tianming Liu
Quanzheng Li
Xiang Li
LM&MA
80
42
0
29 Aug 2023
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style
  Transfer
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer
Zachary Horvitz
Ajay Patel
Chris Callison-Burch
Zhou Yu
Kathleen McKeown
DiffM
102
14
0
29 Aug 2023
Document AI: A Comparative Study of Transformer-Based, Graph-Based
  Models, and Convolutional Neural Networks For Document Layout Analysis
Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis
Sotirios Kastanas
Shaomu Tan
Yijiang He
74
1
0
29 Aug 2023
PronounFlow: A Hybrid Approach for Calibrating Pronouns in Sentences
PronounFlow: A Hybrid Approach for Calibrating Pronouns in Sentences
Nicos Isaak
59
1
0
29 Aug 2023
SpikeBERT: A Language Spikformer Learned from BERT with Knowledge
  Distillation
SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation
Changze Lv
Changze Lv
Jianhan Xu
Chenxi Gu
Zixuan Ling
Cenyuan Zhang
Xiaoqing Zheng
Xuanjing Huang
71
8
0
29 Aug 2023
Taxonomic Loss for Morphological Glossing of Low-Resource Languages
Taxonomic Loss for Morphological Glossing of Low-Resource Languages
Michael Ginn
Alexis Palmer
51
0
0
29 Aug 2023
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text
  Classification
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification
Jiadong Wang
Chengyu Wang
Cen Chen
Ming Gao
Jun Huang
Aoying Zhou
VLM
94
0
0
29 Aug 2023
Attention Visualizer Package: Revealing Word Importance for Deeper
  Insight into Encoder-Only Transformer Models
Attention Visualizer Package: Revealing Word Importance for Deeper Insight into Encoder-Only Transformer Models
A. A. Falaki
R. Gras
ViT
53
7
0
28 Aug 2023
Identifying and Mitigating the Security Risks of Generative AI
Identifying and Mitigating the Security Risks of Generative AI
Clark W. Barrett
Bradley L Boyd
Ellie Burzstein
Nicholas Carlini
Brad Chen
...
Zulfikar Ramzan
Khawaja Shams
Basel Alomair
Ankur Taly
Diyi Yang
SILM
125
101
0
28 Aug 2023
Breaking the Bank with ChatGPT: Few-Shot Text Classification for Finance
Breaking the Bank with ChatGPT: Few-Shot Text Classification for Finance
Lefteris Loukas
Ilias Stogiannidis
Prodromos Malakasiotis
Stavros Vassos
81
22
0
28 Aug 2023
A Multi-Task Semantic Decomposition Framework with Task-specific
  Pre-training for Few-Shot NER
A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER
Guanting Dong
Zechen Wang
Jinxu Zhao
Gang Zhao
Daichi Guo
...
Keqing He
Xuefeng Li
Liwen Wang
Xinyue Cui
Weiran Xu
84
22
0
28 Aug 2023
Mobile Foundation Model as Firmware
Mobile Foundation Model as Firmware
Jinliang Yuan
Chenchen Yang
Dongqi Cai
Shihe Wang
Xin Yuan
...
Di Zhang
Hanzi Mei
Xianqing Jia
Shangguang Wang
Mengwei Xu
120
22
0
28 Aug 2023
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient
  Parameter and Memory
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao
Bo Wan
Yanzhe Zhang
Xuecong Jia
Huchuan Lu
Long Chen
VLM
81
19
0
28 Aug 2023
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on
  Language, Multimodal, and Scientific GPT Models
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models
Kaiyuan Gao
Su He
Zhenyu He
Jiacheng Lin
Qizhi Pei
Jie Shao
Wei Zhang
LM&MASyDa
64
5
0
27 Aug 2023
Towards Unified Token Learning for Vision-Language Tracking
Towards Unified Token Learning for Vision-Language Tracking
Yaozong Zheng
Bineng Zhong
Qihua Liang
Guorong Li
Rongrong Ji
Xianxian Li
132
35
0
27 Aug 2023
A Wide Evaluation of ChatGPT on Affective Computing Tasks
A Wide Evaluation of ChatGPT on Affective Computing Tasks
Mostafa M. Amin
Rui Mao
Min Zhang
Björn W. Schuller
AI4MH
67
35
0
26 Aug 2023
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
Chengkun Wei
Wenlong Meng
Zhikun Zhang
M. Chen
Ming-Hui Zhao
Wenjing Fang
Lei Wang
Zihui Zhang
Wenzhi Chen
AAML
63
11
0
26 Aug 2023
FwdLLM: Efficient FedLLM using Forward Gradient
FwdLLM: Efficient FedLLM using Forward Gradient
Mengwei Xu
Dongqi Cai
Yaozong Wu
Xiang Li
Shangguang Wang
FedML
118
26
0
26 Aug 2023
WellXplain: Wellness Concept Extraction and Classification in Reddit
  Posts for Mental Health Analysis
WellXplain: Wellness Concept Extraction and Classification in Reddit Posts for Mental Health Analysis
Muskan Garg
AI4MH
52
10
0
25 Aug 2023
Party Prediction for Twitter
Party Prediction for Twitter
Kellin Pelrine
Anne Imouza
Zachary Yang
Jacob-Junqi Tian
Sacha Lévy
...
Aarash Feizi
Cécile Amadoro
A. Blais
Jean-François Godbout
Reihaneh Rabbany
59
2
0
25 Aug 2023
Rethinking Language Models as Symbolic Knowledge Graphs
Rethinking Language Models as Symbolic Knowledge Graphs
Vishwas Mruthyunjaya
Pouya Pezeshkpour
Estevam R. Hruschka
Nikita Bhutani
ELMALM
39
12
0
25 Aug 2023
Construction Grammar and Language Models
Construction Grammar and Language Models
Harish Tayyar Madabushi
Laurence Romain
P. Milin
Dagmar Divjak
128
5
0
25 Aug 2023
A Survey of Diffusion Based Image Generation Models: Issues and Their
  Solutions
A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang
Zheng Wang
Jin Huang
M. M. Tasnim
Wei Shi
VLM
83
22
0
25 Aug 2023
Previous
123...848586...213214215
Next