ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
arXiv: 1907.11692

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,734 papers shown
Title
MatchXML: An Efficient Text-label Matching Framework for Extreme Multi-label Text Classification
Hui Ye
Rajshekhar Sunderraman
Shihao Ji
102
3
0
25 Aug 2023
Bayesian Low-rank Adaptation for Large Language Models
Adam X. Yang
Maxime Robeyns
Xi Wang
Laurence Aitchison
AI4CE, BDL
165
55
0
24 Aug 2023
ZeroLeak: Using LLMs for Scalable and Cost Effective Side-Channel Patching
M. Tol
B. Sunar
94
5
0
24 Aug 2023
Multi-BERT for Embeddings for Recommendation System
Shashidhar Reddy Javaji
Krutika Sarode
43
2
0
24 Aug 2023
DLIP: Distilling Language-Image Pre-training
Huafeng Kuang
Jie Wu
Xiawu Zheng
Ming Li
Xuefeng Xiao
Rui Wang
Min Zheng
Rongrong Ji
VLM
70
4
0
24 Aug 2023
Code Llama: Open Foundation Models for Code
Baptiste Rozière
Jonas Gehring
Fabian Gloeckle
Sten Sootla
Itai Gat
...
Hugo Touvron
Louis Martin
Nicolas Usunier
Thomas Scialom
Gabriel Synnaeve
ELM, ALM
143
2,112
0
24 Aug 2023
POLCA: Power Oversubscription in LLM Cloud Providers
Pratyush Patel
Esha Choukse
Chaojie Zhang
Íñigo Goiri
Brijesh Warrier
Nithish Mahalingam
Ricardo Bianchini
48
14
0
24 Aug 2023
Large Language Models Vote: Prompting for Rare Disease Identification
David Oniani
Jordan Hilsman
Hang Dong
F. Gao
Shiven Verma
Yanshan Wang
62
13
0
24 Aug 2023
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities
Maximilian Mozes
Xuanli He
Bennett Kleinberg
Lewis D. Griffin
87
87
0
24 Aug 2023
kTrans: Knowledge-Aware Transformer for Binary Code Embedding
Wenyu Zhu
Hao Wang
Yuchen Zhou
Jiaming Wang
Zihan Sha
Zeyu Gao
Chao Zhang
88
10
0
24 Aug 2023
From Chatter to Matter: Addressing Critical Steps of Emotion Recognition Learning in Task-oriented Dialogue
Shutong Feng
Nurul Lubis
Benjamin Ruppik
Christian Geishauser
Michael Heck
Hsien-chin Lin
Carel van Niekerk
Renato Vukovic
Milica Gašić
80
3
0
24 Aug 2023
MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open Response Scenarios
Yu-Wen Chen
Zhou Yu
Julia Hirschberg
73
1
0
24 Aug 2023
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
Rui Mao
Guanyi Chen
Xulang Zhang
Frank Guerin
Min Zhang
ELM, LM&MA
85
112
0
24 Aug 2023
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Melissa Dell
Jacob Carlson
Tom Bryan
Emily Silcock
Abhishek Arora
Zejiang Shen
Luca D'Amico-Wong
Q. Le
Pablo Querubin
Leander Heldring
AI4TS
65
12
0
24 Aug 2023
Curriculum Learning with Adam: The Devil Is in the Wrong Details
Leon Weber
Jaap Jumelet
Paul Michel
Elia Bruni
Dieuwke Hupkes
ODL
83
3
0
23 Aug 2023
Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments
M. Rigaki
Ondrej Lukás
C. Catania
Sebastian Garcia
LLMAG
63
12
0
23 Aug 2023
CgT-GAN: CLIP-guided Text GAN for Image Captioning
Jiarui Yu
Haoran Li
Y. Hao
B. Zhu
Tong Xu
Xiangnan He
VLM, CLIP
67
13
0
23 Aug 2023
IncreLoRA: Incremental Parameter Allocation Method for Parameter-Efficient Fine-tuning
Feiyu F. Zhang
Liangzhi Li
Jun-Cheng Chen
Zhouqian Jiang
Bowen Wang
Yiming Qian
95
37
0
23 Aug 2023
Hybrid Retrieval and Multi-stage Text Ranking Solution at TREC 2022 Deep Learning Track
Guangwei Xu
Yangzhao Zhang
Longhui Zhang
Dingkun Long
Pengjun Xie
Rui Guo
40
3
0
23 Aug 2023
A Unified Framework for 3D Point Cloud Visual Grounding
Haojia Lin
Yongdong Luo
Xiawu Zheng
Lijiang Li
Chia-Wen Lin
Taisong Jin
Donghao Luo
Yan Wang
Liujuan Cao
Rongrong Ji
93
3
0
23 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
185
19
0
23 Aug 2023
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Walter Hernandez Cruz
K. Tylinski
Alastair Moore
Niall Roche
Nikhil Vadgama
Horst Treiblmaier
J. Shangguan
Paolo Tasca
Jiahua Xu
137
2
0
23 Aug 2023
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation
Yi-Chiao Su
Dongyan An
Yuan Xu
Kehan Chen
Yan Huang
86
3
0
22 Aug 2023
Empowering Refugee Claimants and their Lawyers: Using Machine Learning to Examine Decision-Making in Refugee Law
Claire Barale
39
0
0
22 Aug 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Xi Deng
Han Shi
Runhu Huang
Changlin Li
Hang Xu
Jianhua Han
James T. Kwok
Shen Zhao
Wei Zhang
Xiaodan Liang
CLIP, VLM
91
3
0
22 Aug 2023
Anonymity at Risk? Assessing Re-Identification Capabilities of Large Language Models
Alex Nyffenegger
Matthias Stürmer
Joel Niklaus
97
7
0
22 Aug 2023
Systematic Offensive Stereotyping (SOS) Bias in Language Models
Fatma Elsafoury
30
2
0
21 Aug 2023
Large Language Models for Software Engineering: A Systematic Literature Review
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
131
437
0
21 Aug 2023
Age Recommendation from Texts and Sentences for Children
Rashedur M. Rahman
Gwénolé Lecorvé
Nicolas Béchet
3DV
30
1
0
21 Aug 2023
Software Entity Recognition with Noise-Robust Learning
Nguyen Tai
Yifeng Di
J. Lee
Muhao Chen
Tianyi Zhang
76
4
0
21 Aug 2023
An Examination of the Compositionality of Large Generative Vision-Language Models
Teli Ma
Rong Li
Junwei Liang
CoGe
79
4
0
21 Aug 2023
cantnlp@LT-EDI-2023: Homophobia/Transphobia Detection in Social Media Comments using Spatio-Temporally Retrained Language Models
Sidney Gig-Jan Wong
Matthew Durward
Benjamin Adams
Jonathan Dunn
27
7
0
20 Aug 2023
Imaginations of WALL-E: Reconstructing Experiences with an Imagination-Inspired Module for Advanced AI Systems
Zeinab Taghavi
S. Gooran
Seyed Arshan Dalili
Hamidreza Amirzadeh
Mohammad Jalal Nematbakhsh
Hossein Sameti
48
2
0
20 Aug 2023
Scaling up Discovery of Latent Concepts in Deep NLP Models
Majd Hawasly
Fahim Dalvi
Nadir Durrani
123
5
0
20 Aug 2023
How Good Are LLMs at Out-of-Distribution Detection?
Bo Liu
Li-Ming Zhan
Zexin Lu
Yu Feng
Lei Xue
Xiao-Ming Wu
OODD
73
9
0
20 Aug 2023
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
Weixian Lei
Yixiao Ge
Jianfeng Zhang
Dylan Sun
Kun Yi
Ying Shan
Mike Zheng Shou
61
1
0
20 Aug 2023
A Survey on Fairness in Large Language Models
Yingji Li
Mengnan Du
Rui Song
Xin Wang
Ying Wang
ALM
126
70
0
20 Aug 2023
Open, Closed, or Small Language Models for Text Classification?
Hao Yu
Zachary Yang
Kellin Pelrine
Jean Francois Godbout
Reihaneh Rabbany
78
36
0
19 Aug 2023
Optimizing Multi-Class Text Classification: A Diverse Stacking Ensemble Framework Utilizing Transformers
Anusuya Krishnan
13
0
0
19 Aug 2023
HICL: Hashtag-Driven In-Context Learning for Social Media Natural Language Understanding
Hanzhuo Tan
Chunpu Xu
Jing Li
Yuqun Zhang
Zeyang Fang
Zeyu Chen
Baohua Lai
53
0
0
19 Aug 2023
Breaking Language Barriers: A Question Answering Dataset for Hindi and Marathi
Maithili Sabane
Onkar Litake
Amanat Chadha
126
2
0
19 Aug 2023
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Wenxuan Wang
Jingyuan Huang
Jen-tse Huang
Chang Chen
Jiazhen Gu
Pinjia He
Michael R. Lyu
VLM
61
6
0
18 Aug 2023
VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control
Zi-Yuan Hu
Yanyang Li
Michael R. Lyu
Liwei Wang
VLM
90
16
0
18 Aug 2023
Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery
Hongqiu Wang
Lei Zhu
Guang Yang
Yi-Ting Guo
Shenmin Zhang
Bo Xu
Yueming Jin
VOS
73
0
0
18 Aug 2023
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Samuel Albanie
Yining Pan
Tao Feng
Jianwen Jiang
Dong Ni
Yingya Zhang
Deli Zhao
VLM
79
40
0
18 Aug 2023
Characterizing Information Seeking Events in Health-Related Social Discourse
Omar Sharif
Madhusudan Basak
Tanzia Parvin
Ava Scharfstein
Alphonso Bradham
J. Borodovsky
S. Lord
S. Preum
61
7
0
17 Aug 2023
Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?
A. Mastropaolo
M. Di Penta
Gabriele Bavota
63
9
0
17 Aug 2023
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction
Dong Wang
Kave Salamatian
Yunqing Xia
Weiwei Deng
Qi Zhang
56
14
0
17 Aug 2023
Chinese Spelling Correction as Rephrasing Language Model
Linfeng Liu
Hongqiu Wu
Hai Zhao
LRM
88
17
0
17 Aug 2023
Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction
Yuanzhen Luo
Qingyu Zhou
F. Zhou
DiffM
61
2
0
17 Aug 2023