ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,734 papers shown
Title
Learning Evaluation Models from Large Language Models for Sequence Generation
Learning Evaluation Models from Large Language Models for Sequence Generation
Chenglong Wang
Hang Zhou
Kai-Chun Chang
Tongran Liu
Chunliang Zhang
Quan Du
Tong Xiao
Yue Zhang
Jingbo Zhu
ELM
154
4
0
08 Aug 2023
Generative Benchmark Creation for Table Union Search
Generative Benchmark Creation for Table Union Search
Koyena Pal
Aamod Khatiwada
Roee Shraga
Renée J. Miller
69
0
0
07 Aug 2023
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
J. Puentes
Angela Castillo
Wilmar Osejo
Yuly Calderón
Viviana Quintero
L. Saldarriaga
D. Agudelo
Pablo Arbelaez
55
2
0
07 Aug 2023
Detecting Spells in Fantasy Literature with a Transformer Based
  Artificial Intelligence
Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence
Marcel Moravek
Alexander Zender
Andreas Müller
17
0
0
07 Aug 2023
WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset
WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset
Hsuvas Borkakoty
Luis Espinosa-Anke
71
0
0
07 Aug 2023
Towards Controllable Natural Language Inference through Lexical
  Inference Types
Towards Controllable Natural Language Inference through Lexical Inference Types
Yingji Zhang
Danilo S. Carvalho
Ian Pratt-Hartmann
André Freitas
95
0
0
07 Aug 2023
Topological Interpretations of GPT-3
Topological Interpretations of GPT-3
Tianyi Sun
Bradley J. Nelson
50
2
0
07 Aug 2023
Accurate Retraining-free Pruning for Pretrained Encoder-based Language
  Models
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models
Seungcheol Park
Ho-Jin Choi
U. Kang
VLM
80
8
0
07 Aug 2023
LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models
  Fine-tuning
LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning
Longteng Zhang
Lin Zhang
Shaoshuai Shi
Xiaowen Chu
Yue Liu
AI4CE
72
107
0
07 Aug 2023
Analysis of the Evolution of Advanced Transformer-Based Language Models:
  Experiments on Opinion Mining
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining
Nour Eddine Zekaoui
Siham Yousfi
Maryem Rhanoui
M. Mikram
49
3
0
07 Aug 2023
Two Sides of Miscalibration: Identifying Over and Under-Confidence
  Prediction for Network Calibration
Two Sides of Miscalibration: Identifying Over and Under-Confidence Prediction for Network Calibration
Shuang Ao
Stefan Rueger
Advaith Siddharthan
UQCV
65
8
0
06 Aug 2023
System-Initiated Transitions from Chit-Chat to Task-Oriented Dialogues
  with Transition Info Extractor and Transition Sentence Generator
System-Initiated Transitions from Chit-Chat to Task-Oriented Dialogues with Transition Info Extractor and Transition Sentence Generator
Ye Liu
Stefan Ultes
Wolfgang Minker
Wolfgang Maier
84
4
0
06 Aug 2023
3D-EX : A Unified Dataset of Definitions and Dictionary Examples
3D-EX : A Unified Dataset of Definitions and Dictionary Examples
F. Almeman
Hadi Sheikhi
Luis Espinosa-Anke
71
1
0
06 Aug 2023
Spanish Pre-trained BERT Model and Evaluation Data
Spanish Pre-trained BERT Model and Evaluation Data
J. Cañete
Gabriel Chaperon
Rodrigo Fuentes
Jou-Hui Ho
Hojin Kang
Jorge Pérez
92
667
0
06 Aug 2023
A Symbolic Character-Aware Model for Solving Geometry Problems
A Symbolic Character-Aware Model for Solving Geometry Problems
Maizhen Ning
Qiufeng Wang
Kaizhu Huang
Xiaowei Huang
77
18
0
05 Aug 2023
PromptCARE: Prompt Copyright Protection by Watermark Injection and
  Verification
PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification
Hongwei Yao
Jian Lou
Kui Ren
Zhan Qin
AAMLVLM
103
31
0
05 Aug 2023
How Good Are SOTA Fake News Detectors
How Good Are SOTA Fake News Detectors
Matthew Iceland
49
6
0
04 Aug 2023
Toward Zero-Shot Instruction Following
Toward Zero-Shot Instruction Following
Renze Lou
Wenpeng Yin
121
1
0
04 Aug 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen
  Convolutional CLIP
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VLMCLIP
100
152
0
04 Aug 2023
Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning
  Attacks
Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks
Domenico Cotroneo
Cristina Improta
Pietro Liguori
R. Natella
SILM
102
30
0
04 Aug 2023
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation
  from Text
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text
Nandana Mihindukulasooriya
Sanju Tiwari
Carlos F. Enguix
K. Lata
91
62
0
04 Aug 2023
Learning to Select the Relevant History Turns in Conversational Question
  Answering
Learning to Select the Relevant History Turns in Conversational Question Answering
Munazza Zaib
Wei Emma Zhang
Quan Z. Sheng
S. Sagar
A. Mahmood
Yang Zhang
62
4
0
04 Aug 2023
A Survey of Spanish Clinical Language Models
A Survey of Spanish Clinical Language Models
Guillem García Subies
Á. Jiménez
Paloma Martínez
LM&MAELMLRM
57
0
0
04 Aug 2023
From Fake to Hyperpartisan News Detection Using Domain Adaptation
From Fake to Hyperpartisan News Detection Using Domain Adaptation
Razvan-Alexandru Smadu
Sebastian-Vasile Echim
Dumitru-Clementin Cercel
Iuliana Marin
Florin-Catalin Pop
67
3
0
04 Aug 2023
Learning Referring Video Object Segmentation from Weak Annotation
Learning Referring Video Object Segmentation from Weak Annotation
Wangbo Zhao
Ke Nan
Songyang Zhang
Kai-xiang Chen
Dahua Lin
Yang You
VOS
68
2
0
04 Aug 2023
Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature
  Extraction Techniques, Ensembling, and Deep Learning Models
Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature Extraction Techniques, Ensembling, and Deep Learning Models
M. Kamruzzaman
Gene Louis Kim
59
3
0
03 Aug 2023
Baby Llama: knowledge distillation from an ensemble of teachers trained
  on a small dataset with no performance penalty
Baby Llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty
I. Timiryasov
J. Tastet
87
53
0
03 Aug 2023
Supply chain emission estimation using large language models
Supply chain emission estimation using large language models
A. Jain
Manikandan Padmanaban
J. Hazra
S. Godbole
Kommy Weldemariam
54
2
0
03 Aug 2023
MAP: A Model-agnostic Pretraining Framework for Click-through Rate
  Prediction
MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction
Jianghao Lin
Yanru Qu
Wei Guo
Xinyi Dai
Ruiming Tang
Yong Yu
Weinan Zhang
72
21
0
03 Aug 2023
Baby's CoThought: Leveraging Large Language Models for Enhanced
  Reasoning in Compact Models
Baby's CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models
Zheyu Zhang
Han Yang
Bolei Ma
David Rügamer
Ercong Nie
LRM
97
4
0
03 Aug 2023
NBIAS: A Natural Language Processing Framework for Bias Identification
  in Text
NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Shaina Razaa
Muskan Garg
Deepak John Reji
Syed Raza Bashir
Chen Ding
86
50
0
03 Aug 2023
SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning
SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning
Keyu Duan
Qian Liu
Tat-Seng Chua
Shuicheng Yan
Wei Tsang Ooi
Qizhe Xie
Junxian He
129
60
0
03 Aug 2023
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using
  Beat-Synchronous Mixup Strategies
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies
Kai Chen
Yusong Wu
Haohe Liu
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
DiffM
94
81
0
03 Aug 2023
Careful Whisper -- leveraging advances in automatic speech recognition
  for robust and interpretable aphasia subtype classification
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification
Laurin Wagner
M. Zusag
Theresa Bloder
78
12
0
02 Aug 2023
Teaching Smaller Language Models To Generalise To Unseen Compositional
  Questions
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
Tim Hartill
N. Tan
Michael Witbrock
Patricia J. Riddle
ReLMKELMLRM
83
2
0
02 Aug 2023
CASSINI: Network-Aware Job Scheduling in Machine Learning Clusters
CASSINI: Network-Aware Job Scheduling in Machine Learning Clusters
S. Rajasekaran
M. Ghobadi
Aditya Akella
GNN
87
32
0
01 Aug 2023
CodeBPE: Investigating Subtokenization Options for Large Language Model
  Pretraining on Source Code
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code
Nadezhda Chirkova
Sergey Troshin
95
8
0
01 Aug 2023
Fountain -- an intelligent contextual assistant combining knowledge
  representation and language models for manufacturing risk identification
Fountain -- an intelligent contextual assistant combining knowledge representation and language models for manufacturing risk identification
Saurabh Kumar
D. Fuchs
K. Spindler
35
1
0
01 Aug 2023
Multimodal Multi-loss Fusion Network for Sentiment Analysis
Multimodal Multi-loss Fusion Network for Sentiment Analysis
Zehui Wu
Ziwei Gong
Jaywon Koo
Julia Hirschberg
113
27
0
01 Aug 2023
Adversarially Robust Neural Legal Judgement Systems
Adversarially Robust Neural Legal Judgement Systems
R. Raj
V. Devi
AILawELMAAML
38
0
0
31 Jul 2023
Towards Semantically Enriched Embeddings for Knowledge Graph Completion
Towards Semantically Enriched Embeddings for Knowledge Graph Completion
Mehwish Alam
F. V. Harmelen
Maribel Acosta
113
4
0
31 Jul 2023
Contrastive Learning for API Aspect Analysis
Contrastive Learning for API Aspect Analysis
G. M. Shahariar
Tahmid Hasan
Anindya Iqbal
Gias Uddin
47
0
0
31 Jul 2023
Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and
  Baseline via Detection
Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection
Xuanang Chen
Xianpei Han
Le Sun
Yingfei Sun
AAML
105
5
0
31 Jul 2023
Deep Dive into the Language of International Relations: NLP-based
  Analysis of UNESCO's Summary Records
Deep Dive into the Language of International Relations: NLP-based Analysis of UNESCO's Summary Records
Joanna Wojciechowska
Mateusz Sypniewski
Maria Śmigielska
Igor Kamiñski
Emilia Wisnios
Hanna Schreiber
Bartosz Pieliñski
52
2
0
31 Jul 2023
Utilisation of open intent recognition models for customer support
  intent detection
Utilisation of open intent recognition models for customer support intent detection
Rasheed Mohammad
Oliver Favell
Shariq Shah
Emmett Cooper
Edlira Vakaj
77
0
0
31 Jul 2023
AMOE: a Tool to Automatically Extract and Assess Organizational Evidence
  for Continuous Cloud Audit
AMOE: a Tool to Automatically Extract and Assess Organizational Evidence for Continuous Cloud Audit
Franz Deimling
Michela Fazzolari
62
1
0
31 Jul 2023
A Benchmark for Understanding Dialogue Safety in Mental Health Support
A Benchmark for Understanding Dialogue Safety in Mental Health Support
Huachuan Qiu
Tong Zhao
Anqi Li
Shuai Zhang
Hongliang He
Zhenzhong Lan
78
10
0
31 Jul 2023
SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural
  Machine Translation
SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation
Haiyue Song
Raj Dabre
Chenhui Chu
Sadao Kurohashi
Eiichiro Sumita
43
3
0
31 Jul 2023
Visual Captioning at Will: Describing Images and Videos Guided by a Few
  Stylized Sentences
Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Di Yang
Hongyu Chen
Xinglin Hou
T. Ge
Yuning Jiang
Qin Jin
85
7
0
31 Jul 2023
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for
  Complex Visual Reasoning Tasks
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks
Kousik Rajesh
Mrigank Raman
M. A. Karim
Pranit Chawla
VLM
58
2
0
31 Jul 2023
Previous
123...878889...213214215
Next