ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,798 papers shown
Title
ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection
  in Arabic Text
ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text
Maram Hasanain
Firoj Alam
Hamdy Mubarak
Samir Abdaljalil
Wajdi Zaghouani
Preslav Nakov
Giovanni Da San Martino
Abed Alhakim Freihat
63
44
0
06 Nov 2023
Architectural Sweet Spots for Modeling Human Label Variation by the
  Example of Argument Quality: It's Best to Relate Perspectives!
Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It's Best to Relate Perspectives!
Philipp Heinisch
Matthias Orlikowski
Julia Romberg
Philipp Cimiano
51
3
0
06 Nov 2023
Findings of the WMT 2023 Shared Task on Discourse-Level Literary
  Translation: A Fresh Orb in the Cosmos of LLMs
Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs
Longyue Wang
Zhaopeng Tu
Yan Gu
Siyou Liu
Dian Yu
...
Bonnie Webber
Philipp Koehn
Andy Way
Yulin Yuan
Shuming Shi
86
20
0
06 Nov 2023
Language Models are Super Mario: Absorbing Abilities from Homologous
  Models as a Free Lunch
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
Le Yu
Yu Bowen
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
124
337
0
06 Nov 2023
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection
Harika Abburi
Kalyani Roy
Michael Suesserman
Nirmala Pudota
Balaji Veeramani
Edward Bowen
Sanmitra Bhattacharya
DeLMO
94
10
0
06 Nov 2023
Adapting Pre-trained Generative Models for Extractive Question Answering
Adapting Pre-trained Generative Models for Extractive Question Answering
Prabir Mallick
Tapas Nayak
Indrajit Bhattacharya
54
4
0
06 Nov 2023
In-Context Learning for Knowledge Base Question Answering for Unmanned
  Systems based on Large Language Models
In-Context Learning for Knowledge Base Question Answering for Unmanned Systems based on Large Language Models
Yunlong Chen
Yaming Zhang
Jianfei Yu
Li Yang
Rui Xia
ELM
71
0
0
06 Nov 2023
CausalCite: A Causal Formulation of Paper Citations
CausalCite: A Causal Formulation of Paper Citations
Ishan Kumar
Zhijing Jin
Ehsan Mokhtarian
Siyuan Guo
Yuen Chen
Mrinmaya Sachan
Bernhard Schoelkopf
CML
96
0
0
05 Nov 2023
Large language models implicitly learn to straighten neural sentence
  trajectories to construct a predictive representation of natural language
Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language
Eghbal A. Hosseini
Evelina Fedorenko
LLMSV
61
6
0
05 Nov 2023
Robust Generalization Strategies for Morpheme Glossing in an Endangered
  Language Documentation Context
Robust Generalization Strategies for Morpheme Glossing in an Endangered Language Documentation Context
Michael Ginn
Alexis Palmer
57
5
0
05 Nov 2023
mahaNLP: A Marathi Natural Language Processing Library
mahaNLP: A Marathi Natural Language Processing Library
Vidula Magdum
Omkar Dhekane
Sharayu Hiwarkhedkar
Saloni Mittal
Raviraj Joshi
78
5
0
05 Nov 2023
Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation
  for Grounding-Based Vision and Language Models
Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models
Jingru Yi
Burak Uzkent
Oana Ignat
Zili Li
Amanmeet Garg
Xiang Yu
Linda Liu
VLM
78
1
0
05 Nov 2023
An Interdisciplinary Outlook on Large Language Models for Scientific
  Research
An Interdisciplinary Outlook on Large Language Models for Scientific Research
James Boyko
Joseph Cohen
Nathan Fox
Maria Han Veiga
Jennifer I-Hsiu Li
...
Andreas H. Rauch
Kenneth N. Reid
Soumi Tribedi
Anastasia Visheratina
Xin Xie
81
19
0
03 Nov 2023
Contextualizing the Limits of Model & Evaluation Dataset Curation on
  Semantic Similarity Classification Tasks
Contextualizing the Limits of Model & Evaluation Dataset Curation on Semantic Similarity Classification Tasks
Daniel Theron
39
0
0
03 Nov 2023
Too Much Information: Keeping Training Simple for BabyLMs
Too Much Information: Keeping Training Simple for BabyLMs
Lukas Edman
Lisa Bylinina
74
4
0
03 Nov 2023
Sentiment Analysis through LLM Negotiations
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Leilei Gan
Jiwei Li
Tianwei Zhang
Guoyin Wang
93
21
0
03 Nov 2023
Lost Your Style? Navigating with Semantic-Level Approach for
  Text-to-Outfit Retrieval
Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval
Junkyu Jang
Eugene Hwang
Sung-Hyuk Park
53
0
0
03 Nov 2023
Proto-lm: A Prototypical Network-Based Framework for Built-in
  Interpretability in Large Language Models
Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models
Sean Xie
Soroush Vosoughi
Saeed Hassanpour
122
4
0
03 Nov 2023
A New Korean Text Classification Benchmark for Recognizing the Political
  Intents in Online Newspapers
A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers
Beomjune Kim
Eunsun Lee
Dongbin Na
45
1
0
03 Nov 2023
FLAP: Fast Language-Audio Pre-training
FLAP: Fast Language-Audio Pre-training
Ching-Feng Yeh
Po-Yao Huang
Vasu Sharma
Shang-Wen Li
Gargi Ghosh
CLIPVLM
72
9
0
02 Nov 2023
Can Language Models Be Tricked by Language Illusions? Easier with
  Syntax, Harder with Semantics
Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
Yuhan Zhang
Edward Gibson
Forrest Davis
100
6
0
02 Nov 2023
People Make Better Edits: Measuring the Efficacy of LLM-Generated
  Counterfactually Augmented Data for Harmful Language Detection
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection
Indira Sen
Dennis Assenmacher
Mattia Samory
Isabelle Augenstein
Wil M.P. van der Aalst
Claudia Wagner
91
21
0
02 Nov 2023
Expressive TTS Driven by Natural Language Prompts Using Few Human
  Annotations
Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations
Hanglei Zhang
Yiwei Guo
Sen Liu
Xie Chen
Kai Yu
55
1
0
02 Nov 2023
Adapting Fake News Detection to the Era of Large Language Models
Adapting Fake News Detection to the Era of Large Language Models
Jinyan Su
Claire Cardie
Preslav Nakov
DeLMO
107
19
0
02 Nov 2023
ATHENA: Mathematical Reasoning with Thought Expansion
ATHENA: Mathematical Reasoning with Thought Expansion
JB. Kim
Hazel Kim
Joonghyuk Hahn
Yo-Sub Han
ReLMLRMAIMat
118
7
0
02 Nov 2023
Measuring Five Accountable Talk Moves to Improve Instruction at Scale
Measuring Five Accountable Talk Moves to Improve Instruction at Scale
Ashlee Kupor
Candice Morgan
Dorottya Demszky
39
7
0
02 Nov 2023
Blending Reward Functions via Few Expert Demonstrations for Faithful and
  Accurate Knowledge-Grounded Dialogue Generation
Blending Reward Functions via Few Expert Demonstrations for Faithful and Accurate Knowledge-Grounded Dialogue Generation
Wanyu Du
Yangfeng Ji
64
1
0
02 Nov 2023
A Review and Roadmap of Deep Causal Model from Different Causal
  Structures and Representations
A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations
Hang Chen
Keqing Du
Chenguang Li
Xinyu Yang
104
2
0
02 Nov 2023
Task-Agnostic Low-Rank Adapters for Unseen English Dialects
Task-Agnostic Low-Rank Adapters for Unseen English Dialects
Zedian Xiao
William B. Held
Yanchen Liu
Diyi Yang
102
9
0
02 Nov 2023
Self-Influence Guided Data Reweighting for Language Model Pre-training
Self-Influence Guided Data Reweighting for Language Model Pre-training
Megh Thakkar
Tolga Bolukbasi
Sriram Ganapathy
Shikhar Vashishth
Sarath Chandar
Partha P. Talukdar
MILM
111
26
0
02 Nov 2023
In-Context Prompt Editing For Conditional Audio Generation
In-Context Prompt Editing For Conditional Audio Generation
Ernie Chang
Pin-Jie Lin
Yang Li
Sidd Srinivasan
Gaël Le Lan
David Kant
Yangyang Shi
Forrest N. Iandola
Vikas Chandra
DiffM
49
4
0
01 Nov 2023
Latent Space Translation via Semantic Alignment
Latent Space Translation via Semantic Alignment
Valentino Maiorca
Luca Moschella
Antonio Norelli
Marco Fumero
Francesco Locatello
Emanuele Rodolà
124
23
0
01 Nov 2023
Boosting Summarization with Normalizing Flows and Aggressive Training
Boosting Summarization with Normalizing Flows and Aggressive Training
Yu Yang
Xiaotong Shen
AI4CETPM
82
0
0
01 Nov 2023
Can Large Language Models Design Accurate Label Functions?
Can Large Language Models Design Accurate Label Functions?
Naiqing Guan
Kaiwen Chen
Nick Koudas
ALM
58
7
0
01 Nov 2023
Text Rendering Strategies for Pixel Language Models
Text Rendering Strategies for Pixel Language Models
Jonas F. Lotz
Elizabeth Salesky
Phillip Rust
Desmond Elliott
VLM
85
12
0
01 Nov 2023
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot
  Classification
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification
Yongxin Huang
Kexin Wang
Sourav Dutta
Raj Nath Patel
Goran Glavaš
Iryna Gurevych
VLM
70
4
0
01 Nov 2023
Prompt-based Logical Semantics Enhancement for Implicit Discourse
  Relation Recognition
Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition
Chenxu Wang
Ping Jian
Mu Huang
53
5
0
01 Nov 2023
tmn at #SMM4H 2023: Comparing Text Preprocessing Techniques for
  Detecting Tweets Self-reporting a COVID-19 Diagnosis
tmn at #SMM4H 2023: Comparing Text Preprocessing Techniques for Detecting Tweets Self-reporting a COVID-19 Diagnosis
Anna Glazkova
50
1
0
01 Nov 2023
Unsupervised Lexical Simplification with Context Augmentation
Unsupervised Lexical Simplification with Context Augmentation
Takashi Wada
Timothy Baldwin
Jey Han Lau
46
1
0
01 Nov 2023
Syntactic Inductive Bias in Transformer Language Models: Especially
  Helpful for Low-Resource Languages?
Syntactic Inductive Bias in Transformer Language Models: Especially Helpful for Low-Resource Languages?
Luke Gessler
Nathan Schneider
46
1
0
01 Nov 2023
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue
  Agents
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents
Yang Deng
Wenxuan Zhang
Wai Lam
See-Kiong Ng
Tat-Seng Chua
LM&RoLLMAG
142
43
0
01 Nov 2023
GATSY: Graph Attention Network for Music Artist Similarity
GATSY: Graph Attention Network for Music Artist Similarity
Andrea Giuseppe Di Francesco
Giuliano Giampietro
Indro Spinelli
Danilo Comminiello
78
1
0
01 Nov 2023
Object-centric Video Representation for Long-term Action Anticipation
Object-centric Video Representation for Long-term Action Anticipation
Ce Zhang
Changcheng Fu
Shijie Wang
Nakul Agarwal
Kwonjoon Lee
Chiho Choi
Chen Sun
122
17
0
31 Oct 2023
On the effect of curriculum learning with developmental data for grammar
  acquisition
On the effect of curriculum learning with developmental data for grammar acquisition
Mattia Opper
J. Morrison
N. Siddharth
92
2
0
31 Oct 2023
Non-Compositionality in Sentiment: New Data and Analyses
Non-Compositionality in Sentiment: New Data and Analyses
Verna Dankers
Christopher G. Lucas
CoGe
129
1
0
31 Oct 2023
Increasing The Performance of Cognitively Inspired Data-Efficient
  Language Models via Implicit Structure Building
Increasing The Performance of Cognitively Inspired Data-Efficient Language Models via Implicit Structure Building
Omar Momen
David Arps
Laura Kallmeyer
AI4CE
74
2
0
31 Oct 2023
Zero-Shot Medical Information Retrieval via Knowledge Graph Embedding
Zero-Shot Medical Information Retrieval via Knowledge Graph Embedding
Yuqi Wang
Zeqiang Wang
Wei Wang
Qi Chen
Kaizhu Huang
Anh Nguyen
Suparna De
MedIm
34
2
0
31 Oct 2023
Breaking the Token Barrier: Chunking and Convolution for Efficient Long
  Text Classification with BERT
Breaking the Token Barrier: Chunking and Convolution for Efficient Long Text Classification with BERT
Aman Jaiswal
E. Milios
VLM
59
9
0
31 Oct 2023
A Transformer-Based Model With Self-Distillation for Multimodal Emotion
  Recognition in Conversations
A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations
Hui Ma
Jian Wang
Hongfei Lin
Bo Zhang
Yijia Zhang
Bo Xu
89
48
0
31 Oct 2023
Unveiling Black-boxes: Explainable Deep Learning Models for Patent
  Classification
Unveiling Black-boxes: Explainable Deep Learning Models for Patent Classification
Md. Shajalal
Sebastian Denef
Md. Rezaul Karim
Alexander Boden
Gunnar Stevens
XAI
50
6
0
31 Oct 2023
Previous
123...717273...214215216
Next