ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,783 papers shown
Title
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting
  Elusive Disinformation
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation
Jason Samuel Lucas
Adaku Uchendu
Michiharu Yamashita
Jooyoung Lee
Shaurya Rohatgi
Dongwon Lee
96
48
0
24 Oct 2023
A Joint Matrix Factorization Analysis of Multilingual Representations
A Joint Matrix Factorization Analysis of Multilingual Representations
Zheng Zhao
Yftah Ziser
Bonnie Webber
Shay B. Cohen
87
4
0
24 Oct 2023
TRAMS: Training-free Memory Selection for Long-range Language Modeling
TRAMS: Training-free Memory Selection for Long-range Language Modeling
Haofei Yu
Cunxiang Wang
Yue Zhang
Wei Bi
RALM
102
6
0
24 Oct 2023
Interpreting Answers to Yes-No Questions in User-Generated Content
Interpreting Answers to Yes-No Questions in User-Generated Content
Shivam Mathur
Keun Hee Park
Dhivya Chinnappa
Saketh Kotamraju
Eduardo Blanco
49
0
0
24 Oct 2023
Toward a Critical Toponymy Framework for Named Entity Recognition: A
  Case Study of Airbnb in New York City
Toward a Critical Toponymy Framework for Named Entity Recognition: A Case Study of Airbnb in New York City
Mikael Brunila
J. LaViolette
Sky CH-Wang
Priyanka Verma
Clara Féré
Grant McKenzie
26
1
0
23 Oct 2023
Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot
  Filling
Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot Filling
Yuanjun Shi
Linzhi Wu
Minglai Shao
70
3
0
23 Oct 2023
On the Dimensionality of Sentence Embeddings
On the Dimensionality of Sentence Embeddings
Hongwei Wang
Hongming Zhang
Dong Yu
AI4TSDML
55
4
0
23 Oct 2023
Towards Possibilities & Impossibilities of AI-generated Text Detection:
  A Survey
Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey
Soumya Suvra Ghosal
Souradip Chakraborty
Jonas Geiping
Furong Huang
Dinesh Manocha
Amrit Singh Bedi
DeLMO
99
37
0
23 Oct 2023
GRENADE: Graph-Centric Language Model for Self-Supervised Representation
  Learning on Text-Attributed Graphs
GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs
Yichuan Li
Kaize Ding
Kyumin Lee
SSL
88
25
0
23 Oct 2023
Federated Learning of Large Language Models with Parameter-Efficient
  Prompt Tuning and Adaptive Optimization
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization
Tianshi Che
Ji Liu
Yang Zhou
Jiaxiang Ren
Jiwen Zhou
Victor S. Sheng
H. Dai
Dejing Dou
96
56
0
23 Oct 2023
Affective and Dynamic Beam Search for Story Generation
Affective and Dynamic Beam Search for Story Generation
Tenghao Huang
Ehsan Qasemi
Bangzheng Li
He Wang
Faeze Brahman
Muhao Chen
Snigdha Chaturvedi
70
12
0
23 Oct 2023
'Don't Get Too Technical with Me': A Discourse Structure-Based Framework
  for Science Journalism
'Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Science Journalism
Ronald Cardenas
Bingsheng Yao
Dakuo Wang
Yufang Hou
102
0
0
23 Oct 2023
Leveraging Deep Learning for Abstractive Code Summarization of
  Unofficial Documentation
Leveraging Deep Learning for Abstractive Code Summarization of Unofficial Documentation
AmirHossein Naghshzan
Latifa Guerrouj
Olga Baysal
60
0
0
23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for
  Large Language Models
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
83
33
0
23 Oct 2023
System Combination via Quality Estimation for Grammatical Error
  Correction
System Combination via Quality Estimation for Grammatical Error Correction
Muhammad Reza Qorib
Hwee Tou Ng
43
5
0
23 Oct 2023
Linking Surface Facts to Large-Scale Knowledge Graphs
Linking Surface Facts to Large-Scale Knowledge Graphs
Gorjan Radevski
Kiril Gashteovski
Chia-Chien Hung
Carolin (Haas) Lawrence
Goran Glavaš
HILM
60
3
0
23 Oct 2023
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time
  Controllable Text Generation
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation
Tianqi Zhong
Quan Wang
Jingxuan Han
Yongdong Zhang
Zhendong Mao
92
9
0
23 Oct 2023
Paraphrase Types for Generation and Detection
Paraphrase Types for Generation and Detection
Jan Philip Wahle
Bela Gipp
Terry Ruas
70
4
0
23 Oct 2023
Adaptive Policy with Wait-$k$ Model for Simultaneous Translation
Adaptive Policy with Wait-kkk Model for Simultaneous Translation
Libo Zhao
Kai Fan
Wei Luo
Jing Wu
Shushu Wang
Ziqian Zeng
Zhongqiang Huang
92
10
0
23 Oct 2023
Transparency at the Source: Evaluating and Interpreting Language Models
  With Access to the True Distribution
Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution
Jaap Jumelet
Willem H. Zuidema
86
6
0
23 Oct 2023
Harnessing Attention Mechanisms: Efficient Sequence Reduction using
  Attention-based Autoencoders
Harnessing Attention Mechanisms: Efficient Sequence Reduction using Attention-based Autoencoders
Daniel Biermann
Fabrizio Palumbo
Morten Goodwin
Ole-Christoffer Granmo
107
0
0
23 Oct 2023
Large Language Models can Share Images, Too!
Large Language Models can Share Images, Too!
Young-Jun Lee
Dokyong Lee
Joo Won Sung
Jonghwan Hyeon
Ho-Jin Choi
MLLM
84
2
0
23 Oct 2023
What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared
  Properties in Large Concept Vocabularies
What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies
Amit Gajbhiye
Zied Bouraoui
Na Li
Usashi Chatterjee
Luis Espinosa Anke
Steven Schockaert
94
1
0
23 Oct 2023
Vision-Enhanced Semantic Entity Recognition in Document Images via
  Visually-Asymmetric Consistency Learning
Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning
Hao Wang
Xiahua Chen
Rui Wang
Chenhui Chu
70
0
0
23 Oct 2023
SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for
  Social Media NLP Research
SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research
Dimosthenis Antypas
Asahi Ushio
Francesco Barbieri
Leonardo Neves
Kiamehr Rezaee
Luis Espinosa-Anke
Jiaxin Pei
Jose Camacho-Collados
66
10
0
23 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future
  Directions
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Derek F. Wong
Lidia S. Chao
DeLMO
103
33
0
23 Oct 2023
Once Upon a $\textit{Time}$ in $\textit{Graph}$: Relative-Time
  Pretraining for Complex Temporal Reasoning
Once Upon a Time\textit{Time}Time in Graph\textit{Graph}Graph: Relative-Time Pretraining for Complex Temporal Reasoning
Sen Yang
Xin Li
Li Bing
Wai Lam
AI4CE
80
11
0
23 Oct 2023
Tree of Clarifications: Answering Ambiguous Questions with
  Retrieval-Augmented Large Language Models
Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models
Gangwoo Kim
Sungdong Kim
Byeongguk Jeon
Joonsuk Park
Jaewoo Kang
UQLM
70
30
0
23 Oct 2023
SpEL: Structured Prediction for Entity Linking
SpEL: Structured Prediction for Entity Linking
Hassan S. Shavarani
Anoop Sarkar
123
12
0
23 Oct 2023
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and
  Beyond
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Zhecan Wang
Long Chen
Haoxuan You
Keyang Xu
Yicheng He
Wenhao Li
Noal Codella
Kai-Wei Chang
Shih-Fu Chang
107
3
0
23 Oct 2023
Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion
  Recognition
Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition
Yige Xu
Zhiwei Zeng
Zhiqi Shen
VLM
82
3
0
23 Oct 2023
Unveiling the Multi-Annotation Process: Examining the Influence of
  Annotation Quantity and Instance Difficulty on Model Performance
Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance
Pritam Kadasi
Mayank Singh
59
3
0
23 Oct 2023
Meaning Representations from Trajectories in Autoregressive Models
Meaning Representations from Trajectories in Autoregressive Models
Tian Yu Liu
Matthew Trager
Alessandro Achille
Pramuditha Perera
Luca Zancato
Stefano Soatto
87
16
0
23 Oct 2023
Continual Named Entity Recognition without Catastrophic Forgetting
Continual Named Entity Recognition without Catastrophic Forgetting
Duzhen Zhang
Wei Cong
Jiahua Dong
Yahan Yu
Xiuyi Chen
Yonggang Zhang
Zhen Fang
66
12
0
23 Oct 2023
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data
  Augmentation for Multi-hop Fact Verification
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact Verification
Yingjie Zhu
Jiasheng Si
Yibo Zhao
Haiyang Zhu
Deyu Zhou
Yulan He
91
7
0
23 Oct 2023
Attention-Enhancing Backdoor Attacks Against BERT-based Models
Attention-Enhancing Backdoor Attacks Against BERT-based Models
Weimin Lyu
Songzhu Zheng
Lu Pang
Haibin Ling
Chao Chen
71
42
0
23 Oct 2023
GeoLM: Empowering Language Models for Geospatially Grounded Language
  Understanding
GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding
Zekun Li
Wenxuan Zhou
Yao-Yi Chiang
Muhao Chen
SyDa
90
32
0
23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation
  Regularization
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization
Mohammad Reza Ghasemi Madani
Pasquale Minervini
91
4
0
22 Oct 2023
Merging Generated and Retrieved Knowledge for Open-Domain QA
Merging Generated and Retrieved Knowledge for Open-Domain QA
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Lu Wang
RALM
91
38
0
22 Oct 2023
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce
ITEm: Unsupervised Image-Text Embedding Learning for eCommerce
Baohao Liao
Michael Kozielski
Sanjika Hewavitharana
Jiangbo Yuan
Shahram Khadivi
Tomer Lancewicki
SSL
25
0
0
22 Oct 2023
CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural
  Text
CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text
Abhilash Nandy
M. Kapadnis
Pawan Goyal
Niloy Ganguly
42
1
0
22 Oct 2023
Conversational Speech Recognition by Learning Audio-textual Cross-modal
  Contextual Representation
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation
Kun Wei
Bei Li
Hang Lv
Quan Lu
Ning Jiang
Lei Xie
92
4
0
22 Oct 2023
RSM-NLP at BLP-2023 Task 2: Bangla Sentiment Analysis using Weighted and
  Majority Voted Fine-Tuned Transformers
RSM-NLP at BLP-2023 Task 2: Bangla Sentiment Analysis using Weighted and Majority Voted Fine-Tuned Transformers
Pratinav Seth
Rashi Goel
Komal Mathur
Swetha Vemulapalli
41
1
0
22 Oct 2023
UniMAP: Universal SMILES-Graph Representation Learning
UniMAP: Universal SMILES-Graph Representation Learning
Shikun Feng
Lixin Yang
Wei-Ying Ma
Yanyan Lan
OffRL
72
6
0
22 Oct 2023
LUNA: A Model-Based Universal Analysis Framework for Large Language
  Models
LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Da Song
Xuan Xie
Jiayang Song
Derui Zhu
Yuheng Huang
Felix Juefei Xu
Lei Ma
ALM
106
6
0
22 Oct 2023
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain
Wei-wei Zhu
Xiaoling Wang
Huanran Zheng
Mosha Chen
Buzhou Tang
ELMLM&MA
69
36
0
22 Oct 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing
  Noisy Input
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
Minh Nguyen
Nancy F. Chen
79
0
0
21 Oct 2023
MeaeQ: Mount Model Extraction Attacks with Efficient Queries
MeaeQ: Mount Model Extraction Attacks with Efficient Queries
Chengwei Dai
Minxuan Lv
Kun Li
Wei Zhou
AAML
70
5
0
21 Oct 2023
Toward Stronger Textual Attack Detectors
Toward Stronger Textual Attack Detectors
Pierre Colombo
Marine Picot
Nathan Noiry
Guillaume Staerman
Pablo Piantanida
563
5
0
21 Oct 2023
Transductive Learning for Textual Few-Shot Classification in API-based
  Embedding Models
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models
Pierre Colombo
Victor Pellegrain
Malik Boudiaf
Victor Storchan
Myriam Tami
Ismail Ben Ayed
C´eline Hudelot
Pablo Piantanida
101
8
0
21 Oct 2023
Previous
123...747576...214215216
Next