ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,764 papers shown
Title
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
GeoLLM: Extracting Geospatial Knowledge from Large Language Models
Rohin Manvi
Samar Khanna
Gengchen Mai
Marshall Burke
David B. Lobell
Stefano Ermon
70
55
0
10 Oct 2023
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
Karel DÓosterlinck
Semere Kiros Bitew
Brandon Papineau
Christopher Potts
Thomas Demeester
Chris Develder
64
9
0
09 Oct 2023
JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and
  Nonverbal Expressions
JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions
Detai Xin
Junfeng Jiang
Shinnosuke Takamichi
Yuki Saito
Akiko Aizawa
Hiroshi Saruwatari
59
12
0
09 Oct 2023
LLM for SoC Security: A Paradigm Shift
LLM for SoC Security: A Paradigm Shift
Dipayan Saha
Shams Tarek
Katayoon Yahyaei
S. Saha
Jingbo Zhou
M. Tehranipoor
Farimah Farahmandi
175
55
0
09 Oct 2023
Unleashing the power of Neural Collapse for Transferability Estimation
Unleashing the power of Neural Collapse for Transferability Estimation
Yuhe Ding
Bo Jiang
Lijun Sheng
Aihua Zheng
Jian Liang
CVBM
87
1
0
09 Oct 2023
Making Scalable Meta Learning Practical
Making Scalable Meta Learning Practical
Sang Keun Choe
Sanket Vaibhav Mehta
Hwijeen Ahn
Willie Neiswanger
Pengtao Xie
Emma Strubell
Eric Xing
115
16
0
09 Oct 2023
Integrating Stock Features and Global Information via Large Language
  Models for Enhanced Stock Return Prediction
Integrating Stock Features and Global Information via Large Language Models for Enhanced Stock Return Prediction
Yujie Ding
Shuai Jia
Tianyi Ma
Bingcheng Mao
Xiuze Zhou
Liuliu Li
Dongming Han
AIFin
143
9
0
09 Oct 2023
Dynamic Top-k Estimation Consolidates Disagreement between Feature
  Attribution Methods
Dynamic Top-k Estimation Consolidates Disagreement between Feature Attribution Methods
Jonathan Kamp
Lisa Beinborn
Antske Fokkens
FAtt
64
1
0
09 Oct 2023
IDTraffickers: An Authorship Attribution Dataset to link and connect
  Potential Human-Trafficking Operations on Text Escort Advertisements
IDTraffickers: An Authorship Attribution Dataset to link and connect Potential Human-Trafficking Operations on Text Escort Advertisements
V. Saxena
Benjamin Bashpole
Gijs Van Dijck
Gerasimos Spanakis
104
2
0
09 Oct 2023
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
Hongqiu Wu
Linfeng Liu
Haizhen Zhao
Min Zhang
LRMAI4CENAIELM
84
7
0
09 Oct 2023
Universal Multi-modal Entity Alignment via Iteratively Fusing Modality
  Similarity Paths
Universal Multi-modal Entity Alignment via Iteratively Fusing Modality Similarity Paths
Bolin Zhu
Xiaoze Liu
Xin Mao
Zhuo Chen
Lingbing Guo
Tao Gui
Qi Zhang
92
2
0
09 Oct 2023
Continuous Invariance Learning
Continuous Invariance Learning
Yong Lin
Fan Zhou
Lu Tan
Lintao Ma
Jiameng Liu
...
Yuan Yuan
Yu Liu
James Y. Zhang
Yujiu Yang
Hao Wang
CLLOOD
82
4
0
09 Oct 2023
Visual Storytelling with Question-Answer Plans
Visual Storytelling with Question-Answer Plans
Danyang Liu
Mirella Lapata
Frank Keller
CoGe
92
9
0
08 Oct 2023
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient
  Partially Relevant Video Retrieval
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
Yuting Wang
Jinpeng Wang
Bin Chen
Ziyun Zeng
Shu-Tao Xia
75
11
0
08 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text
  via Conditional Probability Curvature
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
94
153
0
08 Oct 2023
Enhancing Document-level Event Argument Extraction with Contextual Clues
  and Role Relevance
Enhancing Document-level Event Argument Extraction with Contextual Clues and Role Relevance
Wanlong Liu
Shaohuan Cheng
DingYi Zeng
Hong Qu
112
30
0
08 Oct 2023
Enhancing Argument Structure Extraction with Efficient Leverage of
  Contextual Information
Enhancing Argument Structure Extraction with Efficient Leverage of Contextual Information
Yun Luo
Zhen Yang
Fandong Meng
Yingjie Li
Jie Zhou
Yue Zhang
69
1
0
08 Oct 2023
Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot
  Performance via Probability Calibration
Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration
Ercong Nie
Helmut Schmid
Hinrich Schütze
UQCV
101
2
0
08 Oct 2023
BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
Yifan Jiang
Filip Ilievski
Kaixin Ma
Zhivar Sourati
LRMReLM
101
12
0
08 Oct 2023
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as
  You May Think -- Introducing AI Detectability Index
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index
Megha Chakraborty
S.M. Towhidul Islam Tonmoy
S. M. Mehedi
Krish Sharma
Niyar R. Barman
...
Tanay Kumar
Vinija Jain
Aman Chadha
Amit P. Sheth
Amitava Das
DeLMO
82
21
0
08 Oct 2023
Compresso: Structured Pruning with Collaborative Prompting Learns
  Compact Large Language Models
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models
Song Guo
Jiahang Xu
Li Zhang
Mao Yang
87
15
0
08 Oct 2023
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot
  Question Answering
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
Xiusi Chen
Jyun-Yu Jiang
Wei-Cheng Chang
Cho-Jui Hsieh
Hsiang-Fu Yu
Wei Wang
97
12
0
08 Oct 2023
The Troubling Emergence of Hallucination in Large Language Models -- An
  Extensive Definition, Quantification, and Prescriptive Remediations
The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations
Vipula Rawte
Swagata Chakraborty
Agnibh Pathak
Anubhav Sarkar
S.M. Towhidul Islam Tonmoy
Aman Chadha
Mikel Artetxe
Punit Daniel Simig
HILM
94
131
0
08 Oct 2023
TopicAdapt- An Inter-Corpora Topics Adaptation Approach
TopicAdapt- An Inter-Corpora Topics Adaptation Approach
Pritom Saha Akash
Trisha Das
Kevin Chen-Chuan Chang
38
0
0
08 Oct 2023
Exploring the Usage of Chinese Pinyin in Pretraining
Exploring the Usage of Chinese Pinyin in Pretraining
Baojun Wang
Kun Xu
Lifeng Shang
AI4CE
32
0
0
08 Oct 2023
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code
  Translation
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation
Weixiang Yan
Yuchen Tian
Yunzhe Li
Qian Chen
Wen Wang
119
42
0
08 Oct 2023
Prompt-to-OS (P2OS): Revolutionizing Operating Systems and
  Human-Computer Interaction with Integrated AI Generative Models
Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models
Gabriele Tolomei
Cesare Campagnano
Fabrizio Silvestri
Giovanni Trappolini
77
4
0
07 Oct 2023
VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via
  Pre-trained Models
VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models
Ziyi Yin
Muchao Ye
Tianrong Zhang
Tianyu Du
Jinguo Zhu
Han Liu
Jinghui Chen
Ting Wang
Fenglong Ma
AAMLVLMCoGe
89
44
0
07 Oct 2023
From Nuisance to News Sense: Augmenting the News with Cross-Document
  Evidence and Context
From Nuisance to News Sense: Augmenting the News with Cross-Document Evidence and Context
Jeremiah Milbauer
Ziqi Ding
Zhijin Wu
Tongshuang Wu
88
2
0
06 Oct 2023
Measuring Information in Text Explanations
Measuring Information in Text Explanations
Zining Zhu
Frank Rudzicz
FAtt
68
0
0
06 Oct 2023
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth
  Estimation
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Muhammad Osama Khan
Junbang Liang
Chun-Kai Wang
Shan Yang
Yu Lou
MDE
88
4
0
06 Oct 2023
Hermes: Unlocking Security Analysis of Cellular Network Protocols by
  Synthesizing Finite State Machines from Natural Language Specifications
Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications
Abdullah Al Ishtiaq
Sarkar Snigdha Sarathi Das
Syed Md Mukit Rashid
Ali Ranjbar
Kai Tu
...
Zhezheng Song
Weixuan Wang
M. Akon
Rui Zhang
Syed Rafiul Hussain
43
10
0
06 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
Tuomas Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
103
55
0
06 Oct 2023
A Comprehensive Evaluation of Large Language Models on Benchmark
  Biomedical Text Processing Tasks
A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks
Fangshuo Liao
Md Tahmid Rahman Laskar
Cruz Barnum
Jimmy Xiangji Huang
AI4MHLM&MA
97
82
0
06 Oct 2023
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Wanyun Cui
Qianle Wang
LRM
92
9
0
06 Oct 2023
Document-Level Relation Extraction with Relation Correlation Enhancement
Document-Level Relation Extraction with Relation Correlation Enhancement
Yusheng Huang
Zhouhan Lin
51
2
0
06 Oct 2023
Acoustic and linguistic representations for speech continuous emotion
  recognition in call center conversations
Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations
Manon Macary
Marie Tahon
Yannick Esteve
Daniel Luzzati
73
3
0
06 Oct 2023
Quantized Transformer Language Model Implementations on Edge Devices
Quantized Transformer Language Model Implementations on Edge Devices
Mohammad Wali Ur Rahman
Murad Mehrab Abrar
Hunter Gibbons Copening
Salim Hariri
Sicong Shao
Pratik Satam
Soheil Salehi
MQ
68
11
0
06 Oct 2023
Toward a Foundation Model for Time Series Data
Toward a Foundation Model for Time Series Data
Chin-Chia Michael Yeh
Xin Dai
Huiyuan Chen
Yan Zheng
Yujie Fan
...
Vivian Lai
Zhongfang Zhuang
Junpeng Wang
Liang Wang
Wei Zhang
AI4TSAI4CE
159
26
0
05 Oct 2023
OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable
  Evasion Attacks
OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks
Ofir Bar Tal
Adi Haviv
Amit H. Bermano
AAML
79
0
0
05 Oct 2023
The Anatomy of Deception: Technical and Human Perspectives on a
  Large-scale Phishing Campaign
The Anatomy of Deception: Technical and Human Perspectives on a Large-scale Phishing Campaign
Anargyros Chrysanthou
Yorgos Pantis
Constantinos Patsakis
61
1
0
05 Oct 2023
Tik-to-Tok: Translating Language Models One Token at a Time: An
  Embedding Initialization Strategy for Efficient Language Adaptation
Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation
François Remy
Pieter Delobelle
Bettina Berendt
Kris Demuynck
Thomas Demeester
79
3
0
05 Oct 2023
Reformulating Domain Adaptation of Large Language Models as
  Adapt-Retrieve-Revise
Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise
Zhen Wan
Yating Zhang
Yexiang Wang
Fei Cheng
Sadao Kurohashi
CLLAILaw
108
10
0
05 Oct 2023
SoK: Access Control Policy Generation from High-level Natural Language
  Requirements
SoK: Access Control Policy Generation from High-level Natural Language Requirements
Sakuna Jayasundara
N. Arachchilage
Giovanni Russello
40
2
0
05 Oct 2023
Observatory: Characterizing Embeddings of Relational Tables
Observatory: Characterizing Embeddings of Relational Tables
Tianji Cong
Madelon Hulsebos
Zhenjie Sun
Paul Groth
H. V. Jagadish
93
10
0
05 Oct 2023
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Zihao Lin
Yan Sun
Yifan Shi
Xueqian Wang
Lifu Huang
Li Shen
Dacheng Tao
98
12
0
04 Oct 2023
Large Language Model Cascades with Mixture of Thoughts Representations
  for Cost-efficient Reasoning
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning
Murong Yue
Jie Zhao
Min Zhang
Liang Du
Ziyu Yao
LRM
130
71
0
04 Oct 2023
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models
Deniz Bayazit
Negar Foroutan
Zeming Chen
Gail Weiss
Antoine Bosselut
KELM
105
16
0
04 Oct 2023
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for
  Decision Making
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Jeonghye Kim
Suyoung Lee
Woojun Kim
Young-Jin Sung
OffRL
102
19
0
04 Oct 2023
Kosmos-G: Generating Images in Context with Multimodal Large Language
  Models
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan
Li Dong
Shaohan Huang
Zhiliang Peng
Wenhu Chen
Furu Wei
VLM
152
68
0
04 Oct 2023
Previous
123...787980...214215216
Next