Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,769 papers shown
Title
SeqXGPT: Sentence-Level AI-Generated Text Detection
Pengyu Wang
Linyang Li
Ke Ren
Botian Jiang
Dong Zhang
Xipeng Qiu
DeLMO
101
60
0
13 Oct 2023
InstructTODS: Large Language Models for End-to-End Task-Oriented Dialogue Systems
Willy Chung
Samuel Cahyawijaya
Bryan Wilie
Holy Lovenia
Pascale Fung
80
6
0
13 Oct 2023
A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models
Takuma Udagawa
Aashka Trivedi
Michele Merler
Bishwaranjan Bhattacharjee
81
7
0
13 Oct 2023
GLoRE: Evaluating Logical Reasoning of Large Language Models
Hanmeng Liu
Zhiyang Teng
Ruoxi Ning
Jian Liu
Qiji Zhou
Yuexin Zhang
Yue Zhang
ReLM
ELM
LRM
164
8
0
13 Oct 2023
LLM-augmented Preference Learning from Natural Language
Inwon Kang
Sikai Ruan
Tyler Ho
Jui-Chien Lin
Farhad Mohsin
Oshani Seneviratne
Lirong Xia
60
3
0
12 Oct 2023
Towards Robust Multi-Modal Reasoning via Model Selection
Xiangyan Liu
Rongxue Li
Wei Ji
Tao Lin
LLMAG
LRM
90
6
0
12 Oct 2023
Enhancing Text-based Knowledge Graph Completion with Zero-Shot Large Language Models: A Focus on Semantic Enhancement
Rui Yang
Jiahao Zhu
Jianping Man
Li Fang
Yi Zhou
94
25
0
12 Oct 2023
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text
Chanho Park
Chengsong Lu
Mingjie Chen
Thomas Hain
147
3
0
12 Oct 2023
Core-sets for Fair and Diverse Data Summarization
S. Mahabadi
S. Trajanovski
76
3
0
12 Oct 2023
ClimateBERT-NetZero: Detecting and Assessing Net Zero and Reduction Targets
Tobias Schimanski
J. Bingler
Camilla Hyslop
Mathias Kraus
Markus Leippold
58
23
0
12 Oct 2023
Low-Resource Clickbait Spoiling for Indonesian via Question Answering
Ni Putu Intan Maharani
Ayu Purwarianti
Alham Fikri Aji
70
2
0
12 Oct 2023
To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer
Md. Mushfiqur Rahman
Fardin Ahsan Sakib
Fahim Faisal
Antonios Anastasopoulos
60
3
0
12 Oct 2023
Effects of Human Adversarial and Affable Samples on BERT Generalization
Aparna Elangovan
Jiayuan He
Yuan Li
Karin Verspoor
95
3
0
12 Oct 2023
LEMON: Lossless model expansion
Yite Wang
Jiahao Su
Hanlin Lu
Cong Xie
Tianyi Liu
Jianbo Yuan
Yanghua Peng
Ruoyu Sun
Hongxia Yang
73
14
0
12 Oct 2023
D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning
A. Maharana
Prateek Yadav
Mohit Bansal
98
34
0
11 Oct 2023
Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention
Huiyin Xue
Nikolaos Aletras
102
0
0
11 Oct 2023
Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations
Zhuoyan Li
Hangxiao Zhu
Zhuoran Lu
Ming Yin
SyDa
120
82
0
11 Oct 2023
Faithfulness Measurable Masked Language Models
Andreas Madsen
Siva Reddy
Sarath Chandar
83
3
0
11 Oct 2023
On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models
Thilini Wijesiriwardene
Ruwan Wickramarachchi
Aishwarya N. Reganti
Vinija Jain
Aman Chadha
Amit P. Sheth
Amitava Das
75
1
0
11 Oct 2023
Language Models As Semantic Indexers
Bowen Jin
Hansi Zeng
Guoyin Wang
Xiusi Chen
Tianxin Wei
...
Yang Li
Hanqing Lu
Suhang Wang
Jiawei Han
Xianfeng Tang
RALM
86
20
0
11 Oct 2023
Ontology Enrichment for Effective Fine-grained Entity Typing
Si-yuan Ouyang
Jiaxin Huang
Pranav Pillai
Yunyi Zhang
Yu Zhang
Jiawei Han
178
6
0
11 Oct 2023
Composite Backdoor Attacks Against Large Language Models
Hai Huang
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
AAML
80
49
0
11 Oct 2023
QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking
Liangming Pan
Xinyuan Lu
Min-Yen Kan
Preslav Nakov
LRM
99
23
0
11 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Jindong Wang
Xing Xie
Zheng Zhang
Yue Zhang
HILM
KELM
172
201
0
11 Oct 2023
Fast-ELECTRA for Efficient Pre-training
Chengyu Dong
Liyuan Liu
Hao Cheng
Jingbo Shang
Jianfeng Gao
Xiaodong Liu
79
2
0
11 Oct 2023
On the Impact of Cross-Domain Data on German Language Models
Amin Dada
Aokun Chen
C.A.I. Peng
Kaleb E. Smith
Ahmad Idrissi-Yaghir
...
Daniel Truhn
Jan Egger
Jiang Bian
Jens Kleesiek
Yonghui Wu
60
5
0
11 Oct 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
Yue Zhang
Leyang Cui
Enbo Zhao
Wei Bi
Shuming Shi
94
6
0
11 Oct 2023
An Analysis on Large Language Models in Healthcare: A Case Study of BioBERT
Shyni Sharaf
V. Anoop
LM&MA
36
2
0
11 Oct 2023
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations
Qizhi Pei
Wei Zhang
Jinhua Zhu
Kehan Wu
Kaiyuan Gao
Lijun Wu
Yingce Xia
Rui Yan
120
73
0
11 Oct 2023
PHALM: Building a Knowledge Graph from Scratch by Prompting Humans and a Language Model
Tatsuya Ide
Eiki Murata
Daisuke Kawahara
T. Yamazaki
Shengzhe Li
K. Shinzato
Toshinori Sato
LRM
102
2
0
11 Oct 2023
"A Tale of Two Movements": Identifying and Comparing Perspectives in #BlackLivesMatter and #BlueLivesMatter Movements-related Tweets using Weakly Supervised Graph-based Structured Prediction
Shamik Roy
Dan Goldwasser
88
4
0
11 Oct 2023
Argumentative Stance Prediction: An Exploratory Study on Multimodality and Few-Shot Learning
Arushi Sharma
Abhibha Gupta
Maneesh Bilalpur
58
6
0
11 Oct 2023
Jaeger: A Concatenation-Based Multi-Transformer VQA Model
Jieting Long
Zewei Shi
Penghao Jiang
Yidong Gan
53
0
0
11 Oct 2023
Comparing Styles across Languages: A Cross-Cultural Exploration of Politeness
Shreya Havaldar
Matthew Pressimone
Eric Wong
Lyle Ungar
123
2
0
11 Oct 2023
Document-Level Supervision for Multi-Aspect Sentiment Analysis Without Fine-grained Labels
Kasturi Bhattacharjee
Rashmi Gangadharaiah
26
0
0
10 Oct 2023
LLMs Killed the Script Kiddie: How Agents Supported by Large Language Models Change the Landscape of Network Threat Testing
Stephen Moskal
Sam Laney
Erik Hemberg
Una-May O’Reilly
76
21
0
10 Oct 2023
Improving Contrastive Learning of Sentence Embeddings with Focal-InfoNCE
Peng-Fei Hou
Xingyu Li
93
7
0
10 Oct 2023
A Comparative Study of Transformer-based Neural Text Representation Techniques on Bug Triaging
Atish Kumar Dipongkor
Kevin Moran
23
8
0
10 Oct 2023
Advancing Transformer's Capabilities in Commonsense Reasoning
Yu Zhou
Yunqiu Han
Hanyu Zhou
Yulun Wu
VLM
LRM
ReLM
52
0
0
10 Oct 2023
Uni3D: Exploring Unified 3D Representation at Scale
Junsheng Zhou
Jinsheng Wang
Baorui Ma
Yu-Shen Liu
Tiejun Huang
Xinlong Wang
113
98
0
10 Oct 2023
Multi-domain improves out-of-distribution and data-limited scenarios for medical image analysis
Ece Ozkan
Xavier Boix
OOD
46
0
0
10 Oct 2023
Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization
Caoyun Fan
Wenqing Chen
Jidong Tian
Yitian Li
Hao He
Yaohui Jin
CML
63
4
0
10 Oct 2023
Topic-DPR: Topic-based Prompts for Dense Passage Retrieval
Q. Xiao
Shuangyin Li
Lei Chen
122
2
0
10 Oct 2023
FTFT: Efficient and Robust Fine-Tuning by Transferring Training Dynamics
Yupei Du
Albert Gatt
Dong Nguyen
71
1
0
10 Oct 2023
Large Language Models for Propaganda Detection
Kilian Sprenkamp
Daniel Gordon Jones
L. Zavolokina
75
12
0
10 Oct 2023
P5: Plug-and-Play Persona Prompting for Personalized Response Selection
Joosung Lee
Min Sik Oh
Donghun Lee
67
3
0
10 Oct 2023
I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal Information Extraction
Yusheng Huang
Zhouhan Lin
57
5
0
10 Oct 2023
Gem5Pred: Predictive Approaches For Gem5 Simulation Time
Tian Yan
Xueyang Li
Sifat Ut Taki
Saeid Mehrdad
AI4CE
15
0
0
10 Oct 2023
BC4LLM: Trusted Artificial Intelligence When Blockchain Meets Large Language Models
Haoxiang Luo
Jian Luo
Athanasios V. Vasilakos
79
10
0
10 Oct 2023
Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation Extraction
C.A.I. Peng
Xi Yang
Kaleb E. Smith
Zehao Yu
Aokun Chen
Jiang Bian
Yonghui Wu
VLM
LRM
85
32
0
10 Oct 2023
Previous
1
2
3
...
77
78
79
...
214
215
216
Next