Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 9,167 papers shown
Title
DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in Text
Shuaiyi Li
Yang Deng
Wai Lam
58
2
0
19 Oct 2023
Towards Anytime Fine-tuning: Continually Pre-trained Language Models with Hypernetwork Prompt
Gangwei Jiang
Caigao Jiang
Siqiao Xue
James Y. Zhang
Junqing Zhou
Defu Lian
Ying Wei
VLM
51
7
0
19 Oct 2023
Contrastive Learning for Inference in Dialogue
Etsuko Ishii
Yan Xu
Bryan Wilie
Ziwei Ji
Holy Lovenia
Willy Chung
Pascale Fung
40
0
0
19 Oct 2023
MTS-LOF: Medical Time-Series Representation Learning via Occlusion-Invariant Features
Huayu Li
Ana S. Carreon-Rascon
Xiwen Chen
Geng Yuan
Ao Li
AI4TS
19
5
0
19 Oct 2023
A Read-and-Select Framework for Zero-shot Entity Linking
Zhenran Xu
Yulin Chen
Baotian Hu
Min Zhang
44
5
0
19 Oct 2023
Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer
Qingru Zhang
Dhananjay Ram
Cole Hawkins
Sheng Zha
Tuo Zhao
60
15
0
19 Oct 2023
Automated Repair of Declarative Software Specifications in the Era of Large Language Models
Md Rashedul Hasan
Jiawei Li
Iftekhar Ahmed
Hamid Bagheri
55
2
0
19 Oct 2023
Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language Understanding
Jianing Wang
Qiushi Sun
Nuo Chen
Chengyu Wang
Jun Huang
Ming Gao
Xiang Li
UQLM
46
3
0
19 Oct 2023
Solving Hard Analogy Questions with Relation Embedding Chains
Nitesh Kumar
Steven Schockaert
39
1
0
18 Oct 2023
SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Mohammadreza Salehi
Sachin Mehta
Aditya Kusupati
Ali Farhadi
Hannaneh Hajishirzi
69
5
0
18 Oct 2023
CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation
Philipp Borchert
Jochen De Weerdt
Kristof Coussement
Arno De Caigny
Marie-Francine Moens
43
1
0
18 Oct 2023
DesignQuizzer: A Community-Powered Conversational Agent for Learning Visual Design
Zhenhui Peng
Qiaoyi Chen
Zhiyu Shen
Xiaojuan Ma
Antti Oulasvirta
29
5
0
18 Oct 2023
Gold: A Global and Local-aware Denoising Framework for Commonsense Knowledge Graph Noise Detection
Zheye Deng
Weiqi Wang
Zhaowei Wang
Xin Liu
Yangqiu Song
38
9
0
18 Oct 2023
Investigating semantic subspaces of Transformer sentence embeddings through linear structural probing
Dmitry Nikolaev
Sebastian Padó
56
5
0
18 Oct 2023
Rather a Nurse than a Physician -- Contrastive Explanations under Investigation
Oliver Eberle
Ilias Chalkidis
Laura Cabello
Stephanie Brandl
36
9
0
18 Oct 2023
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
Caoyun Fan
Jidong Tian
Yitian Li
Wenqing Chen
Hao He
Yaohui Jin
LRM
37
3
0
18 Oct 2023
Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets
Su ah Lee
Seokjin Oh
Woohwan Jung
41
3
0
18 Oct 2023
Learning Co-Speech Gesture for Multimodal Aphasia Type Detection
Daeun Lee
Sejung Son
Hyolim Jeon
Seungbae Kim
Jinyoung Han
29
3
0
18 Oct 2023
Learning under Label Proportions for Text Classification
Jatin Chauhan
Xiaoxuan Wang
Wei Wang
35
1
0
18 Oct 2023
Open-ended Commonsense Reasoning with Unrestricted Answer Scope
Chen Ling
Xuchao Zhang
Xujiang Zhao
Yanchi Liu
Wei Cheng
Mika Oishi
Takao Osaki
Katsushi Matsuda
Haifeng Chen
Liang Zhao
ReLM
LRM
34
1
0
18 Oct 2023
Field-testing items using artificial intelligence: Natural language processing with transformers
Hotaka Maeda
14
2
0
18 Oct 2023
VeRA: Vector-based Random Matrix Adaptation
D. J. Kopiczko
Tijmen Blankevoort
Yuki Markus Asano
VLM
41
138
0
17 Oct 2023
Neural Attention: Enhancing QKV Calculation in Self-Attention Mechanism with Neural Networks
Muhan Zhang
11
1
0
17 Oct 2023
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations
Yazhou Zhang
Mengyao Wang
Youxi Wu
Prayag Tiwari
Qiuchi Li
Benyou Wang
Jing Qin
63
23
0
17 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
53
0
0
17 Oct 2023
QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering
Haochen Shi
Weiqi Wang
Tianqing Fang
Baixuan Xu
Wenxuan Ding
Xin Liu
Yangqiu Song
74
7
0
17 Oct 2023
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation
Jaap Jumelet
Michael Hanna
Marianne de Heer Kloots
Anna Langedijk
Charlotte Pouw
Oskar van der Wal
34
3
0
17 Oct 2023
Entity Matching using Large Language Models
Ralph Peeters
Christian Bizer
43
13
0
17 Oct 2023
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Shiyuan Huang
Siddarth Mamidanna
Shreedhar Jangam
Yilun Zhou
Leilani H. Gilpin
LRM
MILM
ELM
66
68
0
17 Oct 2023
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing
Quoc-Nam Nguyen
Thang Chau Phan
Duc-Vu Nguyen
Kiet Van Nguyen
33
8
0
17 Oct 2023
H2O Open Ecosystem for State-of-the-art Large Language Models
Arno Candel
Jon McKinney
Philipp Singer
Pascal Pfeiffer
Maximilian Jeblick
Chun Ming Lee
Marcos V. Conde
VLM
30
4
0
17 Oct 2023
Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models
Yilmazcan Ozyurt
Stefan Feuerriegel
Ce Zhang
51
1
0
17 Oct 2023
Understanding writing style in social media with a supervised contrastively pre-trained transformer
Javier Huertas-Tato
Alejandro Martín
David Camacho
23
4
0
17 Oct 2023
Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation
S. Nigam
Aniket Deroy
Noel Shallum
Ayush Kumar Mishra
Anup Roy
Shubham Kumar Mishra
Arnab Bhattacharya
Saptarshi Ghosh
Kripabandhu Ghosh
AILaw
ELM
28
10
0
17 Oct 2023
Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation
Tomohito Kasahara
Daisuke Kawahara
51
2
0
17 Oct 2023
Correction Focused Language Model Training for Speech Recognition
Yingyi Ma
Zhe Liu
Ozlem Kalinli
KELM
53
3
0
17 Oct 2023
Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning
Huiming Wang
Zhaodonghui Li
Liying Cheng
De Wen Soh
Lidong Bing
53
2
0
17 Oct 2023
A State-Vector Framework for Dataset Effects
E. Sahak
Zining Zhu
Frank Rudzicz
38
1
0
17 Oct 2023
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks
Jiaying Wu
Bryan Hooi
48
56
0
16 Oct 2023
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
Ryan Shea
Zhou Yu
OffRL
47
7
0
16 Oct 2023
G-SPEED: General SParse Efficient Editing MoDel
Haoke Zhang
Yue Wang
Juntao Li
Xiabing Zhou
Min Zhang
SyDa
KELM
35
1
0
16 Oct 2023
Privacy in Large Language Models: Attacks, Defenses and Future Directions
Haoran Li
Yulin Chen
Jinglong Luo
Yan Kang
Xiaojin Zhang
Qi Hu
Chunkit Chan
Yangqiu Song
PILM
55
42
0
16 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
43
5
0
16 Oct 2023
Decomposed Prompt Tuning via Low-Rank Reparameterization
Yao Xiao
Lu Xu
Jiaxi Li
Wei Lu
Xiaoli Li
VLM
33
6
0
16 Oct 2023
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi
Raquel Fernández
Arianna Bisazza
KELM
HILM
53
64
0
16 Oct 2023
FiLM: Fill-in Language Models for Any-Order Generation
Tianxiao Shen
Hao-Chun Peng
Ruoqi Shen
Yao Fu
Zaïd Harchaoui
Yejin Choi
46
8
0
15 Oct 2023
Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia
Dimitris Gkoumas
Matthew Purver
Maria Liakata
36
2
0
15 Oct 2023
Rethinking Relation Classification with Graph Meaning Representations
Li Zhou
Wenyu Chen
DingYi Zeng
Hong Qu
Daniel Hershcovich
AI4CE
30
0
0
15 Oct 2023
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer
Boan Liu
Liang Ding
Li Shen
Keqin Peng
Yu Cao
Dazhao Cheng
Dacheng Tao
MoE
41
7
0
15 Oct 2023
EX-FEVER: A Dataset for Multi-hop Explainable Fact Verification
Huanhuan Ma
Weizhi Xu
Yifan Wei
Liuji Chen
Liang Wang
Qiang Liu
Shu Wu
Liang Wang
37
15
0
15 Oct 2023
Previous
1
2
3
...
72
73
74
...
182
183
184
Next