Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,779 papers shown
Title
DesignQuizzer: A Community-Powered Conversational Agent for Learning Visual Design
Zhenhui Peng
Qiaoyi Chen
Zhiyu Shen
Xiaojuan Ma
Antti Oulasvirta
54
5
0
18 Oct 2023
Gold: A Global and Local-aware Denoising Framework for Commonsense Knowledge Graph Noise Detection
Zheye Deng
Weiqi Wang
Zhaowei Wang
Xin Liu
Yangqiu Song
62
9
0
18 Oct 2023
Investigating semantic subspaces of Transformer sentence embeddings through linear structural probing
Dmitry Nikolaev
Sebastian Padó
82
5
0
18 Oct 2023
Rather a Nurse than a Physician -- Contrastive Explanations under Investigation
Oliver Eberle
Ilias Chalkidis
Laura Cabello
Stephanie Brandl
75
10
0
18 Oct 2023
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
Caoyun Fan
Jidong Tian
Yitian Li
Wenqing Chen
Hao He
Yaohui Jin
LRM
71
4
0
18 Oct 2023
Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets
Su ah Lee
Seokjin Oh
Woohwan Jung
79
3
0
18 Oct 2023
Learning Co-Speech Gesture for Multimodal Aphasia Type Detection
Daeun Lee
Sejung Son
Hyolim Jeon
Seungbae Kim
Jinyoung Han
51
3
0
18 Oct 2023
Learning under Label Proportions for Text Classification
Jatin Chauhan
Xiaoxuan Wang
Wei Wang
65
1
0
18 Oct 2023
Open-ended Commonsense Reasoning with Unrestricted Answer Scope
Chen Ling
Xuchao Zhang
Xujiang Zhao
Yanchi Liu
Wei Cheng
Mika Oishi
Takao Osaki
Katsushi Matsuda
Haifeng Chen
Liang Zhao
ReLM
LRM
69
1
0
18 Oct 2023
Field-testing items using artificial intelligence: Natural language processing with transformers
Hotaka Maeda
21
2
0
18 Oct 2023
Concept-Guided Chain-of-Thought Prompting for Pairwise Comparison Scoring of Texts with Large Language Models
Patrick Y. Wu
Jonathan Nagler
Joshua A. Tucker
Solomon Messing
LRM
129
3
0
18 Oct 2023
VeRA: Vector-based Random Matrix Adaptation
D. J. Kopiczko
Tijmen Blankevoort
Yuki Markus Asano
VLM
85
164
0
17 Oct 2023
Neural Attention: Enhancing QKV Calculation in Self-Attention Mechanism with Neural Networks
Muhan Zhang
27
2
0
17 Oct 2023
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations
Yazhou Zhang
Mengyao Wang
Youxi Wu
Prayag Tiwari
Qiuchi Li
Benyou Wang
Jing Qin
148
24
0
17 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
82
0
0
17 Oct 2023
QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering
Haochen Shi
Weiqi Wang
Tianqing Fang
Baixuan Xu
Wenxuan Ding
Xin Liu
Yangqiu Song
116
7
0
17 Oct 2023
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation
Jaap Jumelet
Michael Hanna
Marianne de Heer Kloots
Anna Langedijk
Charlotte Pouw
Oskar van der Wal
82
3
0
17 Oct 2023
Entity Matching using Large Language Models
Ralph Peeters
Christian Bizer
86
15
0
17 Oct 2023
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Shiyuan Huang
Siddarth Mamidanna
Shreedhar Jangam
Yilun Zhou
Leilani H. Gilpin
LRM
MILM
ELM
114
77
0
17 Oct 2023
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing
Quoc-Nam Nguyen
Thang Chau Phan
Duc-Vu Nguyen
Kiet Van Nguyen
60
11
0
17 Oct 2023
H2O Open Ecosystem for State-of-the-art Large Language Models
Arno Candel
Jon McKinney
Philipp Singer
Pascal Pfeiffer
Maximilian Jeblick
Chun Ming Lee
Marcos V. Conde
VLM
55
4
0
17 Oct 2023
Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models
Yilmazcan Ozyurt
Stefan Feuerriegel
Ce Zhang
123
1
0
17 Oct 2023
Understanding writing style in social media with a supervised contrastively pre-trained transformer
Javier Huertas-Tato
Alejandro Martín
David Camacho
123
6
0
17 Oct 2023
Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation
S. Nigam
Aniket Deroy
Noel Shallum
Ayush Kumar Mishra
Anup Roy
Shubham Kumar Mishra
Arnab Bhattacharya
Saptarshi Ghosh
Kripabandhu Ghosh
AILaw
ELM
80
11
0
17 Oct 2023
Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation
Tomohito Kasahara
Daisuke Kawahara
80
3
0
17 Oct 2023
Correction Focused Language Model Training for Speech Recognition
Yingyi Ma
Zhe Liu
Ozlem Kalinli
KELM
96
3
0
17 Oct 2023
Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning
Huiming Wang
Zhaodonghui Li
Liying Cheng
De Wen Soh
Lidong Bing
80
3
0
17 Oct 2023
A State-Vector Framework for Dataset Effects
E. Sahak
Zining Zhu
Frank Rudzicz
64
1
0
17 Oct 2023
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks
Jiaying Wu
Bryan Hooi
89
64
0
16 Oct 2023
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
Ryan Shea
Zhou Yu
OffRL
97
8
0
16 Oct 2023
G-SPEED: General SParse Efficient Editing MoDel
Haoke Zhang
Yue Wang
Juntao Li
Xiabing Zhou
Min Zhang
SyDa
KELM
63
1
0
16 Oct 2023
Privacy in Large Language Models: Attacks, Defenses and Future Directions
Haoran Li
Yulin Chen
Jinglong Luo
Yan Kang
Xiaojin Zhang
Qi Hu
Chunkit Chan
Yangqiu Song
PILM
118
45
0
16 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
78
6
0
16 Oct 2023
Decomposed Prompt Tuning via Low-Rank Reparameterization
Yao Xiao
Lu Xu
Jiaxi Li
Wei Lu
Xiaoli Li
VLM
73
7
0
16 Oct 2023
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi
Raquel Fernández
Arianna Bisazza
KELM
HILM
219
76
0
16 Oct 2023
FiLM: Fill-in Language Models for Any-Order Generation
Tianxiao Shen
Hao-Chun Peng
Ruoqi Shen
Yao Fu
Zaïd Harchaoui
Yejin Choi
93
10
0
15 Oct 2023
Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia
Dimitris Gkoumas
Matthew Purver
Maria Liakata
63
2
0
15 Oct 2023
Rethinking Relation Classification with Graph Meaning Representations
Li Zhou
Wenyu Chen
DingYi Zeng
Hong Qu
Daniel Hershcovich
AI4CE
51
0
0
15 Oct 2023
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer
Boan Liu
Liang Ding
Li Shen
Keqin Peng
Yu Cao
Dazhao Cheng
Dacheng Tao
MoE
82
9
0
15 Oct 2023
EX-FEVER: A Dataset for Multi-hop Explainable Fact Verification
Huanhuan Ma
Weizhi Xu
Yifan Wei
Liuji Chen
Liang Wang
Qiang Liu
Shu Wu
Liang Wang
101
18
0
15 Oct 2023
Progressive Evidence Refinement for Open-domain Multimodal Retrieval Question Answering
Shuwen Yang
Anran Wu
Xingjiao Wu
Luwei Xiao
Tianlong Ma
Cheng Jin
Liang He
69
4
0
15 Oct 2023
DPZero: Private Fine-Tuning of Language Models without Backpropagation
Liang Zhang
Bingcong Li
K. K. Thekumparampil
Sewoong Oh
Niao He
94
15
0
14 Oct 2023
A Digital Language Coherence Marker for Monitoring Dementia
Dimitris Gkoumas
Adam Tsakalidis
Maria Liakata
47
1
0
14 Oct 2023
An Expression Tree Decoding Strategy for Mathematical Equation Generation
Wenqi Zhang
Yongliang Shen
Qingpeng Nong
Zeqi Tan
Zeqi Tan Yanna Ma
Weiming Lu
AIMat
98
6
0
14 Oct 2023
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents
Hyungjoo Chae
Yongho Song
Kai Tzu-iunn Ong
Taeyoon Kwon
Minjin Kim
Youngjae Yu
Dongha Lee
Dongyeop Kang
Jinyoung Yeo
LRM
88
42
0
13 Oct 2023
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Peng Li
Yeye He
Dror Yashar
Weiwei Cui
Song Ge
Haidong Zhang
D. Fainman
Dongmei Zhang
Surajit Chaudhuri
ALM
LMTD
87
82
0
13 Oct 2023
Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration
Yiquan Wu
Siying Zhou
Yifei Liu
Weiming Lu
Xiaozhong Liu
Yating Zhang
Changlong Sun
Leilei Gan
Kun Kuang
AILaw
ELM
84
40
0
13 Oct 2023
ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction
Jianghao Lin
Bo Chen
Hangyu Wang
Yunjia Xi
Yanru Qu
Xinyi Dai
Kangning Zhang
Ruiming Tang
Yong Yu
Weinan Zhang
174
34
0
13 Oct 2023
Regularization-Based Methods for Ordinal Quantification
Mirko Bunse
Alejandro Moreo
Fabrizio Sebastiani
M. Senz
33
1
0
13 Oct 2023
xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Chen Zhang
L. F. D’Haro
Chengguang Tang
Ke Shi
Guohua Tang
Haizhou Li
ELM
72
11
0
13 Oct 2023
Previous
1
2
3
...
76
77
78
...
214
215
216
Next