Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,802 papers shown
Title
SocialDial: A Benchmark for Socially-Aware Dialogue Systems
Haolan Zhan
Zhuang Li
Yufei Wang
Linhao Luo
Tao Feng
...
Lay-Ki Soon
Suraj Sharma
Ingrid Zukerman
Zhaleh Semnani Azad
Gholamreza Haffari
123
17
0
24 Apr 2023
CHEAT: A Large-scale Dataset for Detecting ChatGPT-writtEn AbsTracts
Peipeng Yu
Jiahan Chen
Xuan Feng
Zhihua Xia
175
45
0
24 Apr 2023
KInITVeraAI at SemEval-2023 Task 3: Simple yet Powerful Multilingual Fine-Tuning for Persuasion Techniques Detection
Timo Hromadka
Timotej Smolen
T. Remiš
Branislav Pecher
Ivan Srba
48
11
0
24 Apr 2023
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis [Experiment, Analysis & Benchmark]
Alexandros Zeakis
G. Papadakis
Dimitrios Skoutas
Manolis Koubarakis
78
39
0
24 Apr 2023
Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Deepanway Ghosal
Navonil Majumder
Ambuj Mehrish
Soujanya Poria
234
152
0
24 Apr 2023
PARAGRAPH2GRAPH: A GNN-based framework for layout paragraph analysis
Shuyong Wei
Nuo Xu
51
5
0
24 Apr 2023
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Bo Li
Gexiang Fang
Yang Yang
Quansen Wang
Wei Ye
Wen Zhao
Shikun Zhang
ELM
AI4MH
138
168
0
23 Apr 2023
Differentiate ChatGPT-generated and Human-written Medical Texts
Wenxiong Liao
Zheng Liu
Haixing Dai
Shaochen Xu
Zihao Wu
...
Xiaoke Huang
Dajiang Zhu
Hongmin Cai
Tianming Liu
Xiang Li
LM&MA
DeLMO
MedIm
AI4MH
62
60
0
23 Apr 2023
Graph Neural Networks for Text Classification: A Survey
Kunze Wang
Yihao Ding
S. Han
FaML
GNN
99
29
0
23 Apr 2023
Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks
Heng Wang
Wenqian Zhang
Yuyang Bai
Zhaoxuan Tan
Shangbin Feng
Qinghua Zheng
Minnan Luo
80
4
0
22 Apr 2023
SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval
Haitao Li
Qingyao Ai
Jia Chen
Qian Dong
Yueyue Wu
Yu-an Liu
C. L. Philip Chen
Qi Tian
AILaw
ELM
RALM
80
79
0
22 Apr 2023
CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval
Shangda Wu
Dingyao Yu
Xu Tan
Maosong Sun
CLIP
VLM
76
15
0
21 Apr 2023
LEIA: Linguistic Embeddings for the Identification of Affect
S. Aroyehun
Lukas Malik
Hannah Metzler
Nikolas Haimerl
Anna Flavia Di Natale
David Garcia
35
3
0
21 Apr 2023
Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs
Costas Mavromatis
V. Ioannidis
Shen Wang
Da Zheng
Soji Adeshina
Jun Ma
Han Zhao
Christos Faloutsos
George Karypis
79
31
0
20 Apr 2023
Word Sense Induction with Knowledge Distillation from BERT
Anik Saha
Alex Gittens
B. Yener
51
1
0
20 Apr 2023
MarsEclipse at SemEval-2023 Task 3: Multi-Lingual and Multi-Label Framing Detection with Contrastive Learning
Qisheng Liao
Meiting Lai
Preslav Nakov
VLM
43
10
0
20 Apr 2023
Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Shaoxiong Ji
Tianlin Zhang
Kailai Yang
Sophia Ananiadou
Min Zhang
Jörg Tiedemann
AI4MH
ALM
86
29
0
20 Apr 2023
GPT-NER: Named Entity Recognition via Large Language Models
Shuhe Wang
Xiaofei Sun
Xiaoya Li
Rongbin Ouyang
Leilei Gan
Tianwei Zhang
Jiwei Li
Guoyin Wang
108
202
0
20 Apr 2023
Interventional Probing in High Dimensions: An NLI Case Study
Julia Rozanova
Marco Valentino
Lucas C. Cordeiro
André Freitas
45
7
0
20 Apr 2023
Is Cross-modal Information Retrieval Possible without Training?
Hyunjin Choi
HyunJae Lee
Seongho Joe
Youngjune Gwon
49
1
0
20 Apr 2023
SemEval 2023 Task 6: LegalEval - Understanding Legal Texts
Ashutosh Modi
Prathamesh Kalamkar
S. Karn
Aman Tiwari
Abhinav Joshi
Sai Kiran Tanikella
S. Guha
Sachin Malhan
Vivek Raghavan
ELM
AILaw
55
42
0
19 Apr 2023
EC^2: Emergent Communication for Embodied Control
Yao Mu
Shunyu Yao
Mingyu Ding
Ping Luo
Chuang Gan
LM&Ro
79
20
0
19 Apr 2023
Hyperbolic Image-Text Representations
Karan Desai
Maximilian Nickel
Tanmay Rajpurohit
Justin Johnson
Ramakrishna Vedantam
VLM
109
67
0
18 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Xiuying Wei
Yunchen Zhang
Yuhang Li
Xiangguo Zhang
Ruihao Gong
Jian Ren
Zhengang Li
MQ
78
36
0
18 Apr 2023
Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task
Zihao Wu
Lu Zhang
Chao-Yang Cao
Xiao-Xing Yu
Haixing Dai
...
Quanzheng Li
Dinggang Shen
Xiang Li
Dajiang Zhu
Tianming Liu
LM&MA
66
39
0
18 Apr 2023
Revisiting k-NN for Fine-tuning Pre-trained Language Models
Lei Li
Jing Chen
Bo Tian
Ning Zhang
63
1
0
18 Apr 2023
D2CSE: Difference-aware Deep continuous prompts for Contrastive Sentence Embeddings
HyunJae Lee
VLM
52
0
0
18 Apr 2023
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning
Zheng Lian
Haiyang Sun
Guoying Zhao
Kang Chen
Mingyu Xu
...
Meng Wang
Min Zhang
Guoying Zhao
Björn W. Schuller
Jianhua Tao
96
51
0
18 Apr 2023
Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs
Da Silva Gameiro Henrique
Andrei Kucharavy
R. Guerraoui
DeLMO
83
8
0
18 Apr 2023
From Words to Music: A Study of Subword Tokenization Techniques in Symbolic Music Generation
Adarsh Kumar
Pedro Sarmento
73
4
0
18 Apr 2023
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese
Vésteinn Snaebjarnarson
A. Simonsen
Goran Glavaš
Ivan Vulić
84
23
0
18 Apr 2023
A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models
Qianqian Xie
Zheheng Luo
Benyou Wang
Sophia Ananiadou
LM&MA
VLM
70
11
0
18 Apr 2023
A Two-Stage Framework with Self-Supervised Distillation For Cross-Domain Text Classification
Yunlong Feng
Bohan Li
Libo Qin
Xiao Xu
Wanxiang Che
46
3
0
18 Apr 2023
HeRo: RoBERTa and Longformer Hebrew Language Models
Vitaly Shalumov
Harel Haskey
VLM
96
7
0
18 Apr 2023
Classification of US Supreme Court Cases using BERT-Based Techniques
Shubham Vatsal
Adam Meyers
J. Ortega
ELM
AILaw
53
3
0
17 Apr 2023
An Unbiased Transformer Source Code Learning with Semantic Vulnerability Graph
Nafis Tanveer Islam
G. Parra
Dylan Manuel
E. Bou-Harb
Peyman Najafirad
85
10
0
17 Apr 2023
Improving Autoregressive NLP Tasks via Modular Linearized Attention
Victor Agostinelli
Lizhong Chen
60
1
0
17 Apr 2023
LED: A Dataset for Life Event Extraction from Dialogs
Yi-Pei Chen
An-Zi Yen
Hen-Hsen Huang
Hideki Nakayama
Hsin-Hsi Chen
8
4
0
17 Apr 2023
Context-Dependent Embedding Utterance Representations for Emotion Recognition in Conversations
Patrícia Pereira
Helena Moniz
Isabel Dias
Joao Paulo Carvalho
79
9
0
17 Apr 2023
VECO 2.0: Cross-lingual Language Model Pre-training with Multi-granularity Contrastive Learning
Zhen-Ru Zhang
Chuanqi Tan
Songfang Huang
Fei Huang
VLM
64
5
0
17 Apr 2023
SkillGPT: a RESTful API service for skill extraction and standardization using a Large Language Model
Nan Li
Bo Kang
T. D. Bie
64
15
0
17 Apr 2023
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Juan Pablo Zuluaga
Amrutha Prasad
Iuliia Nigmatulina
P. Motlícek
Matthias Kleinert
67
23
0
16 Apr 2023
It's All in the Embedding! Fake News Detection Using Document Embeddings
Ciprian-Octavian Truică
Elena Simona Apostol
85
51
0
16 Apr 2023
MisRoBÆRTa: Transformers versus Misinformation
Ciprian-Octavian Truică
Elena Simona Apostol
66
39
0
16 Apr 2023
Permutation Equivariance of Transformers and Its Applications
Hengyuan Xu
Liyao Xiang
Hang Ye
Dixi Yao
Pengzhi Chu
Baochun Li
56
15
0
16 Apr 2023
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models
Yikang Liu
Ziyin Zhang
Wanyang Zhang
Shisen Yue
Xiaojing Zhao
Xinyuan Cheng
Yiwen Zhang
Hai Hu
DeLMO
103
55
0
16 Apr 2023
The Self-Perception and Political Biases of ChatGPT
Jérôme Rutinowski
Sven Franke
Jan Endendyk
Ina Dormuth
Markus Pauly
106
104
0
14 Apr 2023
OPI at SemEval 2023 Task 9: A Simple But Effective Approach to Multilingual Tweet Intimacy Analysis
Slawomir Dadas
57
2
0
14 Apr 2023
Keeping the Questions Conversational: Using Structured Representations to Resolve Dependency in Conversational Question Answering
Munazza Zaib
Quan Z. Sheng
W. Zhang
A. Mahmood
80
2
0
14 Apr 2023
Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10
David Thulke
Nico Daheim
Christian Dugast
Hermann Ney
83
7
0
14 Apr 2023
Previous
1
2
3
...
110
111
112
...
215
216
217
Next