Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,493 papers shown
Title
Fake or Genuine? Contextualised Text Representation for Fake Review Detection
Rami Mohawesh
Shuxiang Xu
Matthew Springer
Muna Al-Hawawreh
Sumbal Maqsood
DeLMO
13
18
0
29 Dec 2021
Automatic Pharma News Categorization
S. Adaszewski
P. Kuner
Ralf J. Jaeger
OOD
26
3
0
28 Dec 2021
"A Passage to India": Pre-trained Word Embeddings for Indian Languages
Saurav Kumar
Saunack Kumar
Diptesh Kanojia
P. Bhattacharyya
75
31
0
27 Dec 2021
Evaluating Contextual Embeddings and their Extraction Layers for Depression Assessment
Matthew Matero
Albert Y. C. Hung
H. Andrew Schwartz
AI4MH
30
4
0
27 Dec 2021
Learning Bi-typed Multi-relational Heterogeneous Graph via Dual Hierarchical Attention Networks
Yu Zhao
Shaopeng Wei
Huaming Du
Xingyan Chen
Qing Li
Fuzhen Zhuang
Jiaheng Liu
Gang Kou
44
10
0
24 Dec 2021
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Shuohuan Wang
Yu Sun
Yang Xiang
Zhihua Wu
Siyu Ding
...
Tian Wu
Wei Zeng
Ge Li
Wen Gao
Haifeng Wang
ELM
39
79
0
23 Dec 2021
Sparse-softmax: A Simpler and Faster Alternative Softmax Transformation
Shaoshi Sun
Zhenyuan Zhang
B. Huang
Pengbin Lei
Jianlin Su
Shengfeng Pan
Jiarun Cao
UQCV
16
7
0
23 Dec 2021
Hybrid Curriculum Learning for Emotion Recognition in Conversation
Lin Yang
Yi Shen
Yue Mao
Longjun Cai
34
53
0
22 Dec 2021
How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness?
Xinhsuai Dong
Anh Tuan Luu
Min Lin
Shuicheng Yan
Hanwang Zhang
SILM
AAML
25
55
0
22 Dec 2021
Contrast and Generation Make BART a Good Dialogue Emotion Recognizer
Shimin Li
Hang Yan
Xipeng Qiu
25
84
0
21 Dec 2021
Diaformer: Automatic Diagnosis via Symptoms Sequence Generation
Junying Chen
Dongfang Li
Qingcai Chen
Wenxiu Zhou
Xin Liu
MedIm
35
30
0
20 Dec 2021
Cascading Adaptors to Leverage English Data to Improve Performance of Question Answering for Low-Resource Languages
Hariom A. Pandya
Bhavik Ardeshna
Brijesh S. Bhatt
29
6
0
18 Dec 2021
Lacuna Reconstruction: Self-supervised Pre-training for Low-Resource Historical Document Transcription
Nikolai Vogler
J. Allen
M. Miller
Taylor Berg-Kirkpatrick
32
5
0
16 Dec 2021
Learning Rich Representation of Keyphrases from Text
Mayank Kulkarni
Debanjan Mahata
Ravneet Arora
Rajarshi Bhowmik
VLM
32
65
0
16 Dec 2021
ErAConD : Error Annotated Conversational Dialog Dataset for Grammatical Error Correction
Xun Yuan
Derek Pham
Sam Davidson
Zhou Yu
30
6
0
15 Dec 2021
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation
Yu Li
Baolin Peng
Yelong Shen
Yi Mao
Lars Liden
Zhou Yu
Jianfeng Gao
24
53
0
15 Dec 2021
On the Use of External Data for Spoken Named Entity Recognition
Ankita Pasad
Felix Wu
Suwon Shon
Karen Livescu
Kyu Jeong Han
40
16
0
14 Dec 2021
Roof-Transformer: Divided and Joined Understanding with Knowledge Enhancement
Wei-Lin Liao
Chengwei Su
Wei-Yun Ma
32
0
0
13 Dec 2021
WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models
Benjamin Minixhofer
Fabian Paischer
Navid Rekabsaz
29
74
0
13 Dec 2021
Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension
Shusheng Xu
Yichen Liu
Xiaoyuan Yi
Siyuan Zhou
Huizi Li
Yi Wu
ELM
31
3
0
13 Dec 2021
Improving the Question Answering Quality using Answer Candidate Filtering based on Natural-Language Features
Aleksandr Gashkov
A. Perevalov
M. Eltsova
A. Both
19
3
0
10 Dec 2021
3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis
Jianhui Yu
Chaoyi Zhang
Heng Wang
Dingxin Zhang
Yang Song
Tiange Xiang
Dongnan Liu
Weidong (Tom) Cai
ViT
MedIm
21
32
0
09 Dec 2021
Detecting potentially harmful and protective suicide-related content on twitter: A machine learning approach
Hannah Metzler
Hubert Baginski
Thomas Niederkrotenthaler
David Garcia
AI4MH
30
13
0
09 Dec 2021
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip Torr
148
310
0
04 Dec 2021
Evaluating NLP Systems On a Novel Cloze Task: Judging the Plausibility of Possible Fillers in Instructional Texts
Zizhao Hu
Ravikiran Chanumolu
Xingyu Lin
Nayela Ayaz
Vincent Chi
ELM
17
4
0
03 Dec 2021
Single-Shot Black-Box Adversarial Attacks Against Malware Detectors: A Causal Language Model Approach
Junjie Hu
Mohammadreza Ebrahimi
Hsinchun Chen
AAML
18
11
0
03 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
23
29
0
02 Dec 2021
NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging
Zihan Liu
Feijun Jiang
Yuxiang Hu
Chen Shi
Pascale Fung
27
37
0
01 Dec 2021
A Comparative Study of Transformers on Word Sense Disambiguation
Avi Chawla
Nidhi Mulay
Vikas Bishnoi
Gaurav Dhama
Dr. Anil Kumar Singh
32
4
0
30 Nov 2021
EdiBERT, a generative model for image editing
Thibaut Issenhuth
Ugo Tanielian
Jérémie Mary
David Picard
DiffM
35
12
0
30 Nov 2021
End-to-End Referring Video Object Segmentation with Multimodal Transformers
Adam Botach
Evgenii Zheltonozhskii
Chaim Baskin
VOS
38
141
0
29 Nov 2021
Action based Network for Conversation Question Reformulation
Zheyu Ye
Jiang Liu
Qian Yu
Jianxun Ju
24
0
0
29 Nov 2021
An Empirical Study of Topic Transition in Dialogue
Mayank Soni
Brendan Spillane
E. Gilmartin
Christian Saam
Benjamin R. Cowan
Vincent P. Wade
19
4
0
28 Nov 2021
VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling
Tsu-Jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
Wenjie Wang
Lijuan Wang
Zicheng Liu
VLM
55
218
0
24 Nov 2021
Efficient Softmax Approximation for Deep Neural Networks with Attention Mechanism
Ihor Vasyltsov
Wooseok Chang
33
12
0
21 Nov 2021
Capitalization and Punctuation Restoration: a Survey
V. Pais
D. Tufis
21
19
0
21 Nov 2021
Seeking Common but Distinguishing Difference, A Joint Aspect-based Sentiment Analysis Model
Hongjiang Jing
Zuchao Li
Hai Zhao
Shu Jiang
27
25
0
18 Nov 2021
LAnoBERT: System Log Anomaly Detection based on BERT Masked Language Model
Yukyung Lee
Jina Kim
Pilsung Kang
17
79
0
18 Nov 2021
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Pengcheng He
Jianfeng Gao
Weizhu Chen
74
1,126
0
18 Nov 2021
WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia
Cheng-Mao Hsu
Cheng-Te Li
Diego Sáez-Trumper
Yi-Zhan Hsu
SSL
26
13
0
16 Nov 2021
Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection
Jan Philip Wahle
Nischal Ashok Kumar
Terry Ruas
Norman Meuschke
Tirthankar Ghosal
Bela Gipp
38
17
0
15 Nov 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
79
332
0
11 Nov 2021
ICDAR 2021 Competition on Document VisualQuestion Answering
Rubèn Pérez Tito
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
43
23
0
10 Nov 2021
Are Transformers More Robust Than CNNs?
Yutong Bai
Jieru Mei
Alan Yuille
Cihang Xie
ViT
AAML
195
258
0
10 Nov 2021
Focusing on Potential Named Entities During Active Label Acquisition
Ali Osman Berk Şapcı
Oznur Tastan
Reyyan Yeniterzi
29
2
0
06 Nov 2021
IBERT: Idiom Cloze-style reading comprehension with Attention
Ruiyang Qin
Haozheng Luo
Zheheng Fan
Ziang Ren
AIMat
28
10
0
05 Nov 2021
Leveraging Sentiment Analysis Knowledge to Solve Emotion Detection Tasks
Maude Nguyen-The
Guillaume-Alexandre Bilodeau
Jan Rockemann
30
4
0
05 Nov 2021
A Syntax-Guided Grammatical Error Correction Model with Dependency Tree Correction
Zhaohong Wan
Xiaojun Wan
38
6
0
05 Nov 2021
An Empirical Study of the Effectiveness of an Ensemble of Stand-alone Sentiment Detection Tools for Software Engineering Datasets
Gias Uddin
Yann-Gaël Guéhénuc
Foutse Khomh
C. Roy
17
8
0
04 Nov 2021
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models
Wei Ping
Chejian Xu
Shuohang Wang
Zhe Gan
Yu Cheng
Jianfeng Gao
Ahmed Hassan Awadallah
Yangqiu Song
VLM
ELM
AAML
38
216
0
04 Nov 2021
Previous
1
2
3
...
13
14
15
...
28
29
30
Next