Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,503 papers shown
Title
Finding Structural Knowledge in Multimodal-BERT
Victor Milewski
Miryam de Lhoneux
Marie-Francine Moens
27
9
0
17 Mar 2022
PreTR: Spatio-Temporal Non-Autoregressive Trajectory Prediction Transformer
Lina Achaji
Thierno Barry
Thibault Fouqueray
Julien Moreau
François Aioun
François Charpillet
20
15
0
17 Mar 2022
Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss
Yantao Gong
Cao Liu
Fan Yang
Xunliang Cai
Guanglu Wan
Jiansong Chen
Weipeng Zhang
Houfeng Wang
UQCV
24
2
0
17 Mar 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
Wei Li
Can Gao
Guocheng Niu
Xinyan Xiao
Hao Liu
Jiachen Liu
Hua Wu
Haifeng Wang
MLLM
19
21
0
17 Mar 2022
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
26
14
0
15 Mar 2022
A Novel Perspective to Look At Attention: Bi-level Attention-based Explainable Topic Modeling for News Classification
Dairui Liu
Derek Greene
Ruihai Dong
33
11
0
14 Mar 2022
PERT: Pre-training BERT with Permuted Language Model
Yiming Cui
Ziqing Yang
Ting Liu
33
37
0
14 Mar 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai
Heng-Jui Chang
Wen-Chin Huang
Zili Huang
Kushal Lakhotia
...
Hsuan-Jui Chen
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
31
110
0
14 Mar 2022
Can pre-trained Transformers be used in detecting complex sensitive sentences? -- A Monsanto case study
Roelien C. Timmer
David Liebowitz
Surya Nepal
S. Kanhere
28
8
0
14 Mar 2022
SciNLI: A Corpus for Natural Language Inference on Scientific Text
Mobashir Sadat
Cornelia Caragea
AILaw
32
36
0
13 Mar 2022
Information retrieval for label noise document ranking by bag sampling and group-wise loss
Chunyuan Li
Jiajia Ding
Xing Hu
Fan Wang
RALM
21
0
0
12 Mar 2022
A comparative study of non-deep learning, deep learning, and ensemble learning methods for sunspot number prediction
Yuchen Dang
Ziqi Chen
Heng Li
Hai Shu
ELM
BDL
27
26
0
11 Mar 2022
PETR: Position Embedding Transformation for Multi-View 3D Object Detection
Yingfei Liu
Tiancai Wang
Xinming Zhang
Jian Sun
3DPC
71
529
0
10 Mar 2022
HealthPrompt: A Zero-shot Learning Paradigm for Clinical Natural Language Processing
Sonish Sivarajkumar
Yanshan Wang
VLM
LM&MA
39
54
0
09 Mar 2022
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding
Daizong Liu
Xiang Fang
Wei Hu
Pan Zhou
32
37
0
06 Mar 2022
IISERB Brains at SemEval 2022 Task 6: A Deep-learning Framework to Identify Intended Sarcasm in English
Tanuj Singh Shekhawat
M. Kumar
Udaybhan Rathore
Aditya Joshi
Jasabanta Patro
21
2
0
04 Mar 2022
Improving Health Mentioning Classification of Tweets using Contrastive Adversarial Training
Pervaiz Iqbal Khan
Shoaib Ahmed Siddiqui
Imran Razzak
Andreas Dengel
Sheraz Ahmed
26
3
0
03 Mar 2022
A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation
Tianxiang Sun
Xiangyang Liu
Wei-wei Zhu
Zhichao Geng
Lingling Wu
Yilong He
Yuan Ni
Guotong Xie
Xuanjing Huang
Xipeng Qiu
42
40
0
03 Mar 2022
Large-Scale Hate Speech Detection with Cross-Domain Transfer
Cagri Toraman
Furkan Şahinuç
E. Yilmaz
37
60
0
02 Mar 2022
Improving Performance of Automated Essay Scoring by using back-translation essays and adjusted scores
You-Jin Jong
Yong-Jin Kim
Ok-Chol Ri
25
6
0
01 Mar 2022
Semantic Sentence Composition Reasoning for Multi-Hop Question Answering
Qianglong Chen
LRM
26
2
0
01 Mar 2022
TraceNet: Tracing and Locating the Key Elements in Sentiment Analysis
Qinghua Zhao
Shuai Ma
19
0
0
28 Feb 2022
Automated Identification of Toxic Code Reviews Using ToxiCR
Jaydeb Sarker
Asif Kamal Turzo
Mingyou Dong
Amiangshu Bosu
27
32
0
26 Feb 2022
NoisyTune: A Little Noise Can Help You Finetune Pretrained Language Models Better
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
27
58
0
24 Feb 2022
Short-answer scoring with ensembles of pretrained language models
Christopher M. Ormerod
41
8
0
23 Feb 2022
Zero-shot Cross-lingual Transfer of Prompt-based Tuning with a Unified Multilingual Prompt
Lianzhe Huang
Shuming Ma
Dongdong Zhang
Furu Wei
Houfeng Wang
VLM
LRM
26
32
0
23 Feb 2022
Utilizing Out-Domain Datasets to Enhance Multi-Task Citation Analysis
Dominique Mercier
Syed Tahseen Raza Rizvi
Vikas Rajashekar
Sheraz Ahmed
Andreas Dengel
18
1
0
22 Feb 2022
VLP: A Survey on Vision-Language Pre-training
Feilong Chen
Duzhen Zhang
Minglun Han
Xiuyi Chen
Jing Shi
Shuang Xu
Bo Xu
VLM
82
213
0
18 Feb 2022
SAITS: Self-Attention-based Imputation for Time Series
Wenjie Du
David Cote
Yang Liu
AI4TS
30
232
0
17 Feb 2022
Tomayto, Tomahto. Beyond Token-level Answer Equivalence for Question Answering Evaluation
Jannis Bulian
Christian Buck
Wojciech Gajewski
Benjamin Boerschinger
Tal Schuster
36
44
0
15 Feb 2022
A Differential Entropy Estimator for Training Neural Networks
Georg Pichler
Pierre Colombo
Malik Boudiaf
Günther Koliander
Pablo Piantanida
25
21
0
14 Feb 2022
FedQAS: Privacy-aware machine reading comprehension with federated learning
Addi Ait-Mlouk
Sadi Alawadi
Salman Toor
Andreas Hellander
38
11
0
09 Feb 2022
Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
Yu Meng
Yunyi Zhang
Jiaxin Huang
Yu Zhang
Jiawei Han
56
56
0
09 Feb 2022
Universal Spam Detection using Transfer Learning of BERT Model
Vijay Srinivas Tida
Sonya Hsu
28
47
0
07 Feb 2022
Conversational Agents: Theory and Applications
M. Wahde
M. Virgolin
LLMAG
40
25
0
07 Feb 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
74
852
0
07 Feb 2022
Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss
Arka Mitra
Priyanshu Sankhala
20
6
0
05 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
50
250
0
03 Feb 2022
Relative Position Prediction as Pre-training for Text Encoders
Rickard Brüel-Gabrielsson
Chris Scarvelis
28
1
0
02 Feb 2022
GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records
Xi Yang
Aokun Chen
Nima M. Pournejatian
Hoo-Chang Shin
Kaleb E. Smith
...
Duane A. Mitchell
W. Hogan
E. Shenkman
Jiang Bian
Yonghui Wu
AI4MH
LM&MA
42
515
0
02 Feb 2022
WebFormer: The Web-page Transformer for Structure Information Extraction
Qifan Wang
Yi Fang
Anirudh Ravula
Fuli Feng
Xiaojun Quan
Dongfang Liu
ViT
149
65
0
01 Feb 2022
Stock2Vec: An Embedding to Improve Predictive Models for Companies
Ziruo Yi
Tingsong Xiao
Kaz-Onyeakazi Ijeoma
Ratnam Cheran
Yuvraj Baweja
Phillip Nelson
AIFin
32
3
0
27 Jan 2022
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
Suchin Gururangan
Dallas Card
Sarah K. Drier
E. K. Gade
Leroy Z. Wang
Zeyu Wang
Luke Zettlemoyer
Noah A. Smith
175
74
0
25 Jan 2022
Text Style Transfer for Bias Mitigation using Masked Language Modeling
E. Tokpo
T. Calders
24
31
0
21 Jan 2022
Identifying Adversarial Attacks on Text Classifiers
Zhouhang Xie
Jonathan Brophy
Adam Noack
Wencong You
Kalyani Asthana
Carter Perkins
Sabrina Reis
Sameer Singh
Daniel Lowd
AAML
31
9
0
21 Jan 2022
Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
Zhuoyuan Mao
Chenhui Chu
Sadao Kurohashi
17
6
0
20 Jan 2022
Sentiment Analysis: Predicting Yelp Scores
Bhanu Prakash Reddy Guda
Mashrin Srivastava
Deep Karkhanis
27
3
0
20 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
45
154
0
17 Jan 2022
Transferability in Deep Learning: A Survey
Junguang Jiang
Yang Shu
Jianmin Wang
Mingsheng Long
OOD
39
101
0
15 Jan 2022
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale
Samyam Rajbhandari
Conglong Li
Z. Yao
Minjia Zhang
Reza Yazdani Aminabadi
A. A. Awan
Jeff Rasley
Yuxiong He
47
288
0
14 Jan 2022
Previous
1
2
3
...
12
13
14
...
29
30
31
Next