XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown

Title
TopicBERT for Energy Efficient Document Classification Yatin Chaudhary Pankaj Gupta Khushbu Saxena Vivek Kulkarni Thomas Runkler Hinrich Schütze 75 21 0 15 Oct 2020
Positioning yourself in the maze of Neural Text Generation: A Task-Agnostic Survey Khyathi Chandu A. Black 76 0 0 14 Oct 2020
Text Classification Using Label Names Only: A Language Model Self-Training Approach Yu Meng Yunyi Zhang Jiaxin Huang Chenyan Xiong Heng Ji Chao Zhang Jiawei Han VLM 88 76 0 14 Oct 2020
Summarize, Outline, and Elaborate: Long-Text Generation via Hierarchical Supervision from Extractive Summaries Xiaofei Sun Zijun Sun Yuxian Meng Jiwei Li Chun Fan 61 20 0 14 Oct 2020
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search Gyuwan Kim Kyunghyun Cho 96 98 0 14 Oct 2020
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision Hao Tan Joey Tianyi Zhou CLIP 89 121 0 14 Oct 2020
With Little Power Comes Great Responsibility Dallas Card Peter Henderson Urvashi Khandelwal Robin Jia Kyle Mahowald Dan Jurafsky 281 119 0 13 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond Jimmy J. Lin Rodrigo Nogueira Andrew Yates VLM 397 628 0 13 Oct 2020
Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension Ekta Sood Simon Tannert Diego Frassinelli Andreas Bulling Ngoc Thang Vu HAI 75 57 0 13 Oct 2020
Aspect-based Document Similarity for Research Papers Malte Ostendorff Terry Ruas Till Blume Bela Gipp Georg Rehm 105 27 0 13 Oct 2020
CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations Fuli Luo Pengcheng Yang Shicheng Li Xuancheng Ren Xu Sun VLM SSL 73 16 0 13 Oct 2020
RGCL at SemEval-2020 Task 6: Neural Approaches to Definition Extraction Tharindu Ranasinghe Alistair Plum Constantin Orasan R. Mitkov NAI 21 2 0 13 Oct 2020
BRUMS at SemEval-2020 Task 12 : Transformer based Multilingual Offensive Language Identification in Social Media Tharindu Ranasinghe Hansi Hettiarachchi 60 20 0 13 Oct 2020
BRUMS at SemEval-2020 Task 3: Contextualised Embeddings for Predicting the (Graded) Effect of Context in Word Similarity Hansi Hettiarachchi Tharindu Ranasinghe 53 14 0 13 Oct 2020
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models Zhengbao Jiang Antonios Anastasopoulos Jun Araki Haibo Ding Graham Neubig HILM KELM 98 144 0 13 Oct 2020
Incorporating BERT into Parallel Sequence Decoding with Adapters Junliang Guo Zhirui Zhang Linli Xu Hao-Ran Wei Boxing Chen Enhong Chen 113 69 0 13 Oct 2020
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance Jianquan Li Xiaokang Liu Honghong Zhao Ruifeng Xu Min Yang Yaohong Jin 111 54 0 13 Oct 2020
Improving Self-supervised Pre-training via a Fully-Explored Masked Language Model Ming Zheng Dinghan Shen Yelong Shen Weizhu Chen Lin Xiao SSL 29 4 0 12 Oct 2020
Measuring and Reducing Gendered Correlations in Pre-trained Models Kellie Webster Xuezhi Wang Ian Tenney Alex Beutel Emily Pitler Ellie Pavlick Jilin Chen Ed Chi Slav Petrov FaML 117 260 0 12 Oct 2020
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification Jordan J. Bird Anikó Ekárt Diego Resende Faria 61 60 0 12 Oct 2020
PECOS: Prediction for Enormous and Correlated Output Spaces Hsiang-Fu Yu Kai Zhong Jiong Zhang Wei-Cheng Chang Inderjit S. Dhillon 130 85 0 12 Oct 2020
Webly Supervised Image Classification with Metadata: Automatic Noisy Label Correction via Visual-Semantic Graph Jingkang Yang Weirong Chen Xue Jiang Xiaopeng Yan Huabin Zheng Wayne Zhang NoLa 77 13 0 12 Oct 2020
On the Complementary Nature of Knowledge Graph Embedding, Fine Grain Entity Types, and Language Modeling Rajat Patel Francis Ferraro 40 1 0 12 Oct 2020
Counterfactual Variable Control for Robust and Interpretable Question Answering S. Yu Yulei Niu Shuohang Wang Jing Jiang Qianru Sun AAML OOD 93 9 0 12 Oct 2020
Meta-Context Transformers for Domain-Specific Response Generation Debanjana Kar Suranjana Samanta A. Azad 44 1 0 12 Oct 2020
Pre-trained Language Model Based Active Learning for Sentence Matching Guirong Bai Shizhu He Kang Liu Jun Zhao Zaiqing Nie 112 10 0 12 Oct 2020
Neural, Symbolic and Neural-Symbolic Reasoning on Knowledge Graphs Jing Zhang Bo Chen Lingxi Zhang Xirui Ke Haipeng Ding NAI 112 3 0 12 Oct 2020
Quantitative Argument Summarization and Beyond: Cross-Domain Key Point Analysis Roy Bar-Haim Yoav Kantor Lilach Eden Roni Friedman Dan Lahav Noam Slonim 82 47 0 11 Oct 2020
InfoMiner at WNUT-2020 Task 2: Transformer-based Covid-19 Informative Tweet Extraction Hansi Hettiarachchi Tharindu Ranasinghe MedIm 36 21 0 11 Oct 2020
SMYRF: Efficient Attention using Asymmetric Clustering Giannis Daras Nikita Kitaev Augustus Odena A. Dimakis 106 46 0 11 Oct 2020
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task Z. Li Hai Zhao Rui Wang Kehai Chen Masao Utiyama Eiichiro Sumita 66 15 0 11 Oct 2020
Contrastive Representation Learning: A Framework and Review Phúc H. Lê Khắc Graham Healy Alan F. Smeaton SSL AI4TS 330 722 0 10 Oct 2020
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks Stephen Mussmann Robin Jia Percy Liang 83 15 0 10 Oct 2020
Automated Concatenation of Embeddings for Structured Prediction Xinyu Wang Yong Jiang Nguyen Bach Tao Wang Zhongqiang Huang Fei Huang Kewei Tu 109 177 0 10 Oct 2020
What Do Position Embeddings Learn? An Empirical Study of Pre-Trained Language Model Positional Encoding Yu-An Wang Yun-Nung Chen SSL 59 95 0 10 Oct 2020
Adversarial Self-Supervised Data-Free Distillation for Text Classification Xinyin Ma Yongliang Shen Gongfan Fang Chen Chen Chenghao Jia Weiming Lu 124 24 0 10 Oct 2020
Recursive Top-Down Production for Sentence Generation with Latent Trees Shawn Tan Songlin Yang Timothy J. O'Donnell Alessandro Sordoni Aaron Courville 47 4 0 09 Oct 2020
Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels Harris Chan J. Kiros William Chan LRM 23 0 0 09 Oct 2020
TurboTransformers: An Efficient GPU Serving System For Transformer Models Jiarui Fang Yang Yu Chen-liang Zhao Jie Zhou 86 140 0 09 Oct 2020
Plug-and-Play Conversational Models Andrea Madotto Etsuko Ishii Zhaojiang Lin Sumanth Dathathri Pascale Fung 86 51 0 09 Oct 2020
Masked ELMo: An evolution of ELMo towards fully contextual RNN language models Grégory Senay Emmanuelle Salin 34 2 0 08 Oct 2020
Deep Learning Meets Projective Clustering Alaa Maalouf Harry Lang Daniela Rus Dan Feldman 113 9 0 08 Oct 2020
An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference Tianyu Liu Xin Zheng Xiaoan Ding Baobao Chang Zhifang Sui 73 25 0 08 Oct 2020
Improving Attention Mechanism with Query-Value Interaction Chuhan Wu Fangzhao Wu Tao Qi Yongfeng Huang 43 4 0 08 Oct 2020
Assessing Phrasal Representation and Composition in Transformers Lang-Chi Yu Allyson Ettinger CoGe 90 68 0 08 Oct 2020
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference Xiaoan Ding Tianyu Liu Baobao Chang Zhifang Sui Kevin Gimpel 85 8 0 08 Oct 2020
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition Yun He Ziwei Zhu Yin Zhang Qin Chen James Caverlee AI4MH 87 109 0 08 Oct 2020
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge Yun He Zhuoer Wang Yin Zhang Ruihong Huang James Caverlee 51 23 0 08 Oct 2020
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks Nikunj Saunshi Sadhika Malladi Sanjeev Arora 87 89 0 07 Oct 2020
A Self-supervised Approach for Semantic Indexing in the Context of COVID-19 Pandemic Nima Ebadi Peyman Najafirad OOD 42 2 0 07 Oct 2020