v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,524 papers shown

Title
Is Attention always needed? A Case Study on Language Identification from Speech A. Mandal Santanu Pal Indranil Dutta Mahidas Bhattacharya S. Naskar 48 6 0 05 Oct 2021
Autoregressive Diffusion Models Emiel Hoogeboom Alexey A. Gritsenko Jasmijn Bastings Ben Poole Rianne van den Berg Tim Salimans DiffM 134 155 0 05 Oct 2021
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation Chen Zhang L. F. D’Haro Yiming Chen Thomas Friedrichs Haizhou Li 66 5 0 05 Oct 2021
A Survey On Neural Word Embeddings Erhan Sezerer Selma Tekir AI4TS 90 13 0 05 Oct 2021
Classification of hierarchical text using geometric deep learning: the case of clinical trials corpus Sohrab Ferdowsi Nikolay Borissov J. Knafou P. Amini Douglas Teodoro 33 7 0 04 Oct 2021
Revisiting Self-Training for Few-Shot Learning of Language Model Yiming Chen Yan Zhang Chen Zhang Grandee Lee Ran Cheng Haizhou Li 71 42 0 04 Oct 2021
Scheduling Optimization Techniques for Neural Network Training Hyungjun Oh Junyeol Lee HyeongJu Kim Jiwon Seo 55 1 0 03 Oct 2021
Swiss-Judgment-Prediction: A Multilingual Legal Judgment Prediction Benchmark Joel Niklaus Ilias Chalkidis Matthias Sturmer ELM AILaw 67 70 0 02 Oct 2021
ProTo: Program-Guided Transformer for Program-Guided Tasks Zelin Zhao Karan Samel Binghong Chen Le Song ViT LM&Ro 100 30 0 02 Oct 2021
Fast Multi-Resolution Transformer Fine-tuning for Extreme Multi-label Text Classification Jiong Zhang Wei-Cheng Chang Hsiang-Fu Yu Inderjit S. Dhillon 117 103 0 01 Oct 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models Robert Wolfe Aylin Caliskan 125 51 0 01 Oct 2021
A Survey of Knowledge Enhanced Pre-trained Models Jian Yang Xinyu Hu Gang Xiao Yulong Shen KELM 109 6 0 01 Oct 2021
Focused Contrastive Training for Test-based Constituency Analysis Benjamin Roth Erion cCano 31 0 0 30 Sep 2021
Fine-tuning wav2vec2 for speaker recognition Nik Vaessen David A. van Leeuwen 116 109 0 30 Sep 2021
First to Possess His Statistics: Data-Free Model Extraction Attack on Tabular Data Masataka Tasumi Kazuki Iwahana Naoto Yanai Katsunari Shishido Toshiya Shimizu Yuji Higuchi I. Morikawa Jun Yajima AAML 78 4 0 30 Sep 2021
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System Yixuan Su Lei Shu Elman Mansimov Arshit Gupta Deng Cai Yi-An Lai Yi Zhang 229 195 0 29 Sep 2021
Multimodal Emotion Recognition with High-level Speech and Text Features M. R. Makiuchi Kuniaki Uto Koichi Shinoda 85 72 0 29 Sep 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding Hu Xu Gargi Ghosh Po-Yao (Bernie) Huang Dmytro Okhonko Armen Aghajanyan Florian Metze Luke Zettlemoyer Florian Metze Luke Zettlemoyer Christoph Feichtenhofer CLIP VLM 335 584 0 28 Sep 2021
What to Prioritize? Natural Language Processing for the Development of a Modern Bug Tracking Solution in Hardware Development T. Do Markus Dobler Niklas Kühl 32 0 0 28 Sep 2021
TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text Generation Adaku Uchendu Zeyu Ma Thai Le Rui Zhang Dongwon Lee DeLMO 115 127 0 27 Sep 2021
VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering Ekta Sood Fabian Kögel Florian Strohm Prajit Dhar Andreas Bulling 67 19 0 27 Sep 2021
Context-guided Triple Matching for Multiple Choice Question Answering Xun Yao Junlong Ma Xinrong Hu Junping Liu Jie Yang Wanqing Li 66 2 0 27 Sep 2021
Understanding and Overcoming the Challenges of Efficient Transformer Quantization Yelysei Bondarenko Markus Nagel Tijmen Blankevoort MQ 83 146 0 27 Sep 2021
Multiplicative Position-aware Transformer Models for Language Understanding Zhiheng Huang Davis Liang Peng Xu Bing Xiang 36 1 0 27 Sep 2021
Improving Question Answering Performance Using Knowledge Distillation and Active Learning Yasaman Boreshban Seyed Morteza Mirbostani Gholamreza Ghassem-Sani Seyed Abolghasem Mirroshandel Shahin Amiriparian 83 16 0 26 Sep 2021
Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation Mirza Yusuf Praatibh Surana Gauri Gupta Krithika Ramesh 86 8 0 26 Sep 2021
Entity Linking Meets Deep Learning: Techniques and Solutions Wei Shen Yuhan Li Yinan Liu Jiawei Han Jianyong Wang Xiaojie Yuan 124 53 0 26 Sep 2021
One-shot Key Information Extraction from Document with Deep Partial Graph Matching Minghong Yao Zhiguang Liu Liangwei Wang Houqiang Li Liansheng Zhuang 118 5 0 26 Sep 2021
Parallel Refinements for Lexically Constrained Text Generation with BART Xingwei He 83 43 0 26 Sep 2021
DziriBERT: a Pre-trained Language Model for the Algerian Dialect Amine Abdaoui Mohamed Berrimi Mourad Oussalah A. Moussaoui 99 45 0 25 Sep 2021
Finetuning Transformer Models to Build ASAG System Mithun Thakkar 15 2 0 25 Sep 2021
More Than Reading Comprehension: A Survey on Datasets and Metrics of Textual Question Answering Yang Bai D. Wang 166 10 0 25 Sep 2021
Pushing on Text Readability Assessment: A Transformer Meets Handcrafted Linguistic Features Bruce W. Lee Yoonna Jang J. Lee VLM 101 83 0 25 Sep 2021
Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus Daniela Trotta R. Guarasci Elisa Leonardelli Sara Tonelli 103 31 0 24 Sep 2021
Lacking the embedding of a word? Look it up into a traditional dictionary Elena Sofia Ruzzetti Leonardo Ranaldi Michele Mastromattei Francesca Fallucchi Fabio Massimo Zanzotto 61 15 0 24 Sep 2021
Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System Libo Qin Tianbao Xie Shijue Huang Qiguang Chen Xiao Xu Wanxiang Che 114 20 0 23 Sep 2021
Conditional Poisson Stochastic Beam Search Clara Meister Afra Amini Tim Vieira Ryan Cotterell 83 10 0 22 Sep 2021
BFClass: A Backdoor-free Text Classification Framework Zichao Li Dheeraj Mekala Chengyu Dong Jingbo Shang SILM 108 28 0 22 Sep 2021
Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing K. Kanakarajan Bhuvana Kundumani Malaikannan Sankarasubbu ALM MoE 62 5 0 22 Sep 2021
MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization Xinnuo Xu Ondrej Dusek Shashi Narayan Verena Rieser Ioannis Konstas HILM 74 6 0 22 Sep 2021
Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages Tejas I. Dhamecha V. Rudramurthy Samarth Bharadwaj Karthik Sankaranarayanan P. Bhattacharyya 95 26 0 22 Sep 2021
Digital Signal Processing Using Deep Neural Networks Brian Shevitski Y. Watkins Nicole Man Michael Girard AI4CE 88 4 0 21 Sep 2021
AutoGCL: Automated Graph Contrastive Learning via Learnable View Generators Yihang Yin Qingzhong Wang Siyu Huang Haoyi Xiong Xiang Zhang 112 156 0 21 Sep 2021
RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation Md. Akmal Haidar Nithin Anchuri Mehdi Rezagholizadeh Abbas Ghaddar Philippe Langlais Pascal Poupart 111 22 0 21 Sep 2021
BERT Has Uncommon Sense: Similarity Ranking for Word Sense BERTology Luke Gessler Nathan Schneider 66 7 0 20 Sep 2021
DisCoDisCo at the DISRPT2021 Shared Task: A System for Discourse Segmentation, Classification, and Connective Detection Luke Gessler Shabnam Behzad Yang Liu Siyao Peng Yilun Zhu Amir Zeldes 93 33 0 20 Sep 2021
Towards Zero-Label Language Learning Zirui Wang Adams Wei Yu Orhan Firat Yuan Cao SyDa 251 105 0 19 Sep 2021
Navigating the Kaleidoscope of COVID-19 Misinformation Using Deep Learning Yuanzhi Chen Mohammad Rashedul Hasan 60 4 0 19 Sep 2021
Augmenting semantic lexicons using word embeddings and transfer learning Thayer Alshaabi C. V. Oort M. Fudolig M. V. Arnold C. Danforth P. Dodds 80 4 0 18 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling David R. So Wojciech Mañke Hanxiao Liu Zihang Dai Noam M. Shazeer Quoc V. Le VLM 285 156 0 17 Sep 2021