v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown

Title
Dictionary-Assisted Supervised Contrastive Learning Patrick Y. Wu Richard Bonneau Joshua A. Tucker Jonathan Nagler CLIP 66 0 0 27 Oct 2022
Privately Fine-Tuning Large Language Models with Differential Privacy R. Behnia Mohammadreza Ebrahimi Jason L. Pacheco B. Padmanabhan 127 51 0 26 Oct 2022
Causality Detection using Multiple Annotation Decisions Quynh-Anh Nguyen Arka Mitra 29 2 0 26 Oct 2022
Learning on Large-scale Text-attributed Graphs via Variational Inference Jianan Zhao Meng Qu Chaozhuo Li Hao Yan Qian Liu Rui Li Xing Xie Jian Tang VLM 132 142 0 26 Oct 2022
Uncertainty Sentence Sampling by Virtual Adversarial Perturbation Han Zhang Zhen Zhang Hongfei Jiang Yang Song 40 0 0 26 Oct 2022
Leveraging Affirmative Interpretations from Negation Improves Natural Language Understanding Md Mosharaf Hossain Eduardo Blanco 73 5 0 26 Oct 2022
Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis Sudhandar Balakrishnan Yihao Fang Xioadan Zhu 46 1 0 26 Oct 2022
Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models Aaron Mueller Yudi Xia Tal Linzen MILM 113 10 0 25 Oct 2022
Revision for Concision: A Constrained Paraphrase Generation Task Wenchuan Mu Kwanin Lim 70 3 0 25 Oct 2022
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models Hong Liu Sang Michael Xie Zhiyuan Li Tengyu Ma AI4CE 135 55 0 25 Oct 2022
PolyHope: Two-Level Hope Speech Detection from Tweets F. Balouchzahi Grigori Sidorov Alexander Gelbukh 53 50 0 25 Oct 2022
Audio MFCC-gram Transformers for respiratory insufficiency detection in COVID-19 M. Gauy Marcelo Finger 54 9 0 25 Oct 2022
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR Prediction P. Zhang Junlin Zhang 52 3 0 25 Oct 2022
Effective Pre-Training Objectives for Transformer-based Autoencoders Luca Di Liello Matteo Gabburo Alessandro Moschitti 41 3 0 24 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models Hao Liu Xinyang Geng Lisa Lee Igor Mordatch Sergey Levine Sharan Narang Pieter Abbeel KELM CLL 89 2 0 24 Oct 2022
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation Junyi Li Tianyi Tang Wayne Xin Zhao J. Nie Ji-Rong Wen 81 17 0 24 Oct 2022
A Visual Tour Of Current Challenges In Multimodal Language Models Shashank Sonkar Naiming Liu Richard G. Baraniuk DiffM 47 2 0 22 Oct 2022
DiscoSense: Commonsense Reasoning with Discourse Connectives Prajjwal Bhargava Vincent Ng LRM 335 4 0 22 Oct 2022
Leveraging Large Language Models for Multiple Choice Question Answering Joshua Robinson Christopher Rytting David Wingate ELM 244 200 0 22 Oct 2022
P $^3$ LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training Junwei Bao Yifan Wang Jiangyong Ying Yeyun Gong Jing Zhao Youzheng Wu Xiaodong He 70 1 0 22 Oct 2022
TCAB: A Large-Scale Text Classification Attack Benchmark Kalyani Asthana Zhouhang Xie Wencong You Adam Noack Jonathan Brophy Sameer Singh Daniel Lowd 121 3 0 21 Oct 2022
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding Yuechen Wang Wen-gang Zhou Houqiang Li AI4TS 63 13 0 21 Oct 2022
Extracted BERT Model Leaks More Information than You Think! Xuanli He Chen Chen Lingjuan Lyu Xingliang Yuan SILM MIACV 74 6 0 21 Oct 2022
Design a Sustainable Micro-mobility Future: Trends and Challenges in the United States and European Union Using Natural Language Processing Techniques Lilit Avetisyan Chengxin Zhang Sue Bai Ehsan Moradi-Pari Fred Feng Shan Bao Feng Zhou 21 0 0 21 Oct 2022
Transcending Scaling Laws with 0.1% Extra Compute Yi Tay Jason W. Wei Hyung Won Chung Vinh Q. Tran David R. So ... Donald Metzler Slav Petrov N. Houlsby Quoc V. Le Mostafa Dehghani LRM 109 71 0 20 Oct 2022
Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts Xiangyang Liu Tianxiang Sun Xuanjing Huang Xipeng Qiu VLM 103 29 0 20 Oct 2022
How Does a Deep Learning Model Architecture Impact Its Privacy? A Comprehensive Study of Privacy Attacks on CNNs and Transformers Guangsheng Zhang B. Liu Huan Tian Tianqing Zhu Ming Ding Wanlei Zhou PILM MIACV 85 6 0 20 Oct 2022
Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective Adaku Uchendu Thai Le Dongwon Lee DeLMO 121 45 0 19 Oct 2022
Language Detoxification with Attribute-Discriminative Latent Space Jin Myung Kwak Minseon Kim Sung Ju Hwang 70 14 0 19 Oct 2022
Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning Hongqiu Wu Ruixue Ding Haizhen Zhao Boli Chen Pengjun Xie Fei Huang Min Zhang MoMe 102 8 0 19 Oct 2022
Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models Hao Zhang 29 0 0 19 Oct 2022
ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler Jiaxin Zhang Yashar Moshfeghi AIMat 68 18 0 18 Oct 2022
Detecting and analyzing missing citations to published scientific entities Jialiang Lin Yao Yu Jia-Qi Song X. Shi 52 4 0 18 Oct 2022
Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters Hongyu Zhao Hao Tan Hongyuan Mei MoE 81 18 0 18 Oct 2022
Controllable Fake Document Infilling for Cyber Deception Yibo Hu Yu Lin Eric Parolin Latif Khan Kevin W. Hamlen 66 8 0 18 Oct 2022
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis Shuai Fan Chen Lin Haonan Li Zheng-Wen Lin Jinsong Su Hang Zhang Yeyun Gong Jian Guo Nan Duan VLM 67 19 0 18 Oct 2022
Modelling Emotion Dynamics in Song Lyrics with State Space Models Yingjin Song Daniel Beck 87 5 0 17 Oct 2022
Flipped Classroom: Effective Teaching for Time Series Forecasting P. Teutsch Patrick Mäder AI4TS 67 8 0 17 Oct 2022
Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding Jiadong Wang Wenkang Huang Qiuhui Shi Hongbin Wang Minghui Qiu Xiang Li Ming Gao KELM VLM 90 19 0 16 Oct 2022
A Simple and Strong Baseline for End-to-End Neural RST-style Discourse Parsing Naoki Kobayashi Tsutomu Hirao Hidetaka Kamigaito Manabu Okumura Masaaki Nagata 38 11 0 15 Oct 2022
Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition Shuguang Chen Leonardo Neves Thamar Solorio 110 4 0 14 Oct 2022
HashFormers: Towards Vocabulary-independent Pre-trained Transformers Huiyin Xue Nikolaos Aletras 51 4 0 14 Oct 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation Tianxiang Sun Junliang He Xipeng Qiu Xuanjing Huang 85 47 0 14 Oct 2022
Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning Tianxiang Sun Zhengfu He Qinen Zhu Xipeng Qiu Xuanjing Huang VLM VPVLM 38 21 0 14 Oct 2022
StyLEx: Explaining Style Using Human Lexical Annotations Shirley Anugrah Hayati Kyumin Park Dheeraj Rajagopal Lyle Ungar Dongyeop Kang 96 3 0 14 Oct 2022
Frustratingly Easy Sentiment Analysis of Text Streams: Generating High-Quality Emotion Arcs Using Emotion Lexicons Daniela Teodorescu Saif M. Mohammad 55 8 0 13 Oct 2022
Predicting Fine-Tuning Performance with Probing Zining Zhu Soroosh Shahtalebi Frank Rudzicz 64 10 0 13 Oct 2022
Counterfactual Multihop QA: A Cause-Effect Approach for Reducing Disconnected Reasoning Wangzhen Guo Qinkang Gong Hanjiang Lai LRM 75 4 0 13 Oct 2022
Spontaneous Emerging Preference in Two-tower Language Model Zhengqi He Taro Toyoizumi LRM 50 1 0 13 Oct 2022
On the Evaluation of the Plausibility and Faithfulness of Sentiment Analysis Explanations Julia El Zini Mohamad Mansour Basel Mousi M. Awad 64 8 0 13 Oct 2022