XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown

Title
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index Megha Chakraborty S.M. Towhidul Islam Tonmoy S. M. Mehedi Krish Sharma Niyar R. Barman ... Tanay Kumar Vinija Jain Aman Chadha Amit P. Sheth Amitava Das DeLMO 82 21 0 08 Oct 2023
The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations Vipula Rawte Swagata Chakraborty Agnibh Pathak Anubhav Sarkar S.M. Towhidul Islam Tonmoy Aman Chadha Mikel Artetxe Punit Daniel Simig HILM 94 131 0 08 Oct 2023
Higher-Order DeepTrails: Unified Approach to *Trails Tobias Koopmann Jan Pfister André Markus Astrid Carolus Carolin Wienrich Andreas Hotho AI4TS 33 0 0 06 Oct 2023
Genetic prediction of quantitative traits: a machine learner's guide focused on height L. Bourguignon Caroline Weis C. Jutzeler Michael Adamer AI4CE 19 0 0 06 Oct 2023
Quantized Transformer Language Model Implementations on Edge Devices Mohammad Wali Ur Rahman Murad Mehrab Abrar Hunter Gibbons Copening Salim Hariri Sicong Shao Pratik Satam Soheil Salehi MQ 68 11 0 06 Oct 2023
A Survey of GPT-3 Family Large Language Models Including ChatGPT and GPT-4 Katikapalli Subramanyam Kalyan LM&MA AI4CE LRM AILaw ELM 131 248 0 04 Oct 2023
AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation Filippo Perrina Francesco Marchiori Mauro Conti Nino Vincenzo Verde 44 11 0 04 Oct 2023
Dodo: Dynamic Contextual Compression for Decoder-only LMs Guanghui Qin Corby Rosset Ethan C. Chau Nikhil Rao Benjamin Van Durme 57 11 0 03 Oct 2023
The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers Rickard Brannvall 46 0 0 03 Oct 2023
Jury: A Comprehensive Evaluation Toolkit Devrim Cavusoglu Secil Sen Ulas Sert S. Altinuc ELM 16 2 0 03 Oct 2023
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy Pingzhi Li Zhenyu Zhang Prateek Yadav Yi-Lin Sung Yu Cheng Mohit Bansal Tianlong Chen MoMe 85 39 0 02 Oct 2023
Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models Man Luo Shrinidhi Kumbhar Ming Shen Mihir Parmar Neeraj Varshney Pratyay Banerjee Somak Aditya Chitta Baral ReLM ELM LRM 137 31 0 02 Oct 2023
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks Hao Chen Jindong Wang Ankit Shah Ran Tao Hongxin Wei Berfin Şimşek Masashi Sugiyama Bhiksha Raj 110 32 0 29 Sep 2023
Unsupervised Pretraining for Fact Verification by Language Model Distillation A. Bazaga Pietro Lio Bo Dai HILM 106 2 0 28 Sep 2023
Social Media Fashion Knowledge Extraction as Captioning Yifei Yuan Wenxuan Zhang Yang Deng Wai Lam 54 1 0 28 Sep 2023
Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey Victoria Smith Ali Shahin Shamsabadi Carolyn Ashurst Adrian Weller PILM 110 27 0 27 Sep 2023
Large Language Model Alignment: A Survey Tianhao Shen Renren Jin Yufei Huang Chuang Liu Weilong Dong Zishan Guo Xinwei Wu Yan Liu Deyi Xiong LM&MA 112 207 0 26 Sep 2023
Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning Jiadong Wang Chengyu Wang Chuanqi Tan Jun Huang Ming Gao KELM 100 6 0 26 Sep 2023
HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS Dake Guo Xinfa Zhu Liumeng Xue Tao Li Yuanjun Lv Yuepeng Jiang Linfu Xie 76 1 0 25 Sep 2023
Text Classification: A Perspective of Deep Learning Methods Zhongwei Wan VLM 33 7 0 24 Sep 2023
Lexical Squad@Multimodal Hate Speech Event Detection 2023: Multimodal Hate Speech Detection using Fused Ensemble Approach Mohammad Kashif Mohammad Zohair Saquib Ali 20 4 0 23 Sep 2023
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches Deepak Gupta Kush Attal Dina Demner-Fushman LM&MA 54 1 0 21 Sep 2023
BELT:Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision Jinzhao Zhou Yiqun Duan Yu-Cheng Chang Yu-Kai Wang Chin-Teng Lin 76 6 0 21 Sep 2023
SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels Elena Shushkevich Long Mai Manuel V. Loureiro Steven Derby Tri Kurniawan Wijaya AI4TS 83 0 0 21 Sep 2023
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs Chengyuan Liu Fubang Zhao Lizhi Qing Yangyang Kang Changlong Sun Kun Kuang Leilei Gan AAML 75 21 0 21 Sep 2023
Word Embedding with Neural Probabilistic Prior Shaogang Ren Dingcheng Li P. Li BDL 49 0 0 21 Sep 2023
Exploring the Relationship between LLM Hallucinations and Prompt Linguistic Nuances: Readability, Formality, and Concreteness Vipula Rawte Prachi Priya S.M. Towhidul Islam Tonmoy M. M. Zaman A. Sheth Amitava Das 54 19 0 20 Sep 2023
Artificial Intelligence-Enabled Intelligent Assistant for Personalized and Adaptive Learning in Higher Education Ramteja Sajja Y. Sermet Muhammed Cikmaz David M. Cwiertny Ibrahim Demir 104 149 0 19 Sep 2023
A Novel Method of Fuzzy Topic Modeling based on Transformer Processing Ching-Hsun Tseng Shin-Jye Lee Po-Wei Cheng Chien Lee Chih-Chieh Hung 31 0 0 18 Sep 2023
Leveraging Social Discourse to Measure Check-worthiness of Claims for Fact-checking Megha Sundriyal Md. Shad Akhtar Tanmoy Chakraborty 62 0 0 17 Sep 2023
SplitEE: Early Exit in Deep Neural Networks with Split Computing Divya J. Bajpai Vivek K. Trivedi S. L. Yadav M. Hanawal 82 7 0 17 Sep 2023
Pedestrian Trajectory Prediction Using Dynamics-based Deep Learning Honghui Wang Weiming Zhi Gustavo Batista Rohitash Chandra 65 1 0 16 Sep 2023
MHLAT: Multi-hop Label-wise Attention Model for Automatic ICD Coding Junwen Duan Han Jiang Ying Yu 74 2 0 16 Sep 2023
Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges Fei Dou Jin Ye Geng Yuan Qin Lu Wei Niu ... Hongyue Sun Yunli Shao Changying Li Tianming Liu Wenzhan Song AI4CE 85 29 0 14 Sep 2023
DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing Perspective Pu Miao Zeyao Du Junlin Zhang SSL 77 7 0 14 Sep 2023
Beyond original Research Articles Categorization via NLP Rosanna Turrisi 126 1 0 13 Sep 2023
Leveraging Large Language Models and Weak Supervision for Social Media data annotation: an evaluation using COVID-19 self-reported vaccination tweets Ramya Tekumalla Juan M. Banda 53 8 0 12 Sep 2023
Backdoor Attacks and Countermeasures in Natural Language Processing Models: A Comprehensive Security Review Pengzhou Cheng Zongru Wu Wei Du Haodong Zhao Wei Lu Gongshen Liu SILM AAML 183 21 0 12 Sep 2023
Balanced and Explainable Social Media Analysis for Public Health with Large Language Models Yan Jiang Ruihong Qiu Yi Zhang Peng Zhang 65 7 0 12 Sep 2023
Challenges in Annotating Datasets to Quantify Bias in Under-represented Society Vithya Yogarajan Gillian Dobbie Timothy Pistotti Joshua Bensemann Kobe Knowles 95 2 0 11 Sep 2023
CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts Rabindra Lamsal M. Read S. Karunasekera 62 15 0 11 Sep 2023
Improving Information Extraction on Business Documents with Specific Pre-Training Tasks Thibault Douzon S. Duffner Christophe Garcia Jérémy Espinas 55 6 0 11 Sep 2023
UQ at #SMM4H 2023: ALEX for Public Health Analysis with Social Media Yan Jiang Ruihong Qiu Yi Zhang Zi Huang LM&MA 52 2 0 08 Sep 2023
Introducing "Forecast Utterance" for Conversational Data Science Md. Mahadi Hassan Alex Knipper S. Karmaker AI4TS 56 0 0 07 Sep 2023
Evaluating ChatGPT as a Recommender System: A Rigorous Approach Dario Di Palma Giovanni Maria Biancofiore Vito Walter Anelli Fedelucio Narducci Tommaso Di Noia E. Sciascio ALM 132 30 0 07 Sep 2023
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs Chao Feng Xinyu Zhang Zichu Fei KELM 83 50 0 06 Sep 2023
A deep Natural Language Inference predictor without language-specific training data Lorenzo Corradi Alessandro Manenti Francesca Del Bonifro Francesco Setti D. Sorbo 34 0 0 06 Sep 2023
UniSA: Unified Generative Framework for Sentiment Analysis Zaijing Li Ting-En Lin Yuchuan Wu Meng Liu Fengxiao Tang Mingde Zhao Yongbin Li 101 18 0 04 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks Sarthak Anand 60 0 0 02 Sep 2023
Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair Yuxiang Wei Chun Xia Lingming Zhang KELM 99 107 0 01 Sep 2023