v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown

Title
Controllable Discovery of Intents: Incremental Deep Clustering Using Semi-Supervised Contrastive Learning Mrinal Rawat Hithesh Sankararaman Victor Barrès 86 0 0 18 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning Jiacheng Ye Jiahui Gao Shansan Gong Lin Zheng Xin Jiang Zhiyu Li Dianbo Sui DiffM LRM 180 25 0 18 Oct 2024
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs SeongYeub Chu JongWoo Kim Bryan Wong MunYong Yi LRM 94 3 0 18 Oct 2024
Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification Zeren Shui Petros Karypis Daniel S. Karls Mingjian Wen Saurav Manchanda E. Tadmor George Karypis 46 1 0 17 Oct 2024
The Mystery of the Pathological Path-star Task for Language Models Arvid Frydenlund LRM 127 4 0 17 Oct 2024
CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training Zhiyuan Ma Jianjun Li Guohui Li Kaiyan Huang VLM 120 9 0 16 Oct 2024
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator Taozhe Li Wei Sun 61 0 0 14 Oct 2024
Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models S. Nigam Aniket Deroy Subhankar Maity Arnab Bhattacharya ELM AILaw 76 6 0 14 Oct 2024
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling Wenze Liu Le Zhuo Yi Xin Sheng Xia Peng Gao Xiangyu Yue 125 9 0 14 Oct 2024
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement Yuxi Xie Anirudh Goyal Xiaobao Wu Xunjian Yin Xiao Xu Min-Yen Kan Liangming Pan William Yang Wang LRM 358 1 0 12 Oct 2024
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling Rui Liu Zhenqi Jia Jie Yang Yifan Hu Hong Li 98 2 0 12 Oct 2024
Text Classification using Graph Convolutional Networks: A Comprehensive Survey Syed Mustafa Haider Rizvi Ramsha Imran Arif Mahmood GNN OOD FaML 51 2 0 12 Oct 2024
HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction Qianyue Hao Jingyang Fan Fengli Xu Jian Yuan Yong Li 67 9 0 10 Oct 2024
Chain and Causal Attention for Efficient Entity Tracking Erwan Fagnou Paul Caillon Blaise Delattre Alexandre Allauzen 92 5 0 07 Oct 2024
Investigating large language models for their competence in extracting grammatically sound sentences from transcribed noisy utterances Alina Wróblewska 58 0 0 07 Oct 2024
Computational design of target-specific linear peptide binders with TransformerBeta Haowen Zhao Francesco A. Aprile Barbara Bravi 77 0 0 07 Oct 2024
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks Yijiong Yu Ma Xiufa Fang Jianwei Zhi-liang Xu Su Guangyao ... Zhixiao Qi Wei Wang Wen Liu Ran Chen Ji Pei LRM RALM 73 0 0 06 Oct 2024
Fundamental Limitations on Subquadratic Alternatives to Transformers Josh Alman Hantao Yu 119 4 0 05 Oct 2024
Variational Language Concepts for Interpreting Foundation Language Models Hengyi Wang Shiwei Tan Zhiqing Hong Desheng Zhang Hao Wang 148 3 0 04 Oct 2024
Linear Transformer Topological Masking with Graph Random Features Isaac Reid Kumar Avinava Dubey Deepali Jain Will Whitney Amr Ahmed ... Connor Schenck Richard E. Turner René Wagner Adrian Weller Krzysztof Choromanski 87 1 0 04 Oct 2024
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs Wei Wu Chao Wang L. Chen Mingze Yin Yiheng Zhu Kun Fu Jieping Ye Hui Xiong Zheng Wang 143 1 0 04 Oct 2024
Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification Sudipta Singha Roy Xindi Wang Robert E. Mercer Frank Rudzicz 62 0 0 03 Oct 2024
On The Adaptation of Unlimiformer for Decoder-Only Transformers Kian Ahrabian Alon Benhaim Barun Patra Jay Pujara Saksham Singhal Xia Song 68 0 0 02 Oct 2024
Preserving Generalization of Language models in Few-shot Continual Relation Extraction Quyen Tran Nguyen Xuan Thanh Nguyen Hoang Anh Nam Le Hai Trung Le Linh Van Ngo Thien Huu Nguyen CLL KELM 80 7 0 01 Oct 2024
Leveraging Long-Context Large Language Models for Multi-Document Understanding and Summarization in Enterprise Applications Aditi Godbole Jabin Geevarghese George Smita Shandilya 84 5 0 27 Sep 2024
Trustworthy AI: Securing Sensitive Data in Large Language Models G. Feretzakis V. Verykios 58 17 0 26 Sep 2024
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions Zeyneb N. Kaya Souvick Ghosh 55 0 0 25 Sep 2024
The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles Hanwen Zhang Dusit Niyato Wei Zhang Changyuan Zhao Hongyang Du Abbas Jamalipour Sumei Sun Yiyang Pei AI4CE 70 2 0 24 Sep 2024
Improving Academic Skills Assessment with NLP and Ensemble Learning Xinyi Huang Yingyi Wu Danyang Zhang Jiacheng Hu Yujian Long 49 7 0 23 Sep 2024
"I Never Said That": A dataset, taxonomy and baselines on response clarity classification Konstantinos Thomas Giorgos Filandrianos Maria Lymperaiou Chrysoula Zerva Giorgos Stamou 58 0 0 20 Sep 2024
GAProtoNet: A Multi-head Graph Attention-based Prototypical Network for Interpretable Text Classification Ximing Wen Wenjuan Tan Rosina O. Weber 89 2 0 20 Sep 2024
Incremental and Data-Efficient Concept Formation to Support Masked Word Prediction Xin Lian Nishant Baglodi Christopher J. MacLellan 61 1 0 19 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer Humen Zhong Zhibo Yang Zhaohai Li Peng Wang Jun Tang Wenqing Cheng Cong Yao 71 1 0 18 Sep 2024
Evaluation of pretrained language models on music understanding Yannis Vasilakis Rachel M. Bittner Johan Pauwels 99 1 0 17 Sep 2024
OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities Bilal Faye Hanane Azzag M. Lebbah ObjD 105 0 0 17 Sep 2024
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation Seyed Rohollah Hosseyni Ali Ahmad Rahmani S. J. Seyedmohammadi Sanaz Seyedin Arash Mohammadi DiffM 93 7 0 17 Sep 2024
Language Models Learn Metadata: Political Stance Detection Case Study Stanley Cao Felix Drinkall 51 0 0 15 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs Madhusudan Ghosh Shrimon Mukherjee Asmit Ganguly Partha Basuchowdhuri S. Naskar Debasis Ganguly 99 8 0 15 Sep 2024
Synthetic4Health: Generating Annotated Synthetic Clinical Letters Libo Ren Samuel Belkadi Lifeng Han Warren Del-Pinto Goran Nenadic SyDa 57 2 0 14 Sep 2024
Layerwise Change of Knowledge in Neural Networks Xu Cheng Lei Cheng Zhaoran Peng Yang Xu Tian Han Quanshi Zhang KELM FAtt 74 5 0 13 Sep 2024
TheraGen: Therapy for Every Generation Kartikey Doshi Jimit Shah Narendra Shekokar AI4MH 53 0 0 12 Sep 2024
Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout Anbin QI Zhongliang Liu Xinyong Zhou Jinba Xiao Fengrun Zhang Qi Gan Ming Tao Gaozheng Zhang Lu Zhang VLM 48 2 0 11 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models Maryam Akhavan Aghdam Hongpeng Jin Yanzhao Wu MoE 63 3 0 10 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention Anna-Maria Halacheva M. Nayyeri Steffen Staab 78 1 0 08 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges Haolong Chen Hanzhi Chen Zijian Zhao Kaifeng Han Guangxu Zhu Yichen Zhao Ying Du Wei Xu Qingjiang Shi ALM VLM 111 5 0 06 Sep 2024
Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation Yihang Zheng Yue Liu Zhenghao Lin Yi Luo Xuanhe Zhou Chen Lin Jinsong Su Guoliang Li Shifu Li ELM 103 2 0 05 Sep 2024
Dreaming is All You Need Mingze Ni Wei Liu 53 0 0 03 Sep 2024
Pre-Trained Language Models for Keyphrase Prediction: A Review Muhammad Umair Tangina Sultana Young-Koo Lee 80 4 0 02 Sep 2024
Hound: Hunting Supervision Signals for Few and Zero Shot Node Classification on Text-attributed Graph Yuxiang Wang Xiao Yan Shiyu Jin Quanqing Xu Chuanhui Yang Yuanyuan Zhu Chuang Hu Bo Du Jiawei Jiang VLM 61 0 0 01 Sep 2024
EMP: Enhance Memory in Data Pruning Jinying Xiao Ping Li Jie Nie Zhe Tang VLM 94 0 0 28 Aug 2024