Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,520 papers shown
Title
Augmenting Language Models with Long-Term Memory
Weizhi Wang
Li Dong
Hao Cheng
Xiaodong Liu
Xifeng Yan
Jianfeng Gao
Furu Wei
KELM
RALM
106
96
0
12 Jun 2023
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search
Jian Xie
Yidan Liang
Jingping Liu
Yanghua Xiao
Baohua Wu
Shenghua Ni
VLM
LRM
90
9
0
11 Jun 2023
GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Shicheng Tan
Weng Lam Tam
Yuanchun Wang
Wenwen Gong
Yang Yang
...
Jiahao Liu
Jingang Wang
Shuo Zhao
Peng Zhang
Jie Tang
ALM
MoE
82
13
0
11 Jun 2023
Mimicking the Thinking Process for Emotion Recognition in Conversation with Prompts and Paraphrasing
Tingyu Zhang
Zhuang Chen
Ming Zhong
T. Qian
67
15
0
11 Jun 2023
Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks
Katelyn Mei
Sonia Fereidooni
Aylin Caliskan
87
56
0
08 Jun 2023
Hexatagging: Projective Dependency Parsing as Tagging
Afra Amini
Tianyu Liu
Ryan Cotterell
VLM
3DV
47
3
0
08 Jun 2023
Extensive Evaluation of Transformer-based Architectures for Adverse Drug Events Extraction
Simone Scaboro
Beatrice Portelli
Emmanuele Chersoni
Enrico Santus
Giuseppe Serra
66
9
0
08 Jun 2023
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter
Ajay Jaiswal
Shiwei Liu
Tianlong Chen
Zhangyang Wang
VLM
75
34
0
06 Jun 2023
Information Flow Control in Machine Learning through Modular Model Architecture
Trishita Tiwari
Suchin Gururangan
Chuan Guo
Weizhe Hua
Sanjay Kariyappa
Udit Gupta
Wenjie Xiong
Kiwan Maeng
Hsien-Hsin S. Lee
G. E. Suh
75
6
0
05 Jun 2023
Interactive Editing for Text Summarization
Yujia Xie
Xun Wang
Si-Qing Chen
Wayne Xiong
Pengcheng He
KELM
334
2
0
05 Jun 2023
A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires
Hoyun Song
Jisu Shin
Huije Lee
Jong C. Park
72
7
0
05 Jun 2023
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Alham Fikri Aji
Genta Indra Winata
Radityo Eko Prasojo
Phil Blunsom
A. Kuncoro
65
8
0
05 Jun 2023
Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications
Han Xie
Da Zheng
Jun Ma
Houyu Zhang
V. Ioannidis
...
Sheng Wang
Carl Yang
Yi Xu
Belinda Zeng
Trishul Chilimbi
AI4CE
106
40
0
05 Jun 2023
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet
Omar Shaikh
Caleb Ziems
William B. Held
Aryan Pariani
Fred Morstatter
Diyi Yang
85
14
0
04 Jun 2023
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning
Jianghui Wang
Yuxuan Wang
Dongyan Zhao
Zilong Zheng
96
1
0
04 Jun 2023
Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis
Daniela Teodorescu
Saif M. Mohammad
59
13
0
03 Jun 2023
Towards Coding Social Science Datasets with Language Models
Anonymous Acl
Taylor Sorensen
Lisa P. Argyle
Ethan C. Busby
Nancy Fulda
Joshua R Gubler
David Wingate
ALM
SyDa
55
11
0
03 Jun 2023
SourceP: Detecting Ponzi Schemes on Ethereum with Source Code
Pengcheng Lu
Liang Cai
Keting Yin
AI4TS
97
4
0
02 Jun 2023
MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models
Masoud Monajatipoor
Liunian Harold Li
Mozhdeh Rouhsedaghat
Lin F. Yang
Kai-Wei Chang
MLLM
LRM
80
14
0
02 Jun 2023
An Overview on Generative AI at Scale with Edge-Cloud Computing
Yun Cheng Wang
Jintang Xue
Chengwei Wei
C.-C. Jay Kuo
69
35
0
02 Jun 2023
Cook-Gen: Robust Generative Modeling of Cooking Actions from Recipes
R. Venkataramanan
Kaushik Roy
Kanak Raj
Renjith Prasad
Yuxin Zi
Vignesh Narayanan
Amit P. Sheth
38
18
0
01 Jun 2023
Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding
Ziao Yang
Samridhi Choudhary
Siegfried Kunzmann
Zheng Zhang
MQ
81
3
0
01 Jun 2023
Effective Structured Prompting by Meta-Learning and Representative Verbalizer
Weisen Jiang
Yu Zhang
James T. Kwok
VLM
OffRL
94
18
0
01 Jun 2023
Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home
Eda Okur
Roddy Fuentes Alba
Saurav Sahay
L. Nachman
63
0
0
01 Jun 2023
MedNgage: A Dataset for Understanding Engagement in Patient-Nurse Conversations
Yan Wang
H. Donovan
Sabit Hassan
Mailhe Alikhani
85
3
0
31 May 2023
A Global Context Mechanism for Sequence Labeling
Conglei Xu
Kun Shen
Hongguang Sun
Yang Xu
67
6
0
31 May 2023
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning
Amirhossein Abaskohi
S. Rothe
Yadollah Yaghoobzadeh
VLM
105
19
0
29 May 2023
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback
Shengchao Liu
Jiong Wang
Yijin Yang
Chengpeng Wang
Ling Liu
Hongyu Guo
Chaowei Xiao
LM&MA
KELM
AI4MH
107
38
0
29 May 2023
The Utility of Large Language Models and Generative AI for Education Research
Andrew Katz
Umair Shakir
B. Chambers
AI4CE
68
6
0
29 May 2023
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
76
0
0
28 May 2023
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making
Xuanjie Fang
Sijie Cheng
Yang Liu
Wen Wang
AAML
65
9
0
27 May 2023
Entailment as Robust Self-Learner
Jiaxin Ge
Hongyin Luo
Yoon Kim
James R. Glass
109
3
0
26 May 2023
DeepSI: Interactive Deep Learning for Semantic Interaction
Yail Bian
Chris North
HAI
118
15
0
26 May 2023
Sentence-Incremental Neural Coreference Resolution
Matt Grenander
Shay B. Cohen
Mark Steedman
CLL
102
5
0
26 May 2023
Towards a Common Understanding of Contributing Factors for Cross-Lingual Transfer in Multilingual Language Models: A Review
Fred Philippy
Siwen Guo
Shohreh Haddadan
LRM
72
37
0
26 May 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Yu Zhou
97
7
0
25 May 2023
Neural Summarization of Electronic Health Records
Koyena Pal
Seyed Ali Bahrainian
Laura Y. Mercurio
Carsten Eickhoff
39
3
0
24 May 2023
Multi-modal Machine Learning for Vehicle Rating Predictions Using Image, Text, and Parametric Data
Hanqi Su
Binyang Song
Faez Ahmed
56
6
0
24 May 2023
Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective
Tianyu Liu
Afra Amini
Mrinmaya Sachan
Ryan Cotterell
93
2
0
24 May 2023
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives
Xinpeng Wang
Leonie Weissweiler
Hinrich Schütze
Barbara Plank
66
8
0
24 May 2023
PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification
Yau-Shian Wang
Ta-Chung Chi
Ruohong Zhang
Yiming Yang
VLM
53
13
0
24 May 2023
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark
Feng Jiang
Weihao Liu
Xiaomin Chu
Peifeng Li
Qiaoming Zhu
Haizhou Li
72
1
0
24 May 2023
Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization
Shoujie Tong
Heming Xia
Damai Dai
Runxin Xu
Tianyu Liu
Binghuai Lin
Yunbo Cao
Zhifang Sui
53
0
0
24 May 2023
Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation
Bashar Alhafni
Go Inoue
Christian Khairallah
Nizar Habash
87
19
0
24 May 2023
SELFOOD: Self-Supervised Out-Of-Distribution Detection via Learning to Rank
Dheeraj Mekala
Adithya Samavedhi
Chengyu Dong
Jingbo Shang
OODD
57
2
0
24 May 2023
Difference-Masking: Choosing What to Mask in Continued Pretraining
Alex Wilf
Syeda Nahida Akter
Leena Mathur
Paul Pu Liang
Sheryl Mathew
Mengrou Shou
Eric Nyberg
Louis-Philippe Morency
CLL
SSL
58
5
0
23 May 2023
Few-shot Unified Question Answering: Tuning Models or Prompts?
Srijan Bansal
Semih Yavuz
Bo Pang
Meghana Moorthy Bhat
Yingbo Zhou
106
2
0
23 May 2023
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Manuel Tran
Yashin Dicente Cid
Amal Lahiani
Fabian J. Theis
Tingying Peng
Eldad Klaiman
60
2
0
23 May 2023
WYWEB: A NLP Evaluation Benchmark For Classical Chinese
Bo Zhou
Qianglong Chen
Tianyu Wang
Xiaoshi Zhong
Yin Zhang
ELM
119
10
0
23 May 2023
Revisiting Acceptability Judgements
Hai Hu
Ziyin Zhang
Wei-Ping Huang
J. Lai
Aini Li
Yi Ma
Jiahui Huang
Peng Zhang
Chien-Jer Charles Lin
Rui Wang
71
2
0
23 May 2023
Previous
1
2
3
...
15
16
17
...
69
70
71
Next