1906.08237
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,520 papers shown
Title
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index
Megha Chakraborty
S.M. Towhidul Islam Tonmoy
S. M. Mehedi
Krish Sharma
Niyar R. Barman
...
Tanay Kumar
Vinija Jain
Aman Chadha
Amit P. Sheth
Amitava Das
DeLMO
82
21
0
08 Oct 2023
The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations
Vipula Rawte
Swagata Chakraborty
Agnibh Pathak
Anubhav Sarkar
S.M. Towhidul Islam Tonmoy
Aman Chadha
Mikel Artetxe
Punit Daniel Simig
HILM
94
131
0
08 Oct 2023
Higher-Order DeepTrails: Unified Approach to *Trails
Tobias Koopmann
Jan Pfister
André Markus
Astrid Carolus
Carolin Wienrich
Andreas Hotho
AI4TS
33
0
0
06 Oct 2023
Genetic prediction of quantitative traits: a machine learner's guide focused on height
L. Bourguignon
Caroline Weis
C. Jutzeler
Michael Adamer
AI4CE
19
0
0
06 Oct 2023
Quantized Transformer Language Model Implementations on Edge Devices
Mohammad Wali Ur Rahman
Murad Mehrab Abrar
Hunter Gibbons Copening
Salim Hariri
Sicong Shao
Pratik Satam
Soheil Salehi
MQ
68
11
0
06 Oct 2023
A Survey of GPT-3 Family Large Language Models Including ChatGPT and GPT-4
Katikapalli Subramanyam Kalyan
LM&MA
AI4CE
LRM
AILaw
ELM
131
248
0
04 Oct 2023
AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation
Filippo Perrina
Francesco Marchiori
Mauro Conti
Nino Vincenzo Verde
44
11
0
04 Oct 2023
Dodo: Dynamic Contextual Compression for Decoder-only LMs
Guanghui Qin
Corby Rosset
Ethan C. Chau
Nikhil Rao
Benjamin Van Durme
57
11
0
03 Oct 2023
The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers
Rickard Brannvall
46
0
0
03 Oct 2023
Jury: A Comprehensive Evaluation Toolkit
Devrim Cavusoglu
Secil Sen
Ulas Sert
S. Altinuc
ELM
16
2
0
03 Oct 2023
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Pingzhi Li
Zhenyu Zhang
Prateek Yadav
Yi-Lin Sung
Yu Cheng
Mohit Bansal
Tianlong Chen
MoMe
85
39
0
02 Oct 2023
Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models
Man Luo
Shrinidhi Kumbhar
Ming Shen
Mihir Parmar
Neeraj Varshney
Pratyay Banerjee
Somak Aditya
Chitta Baral
ReLM
ELM
LRM
137
31
0
02 Oct 2023
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks
Hao Chen
Jindong Wang
Ankit Shah
Ran Tao
Hongxin Wei
Berfin Şimşek
Masashi Sugiyama
Bhiksha Raj
110
32
0
29 Sep 2023
Unsupervised Pretraining for Fact Verification by Language Model Distillation
A. Bazaga
Pietro Lio
Bo Dai
HILM
106
2
0
28 Sep 2023
Social Media Fashion Knowledge Extraction as Captioning
Yifei Yuan
Wenxuan Zhang
Yang Deng
Wai Lam
54
1
0
28 Sep 2023
Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey
Victoria Smith
Ali Shahin Shamsabadi
Carolyn Ashurst
Adrian Weller
PILM
110
27
0
27 Sep 2023
Large Language Model Alignment: A Survey
Tianhao Shen
Renren Jin
Yufei Huang
Chuang Liu
Weilong Dong
Zishan Guo
Xinwei Wu
Yan Liu
Deyi Xiong
LM&MA
112
207
0
26 Sep 2023
Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning
Jiadong Wang
Chengyu Wang
Chuanqi Tan
Jun Huang
Ming Gao
KELM
100
6
0
26 Sep 2023
HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS
Dake Guo
Xinfa Zhu
Liumeng Xue
Tao Li
Yuanjun Lv
Yuepeng Jiang
Linfu Xie
76
1
0
25 Sep 2023
Text Classification: A Perspective of Deep Learning Methods
Zhongwei Wan
VLM
33
7
0
24 Sep 2023
Lexical Squad@Multimodal Hate Speech Event Detection 2023: Multimodal Hate Speech Detection using Fused Ensemble Approach
Mohammad Kashif
Mohammad Zohair
Saquib Ali
20
4
0
23 Sep 2023
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches
Deepak Gupta
Kush Attal
Dina Demner-Fushman
LM&MA
54
1
0
21 Sep 2023
BELT: Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision
Jinzhao Zhou
Yiqun Duan
Yu-Cheng Chang
Yu-Kai Wang
Chin-Teng Lin
76
6
0
21 Sep 2023
SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels
Elena Shushkevich
Long Mai
Manuel V. Loureiro
Steven Derby
Tri Kurniawan Wijaya
AI4TS
83
0
0
21 Sep 2023
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs
Chengyuan Liu
Fubang Zhao
Lizhi Qing
Yangyang Kang
Changlong Sun
Kun Kuang
Leilei Gan
AAML
75
21
0
21 Sep 2023
Word Embedding with Neural Probabilistic Prior
Shaogang Ren
Dingcheng Li
P. Li
BDL
49
0
0
21 Sep 2023
Exploring the Relationship between LLM Hallucinations and Prompt Linguistic Nuances: Readability, Formality, and Concreteness
Vipula Rawte
Prachi Priya
S.M. Towhidul Islam Tonmoy
M. M. Zaman
A. Sheth
Amitava Das
54
19
0
20 Sep 2023
Artificial Intelligence-Enabled Intelligent Assistant for Personalized and Adaptive Learning in Higher Education
Ramteja Sajja
Y. Sermet
Muhammed Cikmaz
David M. Cwiertny
Ibrahim Demir
104
149
0
19 Sep 2023
A Novel Method of Fuzzy Topic Modeling based on Transformer Processing
Ching-Hsun Tseng
Shin-Jye Lee
Po-Wei Cheng
Chien Lee
Chih-Chieh Hung
31
0
0
18 Sep 2023
Leveraging Social Discourse to Measure Check-worthiness of Claims for Fact-checking
Megha Sundriyal
Md. Shad Akhtar
Tanmoy Chakraborty
62
0
0
17 Sep 2023
SplitEE: Early Exit in Deep Neural Networks with Split Computing
Divya J. Bajpai
Vivek K. Trivedi
S. L. Yadav
M. Hanawal
82
7
0
17 Sep 2023
Pedestrian Trajectory Prediction Using Dynamics-based Deep Learning
Honghui Wang
Weiming Zhi
Gustavo Batista
Rohitash Chandra
65
1
0
16 Sep 2023
MHLAT: Multi-hop Label-wise Attention Model for Automatic ICD Coding
Junwen Duan
Han Jiang
Ying Yu
74
2
0
16 Sep 2023
Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges
Fei Dou
Jin Ye
Geng Yuan
Qin Lu
Wei Niu
...
Hongyue Sun
Yunli Shao
Changying Li
Tianming Liu
Wenzhan Song
AI4CE
85
29
0
14 Sep 2023
DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing Perspective
Pu Miao
Zeyao Du
Junlin Zhang
SSL
77
7
0
14 Sep 2023
Beyond original Research Articles Categorization via NLP
Rosanna Turrisi
126
1
0
13 Sep 2023
Leveraging Large Language Models and Weak Supervision for Social Media data annotation: an evaluation using COVID-19 self-reported vaccination tweets
Ramya Tekumalla
Juan M. Banda
53
8
0
12 Sep 2023
Backdoor Attacks and Countermeasures in Natural Language Processing Models: A Comprehensive Security Review
Pengzhou Cheng
Zongru Wu
Wei Du
Haodong Zhao
Wei Lu
Gongshen Liu
SILM
AAML
183
21
0
12 Sep 2023
Balanced and Explainable Social Media Analysis for Public Health with Large Language Models
Yan Jiang
Ruihong Qiu
Yi Zhang
Peng Zhang
65
7
0
12 Sep 2023
Challenges in Annotating Datasets to Quantify Bias in Under-represented Society
Vithya Yogarajan
Gillian Dobbie
Timothy Pistotti
Joshua Bensemann
Kobe Knowles
95
2
0
11 Sep 2023
CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts
Rabindra Lamsal
M. Read
S. Karunasekera
62
15
0
11 Sep 2023
Improving Information Extraction on Business Documents with Specific Pre-Training Tasks
Thibault Douzon
S. Duffner
Christophe Garcia
Jérémy Espinas
55
6
0
11 Sep 2023
UQ at #SMM4H 2023: ALEX for Public Health Analysis with Social Media
Yan Jiang
Ruihong Qiu
Yi Zhang
Zi Huang
LM&MA
52
2
0
08 Sep 2023
Introducing "Forecast Utterance" for Conversational Data Science
Md. Mahadi Hassan
Alex Knipper
S. Karmaker
AI4TS
56
0
0
07 Sep 2023
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Dario Di Palma
Giovanni Maria Biancofiore
Vito Walter Anelli
Fedelucio Narducci
Tommaso Di Noia
E. Sciascio
ALM
132
30
0
07 Sep 2023
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Chao Feng
Xinyu Zhang
Zichu Fei
KELM
83
50
0
06 Sep 2023
A deep Natural Language Inference predictor without language-specific training data
Lorenzo Corradi
Alessandro Manenti
Francesca Del Bonifro
Francesco Setti
D. Sorbo
34
0
0
06 Sep 2023
UniSA: Unified Generative Framework for Sentiment Analysis
Zaijing Li
Ting-En Lin
Yuchuan Wu
Meng Liu
Fengxiao Tang
Mingde Zhao
Yongbin Li
101
18
0
04 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
60
0
0
02 Sep 2023
Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair
Yuxiang Wei
Chun Xia
Lingming Zhang
KELM
99
107
0
01 Sep 2023