ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Title
Can we obtain significant success in RST discourse parsing by using
  Large Language Models?
Can we obtain significant success in RST discourse parsing by using Large Language Models?
Aru Maekawa
Tsutomu Hirao
Hidetaka Kamigaito
Manabu Okumura
31
2
0
08 Mar 2024
The Social Impact of Generative AI: An Analysis on ChatGPT
The Social Impact of Generative AI: An Analysis on ChatGPT
M. T. Baldassarre
D. Caivano
Berenice Fernandez Nieto
Domenico Gigante
Azzurra Ragone
43
63
0
07 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network
  Stacking
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
91
5
0
07 Mar 2024
ShortGPT: Layers in Large Language Models are More Redundant Than You
  Expect
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Xin Men
Mingyu Xu
Qingyu Zhang
Bingning Wang
Hongyu Lin
Yaojie Lu
Xianpei Han
Weipeng Chen
117
141
0
06 Mar 2024
A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Dongyu Yao
Asaad Alghamdi
Qingrong Xia
Xiaoye Qu
Xinyu Duan
Zhefeng Wang
Yi Zheng
Baoxing Huai
Peilun Cheng
Zhou Zhao
61
0
0
05 Mar 2024
Mitigating Reversal Curse in Large Language Models via Semantic-aware
  Permutation Training
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
Qingyan Guo
Rui Wang
Junliang Guo
Xu Tan
Jiang Bian
Yujiu Yang
LRM
91
7
0
01 Mar 2024
Rethinking Tokenization: Crafting Better Tokenizers for Large Language
  Models
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models
Jinbiao Yang
LLMAG
169
11
0
01 Mar 2024
Deep Learning Detection Method for Large Language Models-Generated
  Scientific Content
Deep Learning Detection Method for Large Language Models-Generated Scientific Content
Bushra Alhijawi
Rawan Jarrar
Aseel AbuAlRub
Arwa Bader
DeLMO
51
7
0
27 Feb 2024
Natural Language Processing Methods for Symbolic Music Generation and
  Information Retrieval: a Survey
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey
Dinh-Viet-Toan Le
Louis Bigo
Mikaela Keller
Dorien Herremans
MedIm
88
14
0
27 Feb 2024
Beyond Self-learned Attention: Mitigating Attention Bias in
  Transformer-based Models Using Attention Guidance
Beyond Self-learned Attention: Mitigating Attention Bias in Transformer-based Models Using Attention Guidance
Jiri Gesi
Iftekhar Ahmed
79
0
0
26 Feb 2024
Generating Effective Ensembles for Sentiment Analysis
Generating Effective Ensembles for Sentiment Analysis
Itay Etelis
Avi Rosenfeld
Abraham Itzhak Weinberg
David Sarne
79
2
0
26 Feb 2024
Layer-wise Regularized Dropout for Neural Language Models
Layer-wise Regularized Dropout for Neural Language Models
Shiwen Ni
Min Yang
Ruifeng Xu
Chengming Li
Xiping Hu
49
0
0
26 Feb 2024
Value Preferences Estimation and Disambiguation in Hybrid Participatory Systems
Value Preferences Estimation and Disambiguation in Hybrid Participatory Systems
Enrico Liscio
Luciano Cavalcante Siebert
Catholijn M. Jonker
P. Murukannaiah
105
5
0
26 Feb 2024
Enhancing Cloud-Based Large Language Model Processing with Elasticsearch
  and Transformer Models
Enhancing Cloud-Based Large Language Model Processing with Elasticsearch and Transformer Models
Chunhe Ni
Jiang Wu
Hongbo Wang
Wenran Lu
Chenwei Zhang
55
8
0
24 Feb 2024
CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for
  Aspect-Level Sentiment Classification in Korean
CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean
Dongjun Jang
Jean Seo
Sungjoo Byun
Taekyoung Kim
Minseok Kim
Hyopil Shin
49
1
0
23 Feb 2024
The Impact of Word Splitting on the Semantic Content of Contextualized
  Word Representations
The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations
Aina Garí Soler
Matthieu Labeau
Chloé Clavel
VLM
72
2
0
22 Feb 2024
Efficient data selection employing Semantic Similarity-based Graph
  Structures for model training
Efficient data selection employing Semantic Similarity-based Graph Structures for model training
Roxana Petcu
Subhadeep Maji
28
1
0
22 Feb 2024
Detecting misinformation through Framing Theory: the Frame Element-based
  Model
Detecting misinformation through Framing Theory: the Frame Element-based Model
Guan-Hua Wang
Rebecca Frederick
Jinglong Duan
William Wong
V. Rupar
Weihua Li
Quan-wei Bai
95
2
0
19 Feb 2024
Utilizing BERT for Information Retrieval: Survey, Applications,
  Resources, and Challenges
Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges
Jiajia Wang
Jimmy Xiangji Huang
Xinhui Tu
Junmei Wang
Angela J. Huang
Md Tahmid Rahman Laskar
Amran Bhuiyan
100
39
0
18 Feb 2024
From Prejudice to Parity: A New Approach to Debiasing Large Language
  Model Word Embeddings
From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings
Aishik Rakshit
Smriti Singh
Shuvam Keshari
Arijit Ghosh Chowdhury
Vinija Jain
Aman Chadha
63
3
0
18 Feb 2024
Exploring ChatGPT for Next-generation Information Retrieval:
  Opportunities and Challenges
Exploring ChatGPT for Next-generation Information Retrieval: Opportunities and Challenges
Yizheng Huang
Jimmy X. Huang
116
11
0
17 Feb 2024
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Nikhil Bhendawade
Irina Belousova
Qichen Fu
Henry Mason
Mohammad Rastegari
Mahyar Najibi
LRM
124
32
0
16 Feb 2024
Understanding Survey Paper Taxonomy about Large Language Models via
  Graph Representation Learning
Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning
Jun Zhuang
C. Kennington
37
10
0
16 Feb 2024
Leveraging the Context through Multi-Round Interactions for Jailbreaking
  Attacks
Leveraging the Context through Multi-Round Interactions for Jailbreaking Attacks
Yixin Cheng
Markos Georgopoulos
Volkan Cevher
Grigorios G. Chrysos
AAML
71
15
0
14 Feb 2024
Eliciting Personality Traits in Large Language Models
Eliciting Personality Traits in Large Language Models
Airlie Hilliard
Cristian Muñoz
Zekun Wu
Adriano Soares Koshiyama
56
11
0
13 Feb 2024
Punctuation Restoration Improves Structure Understanding Without Supervision
Punctuation Restoration Improves Structure Understanding Without Supervision
Junghyun Min
Minho Lee
Woochul Lee
Yeonsoo Lee
157
1
0
13 Feb 2024
OrderBkd: Textual backdoor attack through repositioning
OrderBkd: Textual backdoor attack through repositioning
Irina Alekseevskaia
Konstantin Arkhipenko
75
3
0
12 Feb 2024
Pushing The Limit of LLM Capacity for Text Classification
Pushing The Limit of LLM Capacity for Text Classification
Yazhou Zhang
Mengyao Wang
Chenyu Ren
Qiuchi Li
Prayag Tiwari
Benyou Wang
Jing Qin
VLMAI4TS
100
30
0
12 Feb 2024
UVTM: Universal Vehicle Trajectory Modeling with ST Feature Domain Generation
UVTM: Universal Vehicle Trajectory Modeling with ST Feature Domain Generation
Yan Lin
Jilin Hu
Shengnan Guo
Bin Yang
Christian S. Jensen
Youfang Lin
Huaiyu Wan
110
0
0
11 Feb 2024
Large Language Models: A Survey
Large Language Models: A Survey
Shervin Minaee
Tomas Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALMLM&MAELM
246
425
0
09 Feb 2024
The Fine-Grained Complexity of Gradient Computation for Training Large
  Language Models
The Fine-Grained Complexity of Gradient Computation for Training Large Language Models
Josh Alman
Zhao Song
70
15
0
07 Feb 2024
The Use of a Large Language Model for Cyberbullying Detection
The Use of a Large Language Model for Cyberbullying Detection
Bayode Ogunleye
Babitha Dharmaraj
46
18
0
06 Feb 2024
Harnessing PubMed User Query Logs for Post Hoc Explanations of
  Recommended Similar Articles
Harnessing PubMed User Query Logs for Post Hoc Explanations of Recommended Similar Articles
Ashley Shin
Qiao Jin
James Anibal
Zhiyong Lu
50
0
0
05 Feb 2024
English Prompts are Better for NLI-based Zero-Shot Emotion
  Classification than Target-Language Prompts
English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts
Patrick Bareiss
Roman Klinger
Jeremy Barnes
69
10
0
05 Feb 2024
Empowering Time Series Analysis with Large Language Models: A Survey
Empowering Time Series Analysis with Large Language Models: A Survey
Yushan Jiang
Zijie Pan
Xikun Zhang
Sahil Garg
Anderson Schneider
Yuriy Nevmyvaka
Dongjin Song
AI4TSAIFin
178
56
0
05 Feb 2024
From Partial to Strictly Incremental Constituent Parsing
From Partial to Strictly Incremental Constituent Parsing
Ana Ezquerro
Carlos Gómez-Rodríguez
David Vilares
40
0
0
05 Feb 2024
Exploiting Class Probabilities for Black-box Sentence-level Attacks
Exploiting Class Probabilities for Black-box Sentence-level Attacks
Raha Moraffah
Huan Liu
60
1
0
05 Feb 2024
Advancing Graph Representation Learning with Large Language Models: A
  Comprehensive Survey of Techniques
Advancing Graph Representation Learning with Large Language Models: A Comprehensive Survey of Techniques
Qiheng Mao
Zemin Liu
Chenghao Liu
Zhuo Li
Jianling Sun
69
10
0
04 Feb 2024
Revisiting the Markov Property for Machine Translation
Revisiting the Markov Property for Machine Translation
Cunxiao Du
Hao Zhou
Zhaopeng Tu
Jing Jiang
110
2
0
03 Feb 2024
A Hybrid Strategy for Chat Transcript Summarization
A Hybrid Strategy for Chat Transcript Summarization
Pratik K. Biswas
60
0
0
02 Feb 2024
Does DetectGPT Fully Utilize Perturbation? Bridging Selective
  Perturbation to Fine-tuned Contrastive Learning Detector would be Better
Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better
Shengchao Liu
Xiaoming Liu
Yichen Wang
Zehua Cheng
Chengzhengxu Li
Zhaohan Zhang
Y. Lan
Chao Shen
DeLMO
90
5
0
01 Feb 2024
Emergency Department Decision Support using Clinical Pseudo-notes
Emergency Department Decision Support using Clinical Pseudo-notes
Simon A. Lee
Sujay Jain
Alex Chen
Kyoka Ono
Jennifer Fang
Á. Rudas
Jeffrey N. Chiang
94
12
0
31 Jan 2024
SNNLP: Energy-Efficient Natural Language Processing Using Spiking Neural
  Networks
SNNLP: Energy-Efficient Natural Language Processing Using Spiking Neural Networks
R. A. Knipper
Kaniz Mishty
Mehdi Sadi
Karmaker Santu
46
1
0
31 Jan 2024
Manipulating Predictions over Discrete Inputs in Machine Teaching
Manipulating Predictions over Discrete Inputs in Machine Teaching
Xiaodong Wu
Yufei Han
H. Dahrouj
Jianbing Ni
Zhenwen Liang
Xiangliang Zhang
64
0
0
31 Jan 2024
Fine-tuning Transformer-based Encoder for Turkish Language Understanding
  Tasks
Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks
Savas Yildirim
35
7
0
30 Jan 2024
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models
Weijiao Zhang
Jindong Han
Zhao Xu
Hang Ni
Hao Liu
Hui Xiong
Hui Xiong
AI4CE
248
18
0
30 Jan 2024
Breaking Free Transformer Models: Task-specific Context Attribution
  Promises Improved Generalizability Without Fine-tuning Pre-trained LLMs
Breaking Free Transformer Models: Task-specific Context Attribution Promises Improved Generalizability Without Fine-tuning Pre-trained LLMs
Stepan Tytarenko
Mohammad Ruhul Amin
32
3
0
30 Jan 2024
GuReT: Distinguishing Guilt and Regret related Text
GuReT: Distinguishing Guilt and Regret related Text
S. Butt
F. Balouchzahi
Abdul Gafar Manuel Meque
Maaz Amjad
Hector G. Ceballos Cancino
Grigori Sidorov
Alexander Gelbukh
37
0
0
29 Jan 2024
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in
  BERT pretraining
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining
Wen-Chieh Liang
Youzhi Liang
OffRL
49
2
0
29 Jan 2024
Quantifying Stereotypes in Language
Quantifying Stereotypes in Language
Yang Liu
70
1
0
28 Jan 2024
Previous
123...8910...697071
Next