Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,335 papers shown
Title
VTPNet for 3D deep learning on point cloud
Wei Zhou
Weiwei Jin
Qian Wang
Yifan Wang
Dekui Wang
Xingxing Hao
Yong Yu
3DPC
ViT
14
0
0
10 May 2023
ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models
Thilini Wijesiriwardene
Ruwan Wickramarachchi
Bimal Gajera
Shreeyash Mukul Gowaikar
Chandan Gupta
Aman Chadha
Aishwarya N. Reganti
Amit P. Sheth
Amitava Das
ELM
25
14
0
08 May 2023
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
Anastasiia Grishina
Max Hort
Leon Moonen
22
6
0
08 May 2023
Differentially Private Attention Computation
Yeqi Gao
Zhao Song
Xin Yang
50
20
0
08 May 2023
Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Zhanpeng Zeng
Cole Hawkins
Min-Fong Hong
Aston Zhang
Nikolaos Pappas
Vikas Singh
Shuai Zheng
21
6
0
07 May 2023
Pre-training Language Model as a Multi-perspective Course Learner
Beiduo Chen
Shaohan Huang
Zi-qiang Zhang
Wu Guo
Zhen-Hua Ling
Haizhen Huang
Furu Wei
Weiwei Deng
Qi Zhang
34
0
0
06 May 2023
DiscoPrompt: Path Prediction Prompt Tuning for Implicit Discourse Relation Recognition
Chunkit Chan
Xin Liu
Cheng Jiayang
Zihan Li
Yangqiu Song
Ginny Wong
Simon See
30
30
0
06 May 2023
VendorLink: An NLP approach for Identifying & Linking Vendor Migrants & Potential Aliases on Darknet Markets
V. Saxena
Nils Rethmeier
Gijs Van Dijck
Gerasimos Spanakis
26
6
0
04 May 2023
Using Language Models on Low-end Hardware
Silin Gao
Beatriz Borges
Saya Kanno
Antoine Bosselut
21
0
0
03 May 2023
Calibration Error Estimation Using Fuzzy Binning
Geetanjali Bihani
Julia Taylor Rayz
97
2
0
30 Apr 2023
MMViT: Multiscale Multiview Vision Transformers
Yuchen Liu
Natasha Ong
Kaiyan Peng
Bo Xiong
Qifan Wang
...
Madian Khabsa
Kaiyue Yang
David C. Liu
Donald Williamson
Hanchao Yu
ViT
33
4
0
28 Apr 2023
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis [Experiment, Analysis & Benchmark]
Alexandros Zeakis
G. Papadakis
Dimitrios Skoutas
Manolis Koubarakis
32
37
0
24 Apr 2023
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani
Marek Masiak
Israel Abebe Azime
Jesujoba Oluwadara Alabi
A. Tonja
...
Moges Ahmed Mehamed
Evrard Ngabire
Jules Jules
Ivan Ssenkungu
Pontus Stenetorp
28
24
0
19 Apr 2023
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning
Zheng Lian
Haiyang Sun
Guoying Zhao
Kang Chen
Mingyu Xu
...
Meng Wang
Min Zhang
Guoying Zhao
Björn W. Schuller
Jianhua Tao
40
48
0
18 Apr 2023
MisRoBÆRTa: Transformers versus Misinformation
Ciprian-Octavian Truică
Elena Simona Apostol
27
37
0
16 Apr 2023
Fairness in Visual Clustering: A Novel Transformer Clustering Approach
Xuan-Bac Nguyen
C. Duong
Marios Savvides
Kaushik Roy
Hugh Churchill
Khoa Luu
37
9
0
14 Apr 2023
Context-aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Shiyin Kang
Helen Meng
33
6
0
13 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
32
17
0
10 Apr 2023
Similarity-Aware Multimodal Prompt Learning for Fake News Detection
Ye Jiang
Xiaomin Yu
Yimin Wang
Xiaoman Xu
Xingyi Song
Diana Maynard
29
20
0
09 Apr 2023
Continual Graph Convolutional Network for Text Classification
Tiandeng Wu
Qijiong Liu
Yinhao Cao
yao. huang
Xiao-Ming Wu
Jiandong Ding
GNN
29
10
0
09 Apr 2023
Multi-class Categorization of Reasons behind Mental Disturbance in Long Texts
Muskan Garg
AI4MH
23
2
0
08 Apr 2023
MEGClass: Extremely Weakly Supervised Text Classification via Mutually-Enhancing Text Granularities
Priyanka Kargupta
Tanay Komarlu
Susik Yoon
Xuan Wang
Jiawei Han
36
8
0
04 Apr 2023
Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT
Yi Qi
Xingyu Zhao
Siddartha Khastgir
Xiaowei Huang
24
14
0
03 Apr 2023
MiniRBT: A Two-stage Distilled Small Chinese Pre-trained Model
Xin Yao
Ziqing Yang
Yiming Cui
Shijin Wang
28
3
0
03 Apr 2023
BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection
Sihao Hu
Zhen Zhang
B. Luo
Shengliang Lu
Bingsheng He
Ling Liu
30
39
0
29 Mar 2023
Planning with Sequence Models through Iterative Energy Minimization
Hongyi Chen
Yilun Du
Yiye Chen
J. Tenenbaum
Patricio A. Vela
32
6
0
28 Mar 2023
Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing
Walid Hariri
AI4MH
LM&MA
33
85
0
27 Mar 2023
Informed Machine Learning, Centrality, CNN, Relevant Document Detection, Repatriation of Indigenous Human Remains
M. A. Bashar
R. Nayak
G. Knapman
Paul Turnbull
C. Fforde
37
1
0
25 Mar 2023
SPEC: Summary Preference Decomposition for Low-Resource Abstractive Summarization
Yi-Syuan Chen
Yun-Zhu Song
Hong-Han Shuai
33
6
0
24 Mar 2023
Paraphrase Detection: Human vs. Machine Content
Jonas Becker
Jan Philip Wahle
Terry Ruas
Bela Gipp
DeLMO
35
14
0
24 Mar 2023
Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function
A. B. Siddique
M. H. Maqbool
Kshitija Taywade
H. Foroosh
24
12
0
24 Mar 2023
Human Behavior in the Time of COVID-19: Learning from Big Data
Hanjia Lyu
Arsal Imtiaz
Yufei Zhao
Jiebo Luo
35
6
0
23 Mar 2023
Multi-View Zero-Shot Open Intent Induction from Dialogues: Multi Domain Batch and Proxy Gradient Transfer
Hyukhun Koh
Haesung Pyun
Nakyeong Yang
Kyomin Jung
40
1
0
23 Mar 2023
ChatGPT and a New Academic Reality: Artificial Intelligence-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing
Brady Lund
Ting Wang
Nishith Reddy Mannuru
Bing Nie
S. Shimray
Ziang Wang
AI4CE
15
498
0
21 Mar 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
Learning for Amalgamation: A Multi-Source Transfer Learning Framework For Sentiment Classification
Cuong V. Nguyen
Khiem H. Le
Anh Tran
Quang-Cuong Pham
Binh T. Nguyen
15
14
0
16 Mar 2023
Task-specific Fine-tuning via Variational Information Bottleneck for Weakly-supervised Pathology Whole Slide Image Classification
Honglin Li
Chenglu Zhu
Yunlong Zhang
Yuxuan Sun
Zhongyi Shui
Wenwei Kuang
S. Zheng
L. Yang
69
57
0
15 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELM
ALM
AI4MH
30
37
0
14 Mar 2023
Transformer-based approaches to Sentiment Detection
O. E. Ojo
Hoang Thang Ta
Alexander Gelbukh
Hiram Calvo
O. O. Adebanji
Grigori Sidorov
6
7
0
13 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A. B. Siddique
30
4
0
12 Mar 2023
Generating Query Focused Summaries without Fine-tuning the Transformer-based Pre-trained Models
D. Abdullah
Shamanth Nayak
Gandharv Suri
Yllias Chali
30
2
0
10 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
27
42
0
10 Mar 2023
Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Ning Liao
Bowen Shi
Xiaopeng Zhang
Min Cao
Junchi Yan
Qi Tian
VLM
34
7
0
09 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
35
508
0
07 Mar 2023
GlobalNER: Incorporating Non-local Information into Named Entity Recognition
Chiao-Wei Hsu
Keh-Yih Su
NAI
21
0
0
06 Mar 2023
WADER at SemEval-2023 Task 9: A Weak-labelling framework for Data augmentation in tExt Regression Tasks
Manan Suri
Aaryak Garg
Divya Chaudhary
I. Gorton
B. Kumar
18
1
0
05 Mar 2023
Can BERT Refrain from Forgetting on Sequential Tasks? A Probing Study
Mingxu Tao
Yansong Feng
Dongyan Zhao
CLL
KELM
32
10
0
02 Mar 2023
H-AES: Towards Automated Essay Scoring for Hindi
Shubhankar K. Singh
Anirudh Pupneja
Shivaansh Mital
Cheril Shah
Manish Bawkar
Lakshman Prasad Gupta
Ajit Kumar
Yaman Kumar Singla
Rushali Gupta
R. Shah
21
6
0
28 Feb 2023
HugNLP: A Unified and Comprehensive Library for Natural Language Processing
Jiadong Wang
Nuo Chen
Qiushi Sun
Wenkang Huang
Chengyu Wang
Ming Gao
27
3
0
28 Feb 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
36
101
0
27 Feb 2023
Previous
1
2
3
4
5
6
...
25
26
27
Next