Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.03130
Cited By
A Structured Self-attentive Sentence Embedding
9 March 2017
Zhouhan Lin
Minwei Feng
Cicero Nogueira dos Santos
Mo Yu
Bing Xiang
Bowen Zhou
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Structured Self-attentive Sentence Embedding"
50 / 205 papers shown
Title
Hierarchical Attention Network for Interpretable ECG-based Heart Disease Classification
Mario Padilla Rodriguez
Mohamed Nafea
28
0
0
25 Mar 2025
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
41
2
0
02 Mar 2025
UniASM: Binary Code Similarity Detection without Fine-tuning
Yeming Gu
Hui Shu
Fei Kang
Fan Hu
64
10
0
21 Feb 2025
Comply: Learning Sentences with Complex Weights inspired by Fruit Fly Olfaction
Alexei Figueroa
Justus Westerhoff
Golzar Atefi
Dennis Fast
B. Winter
Felix Alexader Gers
Alexander Loser
Wolfang Nejdl
52
0
0
03 Feb 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
Duc Hau Nguyen
Duc Hau Nguyen
Pascale Sébillot
47
5
0
23 Jan 2025
Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson's Disease Stage Prediction
Guan-Hua Huang
Wan-Chen Lai
Tai-Been Chen
Chien-Chin Hsu
Huei-Yung Chen
Yi-Chen Wu
Li-Ren Yeh
MedIm
34
2
0
31 Oct 2024
MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks
Elad Hirsch
Gefen Dawidowicz
A. Tal
MedIm
28
1
0
04 Jul 2024
Semantic-Enhanced Relational Metric Learning for Recommender Systems
Mingming Li
Fuqing Zhu
Feng Yuan
Songlin Hu
37
0
0
07 Jun 2024
Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models
Yue Zhang
Hehe Fan
Yi Yang
43
3
0
24 May 2024
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
A. Fuller
Daniel G. Kyrollos
Yousef Yassin
James R. Green
46
2
0
22 May 2024
On Defining Smart Cities using Transformer Neural Networks
Andrei Khurshudov
14
0
0
20 Feb 2024
LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization
Muhammad Farid Adilazuarda
Samuel Cahyawijaya
Alham Fikri Aji
Genta Indra Winata
Ayu Purwarianti
19
5
0
11 Jan 2024
Multi-View Fusion and Distillation for Subgrade Distresses Detection based on 3D-GPR
Chunpeng Zhou
Kang Ning
Haishuai Wang
Zhi Yu
Sheng Zhou
Jiajun Bu
16
1
0
09 Aug 2023
Learning without Forgetting for Vision-Language Models
Da-Wei Zhou
Yuanhan Zhang
Jingyi Ning
Jingyi Ning
De-Chuan Zhan
De-Chuan Zhan
Ziwei Liu
VLM
CLL
71
37
0
30 May 2023
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator
Ziwei He
Meng-Da Yang
Minwei Feng
Jingcheng Yin
X. Wang
Jingwen Leng
Zhouhan Lin
ViT
29
11
0
24 May 2023
Student-friendly Knowledge Distillation
Mengyang Yuan
Bo Lang
Fengnan Quan
18
17
0
18 May 2023
Efficient Attention via Control Variates
Lin Zheng
Jianbo Yuan
Chong-Jun Wang
Lingpeng Kong
26
18
0
09 Feb 2023
SLCNN: Sentence-Level Convolutional Neural Network for Text Classification
A. Jarrahi
Ramin Mousa
Leila Safari
AILaw
10
2
0
27 Jan 2023
Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection
Shaoxiong Ji
Ya Gao
Pekka Marttinen
GNN
17
3
0
25 Jan 2023
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Ziyu Wan
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
J. Wang
AI4CE
16
10
0
24 Dec 2022
Fine-Grained Distillation for Long Document Retrieval
Yucheng Zhou
Tao Shen
Xiubo Geng
Chongyang Tao
Guodong Long
Can Xu
Daxin Jiang
RALM
19
28
0
20 Dec 2022
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks
Xiaowei Chi
Jiaming Liu
Ming Lu
Rongyu Zhang
Zhaoqing Wang
Yandong Guo
Shanghang Zhang
3DPC
38
19
0
02 Dec 2022
A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding
Zhihong Zhu
Weiyuan Xu
Xuxin Cheng
Tengtao Song
Yuexian Zou
19
22
0
08 Nov 2022
Conversation Disentanglement with Bi-Level Contrastive Learning
Chengyu Huang
Zheng Zhang
Hao Fei
Lizi Liao
DRL
22
7
0
27 Oct 2022
Revision for Concision: A Constrained Paraphrase Generation Task
Wenchuan Mu
Kwanin Lim
25
3
0
25 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences
Mudit Verma
Katherine Metcalf
25
8
0
17 Oct 2022
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling
Jinchao Zhang
Shuyang Jiang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
3DV
39
9
0
14 Oct 2022
Bayesian Neural Network Language Modeling for Speech Recognition
Boyang Xue
Shoukang Hu
Junhao Xu
Mengzhe Geng
Xunying Liu
Helen M. Meng
UQCV
BDL
31
14
0
28 Aug 2022
EGFR Mutation Prediction of Lung Biopsy Images using Deep Learning
R. Gupta
Shivani Nandgaonkar
Nikhil Cherian Kurian
S. Rane
A. Sethi
MedIm
29
7
0
26 Aug 2022
Causal Intervention Improves Implicit Sentiment Analysis
Siyin Wang
Jie Zhou
Changzhi Sun
Junjie Ye
Tao Gui
Qi Zhang
Xuanjing Huang
35
16
0
19 Aug 2022
3D Siamese Transformer Network for Single Object Tracking on Point Clouds
Le Hui
Lingpeng Wang
Ling-Yu Tang
Kaihao Lan
Jin Xie
Jian Yang
ViT
3DPC
25
59
0
25 Jul 2022
Improving Multi-Interest Network with Stable Learning
Zhaocheng Liu
Yingtao Luo
Di Zeng
Qiang Liu
Daqing Chang
Dongying Kong
Zhi Chen
HAI
44
1
0
14 Jul 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
36
149
0
27 Apr 2022
A General Survey on Attention Mechanisms in Deep Learning
Gianni Brauwers
Flavius Frasincar
31
296
0
27 Mar 2022
Class-Incremental Learning for Action Recognition in Videos
Jaeyoo Park
Minsoo Kang
Bohyung Han
CLL
19
52
0
25 Mar 2022
Simplicial Attention Neural Networks
L. Giusti
Claudio Battiloro
P. Lorenzo
S. Sardellitti
Sergio Barbarossa
38
32
0
14 Mar 2022
Integrating Dependency Tree Into Self-attention for Sentence Representation
Junhua Ma
Jiajun Li
Yuxuan Liu
Shangbo Zhou
Xue Li
15
2
0
11 Mar 2022
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors
Wen Wu
C. Zhang
Xixin Wu
P. Woodland
46
14
0
08 Mar 2022
A Topology-Attention ConvLSTM Network and Its Application to EM Images
Jiaqi Yang
Xiaoling Hu
Chao Chen
Chialing Tsai
3DPC
MedIm
14
3
0
07 Feb 2022
Unsupervised Image Fusion Method based on Feature Mutual Mapping
D. Rao
Xiaojun Wu
Tianyang Xu
Guoyang Chen
16
0
0
25 Jan 2022
Online Deep Learning based on Auto-Encoder
Siyun Zhang
Jian-wei Liu
Xin Zuo
Run-kun Lu
Siming Lian
22
6
0
19 Jan 2022
A Unified Review of Deep Learning for Automated Medical Coding
Shaoxiong Ji
Wei Sun
Xiaobo Li
Hang Dong
Ara Taalas
Yijia Zhang
Honghan Wu
Esa Pitkänen
Pekka Marttinen
AI4TS
MedIm
21
27
0
08 Jan 2022
Self-attention Multi-view Representation Learning with Diversity-promoting Complementarity
Jian-wei Liu
Xi-hao Ding
Run-kun Lu
Xiong-lin Luo
8
1
0
01 Jan 2022
Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Wenchi Ma
Tianxiao Zhang
Guanghui Wang
ViT
30
14
0
26 Dec 2021
Mixed Precision of Quantization of Transformer Language Models for Speech Recognition
Junhao Xu
Shoukang Hu
Jianwei Yu
Xunying Liu
Helen M. Meng
MQ
32
15
0
29 Nov 2021
A Probabilistic Hard Attention Model For Sequentially Observed Scenes
Samrudhdhi B. Rangrej
James J. Clark
24
12
0
15 Nov 2021
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation
Fenglin Liu
Chenyu You
Xian Wu
Shen Ge
Sheng Wang
Xu Sun
MedIm
81
91
0
08 Nov 2021
Developing neural machine translation models for Hungarian-English
A. Nagy
28
1
0
07 Nov 2021
End-to-End Speech Emotion Recognition: Challenges of Real-Life Emergency Call Centers Data Recordings
Théo Deschamps-Berger
L. Lamel
Laurence Devillers
39
27
0
28 Oct 2021
Combining Vagueness Detection with Deep Learning to Identify Fake News
Paul Guélorget
B. Icard
Guillaume Gadek
Souhir Gahbiche-Braham
S. Gatepaille
G. Atemezing
Paul Égré
22
10
0
27 Oct 2021
1
2
3
4
5
Next