Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.05594
Cited By
v1
v2 (latest)
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"
50 / 435 papers shown
Title
RefineCap: Concept-Aware Refinement for Image Captioning
Yekun Chai
Shuo Jin
Junliang Xing
VLM
25
1
0
08 Sep 2021
RGB-D Salient Object Detection with Ubiquitous Target Awareness
Yifan Zhao
Jiawei Zhao
Jia Li
Xiaowu Chen
89
46
0
08 Sep 2021
Visual Sensation and Perception Computational Models for Deep Learning: State of the art, Challenges and Prospects
Bing Wei
Yudi Zhao
K. Hao
Lei Gao
74
5
0
08 Sep 2021
Robust Attentive Deep Neural Network for Exposing GAN-generated Faces
Hui Guo
Shu Hu
Xin Wang
Ming-Ching Chang
Siwei Lyu
CVBM
77
37
0
05 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
100
18
0
20 Aug 2021
Causal Attention for Unbiased Visual Recognition
Tan Wang
Chan Zhou
Qianru Sun
Hanwang Zhang
OOD
CML
110
114
0
19 Aug 2021
Cervical Optical Coherence Tomography Image Classification Based on Contrastive Self-Supervised Texture Learning
Kaiyi Chen
Qingbin Wang
Yutao Ma
24
13
0
11 Aug 2021
Learn to Grasp with Less Supervision: A Data-Efficient Maximum Likelihood Grasp Sampling Loss
Xinghao Zhu
Yefan Zhou
Yongxiang Fan
Lingfeng Sun
Jianyu Chen
Masayoshi Tomizuka
89
15
0
10 Aug 2021
Transductive Few-Shot Classification on the Oblique Manifold
Guodong Qi
Huimin Yu
Zhaohui Lu
Shuzhao Li
70
45
0
09 Aug 2021
Understanding the computational demands underlying visual reasoning
Mohit Vaishnav
Rémi Cadène
A. Alamia
Drew Linsley
Rufin VanRullen
Thomas Serre
GNN
CoGe
77
17
0
08 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
147
68
0
05 Aug 2021
Efficient Human Pose Estimation by Maximizing Fusion and High-Level Spatial Attention
Zhiyuan Ren
Yao Zhou
Yizhe Chen
Rui Zhou
Yayu Gao
3DH
52
4
0
29 Jul 2021
Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Wentian Zhao
Yao Hu
Heda Wang
Xinxiao Wu
Jiebo Luo
55
49
0
26 Jul 2021
High-Resolution Pelvic MRI Reconstruction Using a Generative Adversarial Network with Attention and Cyclic Loss
Guangyuan Li
Jun Lv
Xiangrong Tong
Chengyan Wang
Guang Yang
MedIm
43
24
0
21 Jul 2021
DRDF: Determining the Importance of Different Multimodal Information with Dual-Router Dynamic Framework
Haiwen Hong
Xuan Jin
Yin Zhang
Yunqing Hu
Jingfeng Zhang
Yuan He
Hui Xue
MoE
34
0
0
21 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
153
270
0
14 Jul 2021
Fast Pixel-Matching for Video Object Segmentation
Siyue Yu
Jimin Xiao
Bingfeng Zhang
Eng Gee Lim
VOS
48
9
0
09 Jul 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
71
6
0
26 Jun 2021
All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection
Meng Cao
Can Zhang
Dongming Yang
Yuexian Zou
53
15
0
24 Jun 2021
Cross-layer Navigation Convolutional Neural Network for Fine-grained Visual Classification
Chenyu Guo
Jiyang Xie
Kongming Liang
Xian Sun
Zhanyu Ma
92
4
0
21 Jun 2021
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Yixin Wang
Zihao Lin
Zhe Xu
Haoyu Dong
Jiang Tian
Jie Luo
Zhongchao Shi
Yang Zhang
Jianping Fan
Zhiqiang He
UQCV
MedIm
115
12
0
21 Jun 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
142
7
0
20 Jun 2021
Salient Positions based Attention Network for Image Classification
Sheng Fang
Kaiyu Li
Zhe Li
70
3
0
09 Jun 2021
Understanding top-down attention using task-oriented ablation design
Freddie Bickford-Smith
Brett D. Roads
Xiaoliang Luo
Bradley C. Love
56
1
0
08 Jun 2021
Vision Transformers with Hierarchical Attention
Yun-Hai Liu
Yu-Huan Wu
Guolei Sun
Le Zhang
Ajad Chhatkuli
Luc Van Gool
ViT
87
39
0
06 Jun 2021
RDA: Robust Domain Adaptation via Fourier Adversarial Attacking
Jiaxing Huang
Dayan Guan
Aoran Xiao
Shijian Lu
AAML
113
77
0
05 Jun 2021
Attention mechanisms and deep learning for machine vision: A survey of the state of the art
A. M. Hafiz
S. A. Parah
R. A. Bhat
93
45
0
03 Jun 2021
New Encoder Learning for Captioning Heavy Rain Images via Semantic Visual Feature Matching
Chang-Hwan Son
Pung-Hwi Ye
130
3
0
28 May 2021
Sparta: Spatially Attentive and Adversarially Robust Activation
Qing Guo
Felix Juefei Xu
Changqing Zhou
Wei Feng
Yang Liu
Song Wang
AAML
70
4
0
18 May 2021
Cross-Modality Brain Tumor Segmentation via Bidirectional Global-to-Local Unsupervised Domain Adaptation
Kelei He
Wen Ji
Tao Zhou
Zhuoyuan Li
Jing Huo
Xin Zhang
Yang Gao
Dinggang Shen
Bing-Bin Zhang
Junfeng Zhang
OOD
51
6
0
17 May 2021
Are Convolutional Neural Networks or Transformers more like human vision?
Shikhar Tuli
Ishita Dasgupta
Erin Grant
Thomas Griffiths
ViT
FaML
88
185
0
15 May 2021
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching
Chenchi Zhang
Wenbo Ma
Jun Xiao
Hanwang Zhang
Jian Shao
Yueting Zhuang
Long Chen
86
4
0
12 May 2021
Coupling Intent and Action for Pedestrian Crossing Behavior Prediction
Yu Yao
E. Atkins
Matthew Johnson-Roberson
Ram Vasudevan
Xiaoxiao Du
72
37
0
10 May 2021
CUAB: Convolutional Uncertainty Attention Block Enhanced the Chest X-ray Image Analysis
Chi-Shiang Wang
Fang Su
T. Lee
Yi-Shan Tsai
Jung-Hsien Chiang
35
3
0
05 May 2021
Attention and Prediction Guided Motion Detection for Low-Contrast Small Moving Targets
Hongxin Wang
Jiannan Zhao
Huatian Wang
Cheng Hu
Jigen Peng
Shigang Yue
80
17
0
27 Apr 2021
Attention in Attention Network for Image Super-Resolution
Haoyu Chen
Jinjin Gu
Zhi-Li Zhang
SupR
65
70
0
19 Apr 2021
Beyond Joint Demosaicking and Denoising: An Image Processing Pipeline for a Pixel-bin Image Sensor
S. Sharif
R. A. Naqvi
Mithun Biswas
SupR
78
41
0
19 Apr 2021
Global Guidance Network for Breast Lesion Segmentation in Ultrasound Images
Cheng Xue
Lei Zhu
Huazhu Fu
Xiaowei Hu
Xiaomeng Li
Hai Zhang
Pheng Ann Heng
54
160
0
05 Apr 2021
Text to Image Generation with Semantic-Spatial Aware GAN
Kaiqin Hu
Wentong Liao
M. Yang
Bodo Rosenhahn
113
121
0
01 Apr 2021
DF^2AM: Dual-level Feature Fusion and Affinity Modeling for RGB-Infrared Cross-modality Person Re-identification
Junhui Yin
Zhanyu Ma
Jiyang Xie
Shibo Nie
Kongming Liang
Jun Guo
65
2
0
01 Apr 2021
FANet: A Feedback Attention Network for Improved Biomedical Image Segmentation
Nikhil Kumar Tomar
Debesh Jha
Michael A. Riegler
Haavard D. Johansen
Dag Johansen
J. Rittscher
Pål Halvorsen
Sharib Ali
MedIm
88
154
0
31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
128
198
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
133
61
0
31 Mar 2021
Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation
Gihyun Kwon
Jong Chul Ye
118
53
0
30 Mar 2021
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
Long Chen
Zhihong Jiang
Jun Xiao
Wei Liu
99
77
0
22 Mar 2021
Learning Calibrated-Guidance for Object Detection in Aerial Images
Zongqi Wei
Dong Liang
Dong Zhang
Liyan Zhang
Qixiang Geng
Mingqiang Wei
Huiyu Zhou
87
35
0
21 Mar 2021
Multimodal End-to-End Sparse Model for Emotion Recognition
Wenliang Dai
Samuel Cahyawijaya
Zihan Liu
Pascale Fung
CVBM
86
84
0
17 Mar 2021
Bio-Inspired Representation Learning for Visual Attention Prediction
Yuan Yuan
Hailong Ning
Xiaoqiang Lu
61
30
0
09 Mar 2021
Over-sampling De-occlusion Attention Network for Prohibited Items Detection in Noisy X-ray Images
Renshuai Tao
Yanlu Wei
Hainan Li
Aishan Liu
Yifu Ding
Haotong Qin
Xianglong Liu
137
18
0
01 Mar 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
166
227
0
20 Feb 2021
Previous
1
2
3
4
5
6
7
8
9
Next