ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.05594
  4. Cited By
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks
  for Image Captioning
v1v2 (latest)

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
ArXiv (abs)PDFHTML

Papers citing "SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"

50 / 435 papers shown
Title
RefineCap: Concept-Aware Refinement for Image Captioning
RefineCap: Concept-Aware Refinement for Image Captioning
Yekun Chai
Shuo Jin
Junliang Xing
VLM
25
1
0
08 Sep 2021
RGB-D Salient Object Detection with Ubiquitous Target Awareness
RGB-D Salient Object Detection with Ubiquitous Target Awareness
Yifan Zhao
Jiawei Zhao
Jia Li
Xiaowu Chen
89
46
0
08 Sep 2021
Visual Sensation and Perception Computational Models for Deep Learning:
  State of the art, Challenges and Prospects
Visual Sensation and Perception Computational Models for Deep Learning: State of the art, Challenges and Prospects
Bing Wei
Yudi Zhao
K. Hao
Lei Gao
74
5
0
08 Sep 2021
Robust Attentive Deep Neural Network for Exposing GAN-generated Faces
Robust Attentive Deep Neural Network for Exposing GAN-generated Faces
Hui Guo
Shu Hu
Xin Wang
Ming-Ching Chang
Siwei Lyu
CVBM
77
37
0
05 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
100
18
0
20 Aug 2021
Causal Attention for Unbiased Visual Recognition
Causal Attention for Unbiased Visual Recognition
Tan Wang
Chan Zhou
Qianru Sun
Hanwang Zhang
OODCML
110
114
0
19 Aug 2021
Cervical Optical Coherence Tomography Image Classification Based on
  Contrastive Self-Supervised Texture Learning
Cervical Optical Coherence Tomography Image Classification Based on Contrastive Self-Supervised Texture Learning
Kaiyi Chen
Qingbin Wang
Yutao Ma
24
13
0
11 Aug 2021
Learn to Grasp with Less Supervision: A Data-Efficient Maximum
  Likelihood Grasp Sampling Loss
Learn to Grasp with Less Supervision: A Data-Efficient Maximum Likelihood Grasp Sampling Loss
Xinghao Zhu
Yefan Zhou
Yongxiang Fan
Lingfeng Sun
Jianyu Chen
Masayoshi Tomizuka
89
15
0
10 Aug 2021
Transductive Few-Shot Classification on the Oblique Manifold
Transductive Few-Shot Classification on the Oblique Manifold
Guodong Qi
Huimin Yu
Zhaohui Lu
Shuzhao Li
70
45
0
09 Aug 2021
Understanding the computational demands underlying visual reasoning
Understanding the computational demands underlying visual reasoning
Mohit Vaishnav
Rémi Cadène
A. Alamia
Drew Linsley
Rufin VanRullen
Thomas Serre
GNNCoGe
77
17
0
08 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum
  Learning for Image Captioning
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
147
68
0
05 Aug 2021
Efficient Human Pose Estimation by Maximizing Fusion and High-Level
  Spatial Attention
Efficient Human Pose Estimation by Maximizing Fusion and High-Level Spatial Attention
Zhiyuan Ren
Yao Zhou
Yizhe Chen
Rui Zhou
Yayu Gao
3DH
52
4
0
29 Jul 2021
Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Wentian Zhao
Yao Hu
Heda Wang
Xinxiao Wu
Jiebo Luo
55
49
0
26 Jul 2021
High-Resolution Pelvic MRI Reconstruction Using a Generative Adversarial
  Network with Attention and Cyclic Loss
High-Resolution Pelvic MRI Reconstruction Using a Generative Adversarial Network with Attention and Cyclic Loss
Guangyuan Li
Jun Lv
Xiangrong Tong
Chengyan Wang
Guang Yang
MedIm
43
24
0
21 Jul 2021
DRDF: Determining the Importance of Different Multimodal Information
  with Dual-Router Dynamic Framework
DRDF: Determining the Importance of Different Multimodal Information with Dual-Router Dynamic Framework
Haiwen Hong
Xuan Jin
Yin Zhang
Yunqing Hu
Jingfeng Zhang
Yuan He
Hui Xue
MoE
34
0
0
21 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DVVLMMLLM
153
270
0
14 Jul 2021
Fast Pixel-Matching for Video Object Segmentation
Fast Pixel-Matching for Video Object Segmentation
Siyue Yu
Jimin Xiao
Bingfeng Zhang
Eng Gee Lim
VOS
48
9
0
09 Jul 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
71
6
0
26 Jun 2021
All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection
All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection
Meng Cao
Can Zhang
Dongming Yang
Yuexian Zou
53
15
0
24 Jun 2021
Cross-layer Navigation Convolutional Neural Network for Fine-grained
  Visual Classification
Cross-layer Navigation Convolutional Neural Network for Fine-grained Visual Classification
Chenyu Guo
Jiyang Xie
Kongming Liang
Xian Sun
Zhanyu Ma
92
4
0
21 Jun 2021
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Yixin Wang
Zihao Lin
Zhe Xu
Haoyu Dong
Jiang Tian
Jie Luo
Zhongchao Shi
Yang Zhang
Jianping Fan
Zhiqiang He
UQCVMedIm
115
12
0
21 Jun 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
142
7
0
20 Jun 2021
Salient Positions based Attention Network for Image Classification
Salient Positions based Attention Network for Image Classification
Sheng Fang
Kaiyu Li
Zhe Li
70
3
0
09 Jun 2021
Understanding top-down attention using task-oriented ablation design
Understanding top-down attention using task-oriented ablation design
Freddie Bickford-Smith
Brett D. Roads
Xiaoliang Luo
Bradley C. Love
56
1
0
08 Jun 2021
Vision Transformers with Hierarchical Attention
Vision Transformers with Hierarchical Attention
Yun-Hai Liu
Yu-Huan Wu
Guolei Sun
Le Zhang
Ajad Chhatkuli
Luc Van Gool
ViT
87
39
0
06 Jun 2021
RDA: Robust Domain Adaptation via Fourier Adversarial Attacking
RDA: Robust Domain Adaptation via Fourier Adversarial Attacking
Jiaxing Huang
Dayan Guan
Aoran Xiao
Shijian Lu
AAML
113
77
0
05 Jun 2021
Attention mechanisms and deep learning for machine vision: A survey of
  the state of the art
Attention mechanisms and deep learning for machine vision: A survey of the state of the art
A. M. Hafiz
S. A. Parah
R. A. Bhat
93
45
0
03 Jun 2021
New Encoder Learning for Captioning Heavy Rain Images via Semantic
  Visual Feature Matching
New Encoder Learning for Captioning Heavy Rain Images via Semantic Visual Feature Matching
Chang-Hwan Son
Pung-Hwi Ye
130
3
0
28 May 2021
Sparta: Spatially Attentive and Adversarially Robust Activation
Sparta: Spatially Attentive and Adversarially Robust Activation
Qing Guo
Felix Juefei Xu
Changqing Zhou
Wei Feng
Yang Liu
Song Wang
AAML
70
4
0
18 May 2021
Cross-Modality Brain Tumor Segmentation via Bidirectional
  Global-to-Local Unsupervised Domain Adaptation
Cross-Modality Brain Tumor Segmentation via Bidirectional Global-to-Local Unsupervised Domain Adaptation
Kelei He
Wen Ji
Tao Zhou
Zhuoyuan Li
Jing Huo
Xin Zhang
Yang Gao
Dinggang Shen
Bing-Bin Zhang
Junfeng Zhang
OOD
51
6
0
17 May 2021
Are Convolutional Neural Networks or Transformers more like human
  vision?
Are Convolutional Neural Networks or Transformers more like human vision?
Shikhar Tuli
Ishita Dasgupta
Erin Grant
Thomas Griffiths
ViTFaML
88
185
0
15 May 2021
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language
  Matching
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching
Chenchi Zhang
Wenbo Ma
Jun Xiao
Hanwang Zhang
Jian Shao
Yueting Zhuang
Long Chen
86
4
0
12 May 2021
Coupling Intent and Action for Pedestrian Crossing Behavior Prediction
Coupling Intent and Action for Pedestrian Crossing Behavior Prediction
Yu Yao
E. Atkins
Matthew Johnson-Roberson
Ram Vasudevan
Xiaoxiao Du
72
37
0
10 May 2021
CUAB: Convolutional Uncertainty Attention Block Enhanced the Chest X-ray
  Image Analysis
CUAB: Convolutional Uncertainty Attention Block Enhanced the Chest X-ray Image Analysis
Chi-Shiang Wang
Fang Su
T. Lee
Yi-Shan Tsai
Jung-Hsien Chiang
35
3
0
05 May 2021
Attention and Prediction Guided Motion Detection for Low-Contrast Small
  Moving Targets
Attention and Prediction Guided Motion Detection for Low-Contrast Small Moving Targets
Hongxin Wang
Jiannan Zhao
Huatian Wang
Cheng Hu
Jigen Peng
Shigang Yue
80
17
0
27 Apr 2021
Attention in Attention Network for Image Super-Resolution
Attention in Attention Network for Image Super-Resolution
Haoyu Chen
Jinjin Gu
Zhi-Li Zhang
SupR
65
70
0
19 Apr 2021
Beyond Joint Demosaicking and Denoising: An Image Processing Pipeline
  for a Pixel-bin Image Sensor
Beyond Joint Demosaicking and Denoising: An Image Processing Pipeline for a Pixel-bin Image Sensor
S. Sharif
R. A. Naqvi
Mithun Biswas
SupR
78
41
0
19 Apr 2021
Global Guidance Network for Breast Lesion Segmentation in Ultrasound
  Images
Global Guidance Network for Breast Lesion Segmentation in Ultrasound Images
Cheng Xue
Lei Zhu
Huazhu Fu
Xiaowei Hu
Xiaomeng Li
Hai Zhang
Pheng Ann Heng
54
160
0
05 Apr 2021
Text to Image Generation with Semantic-Spatial Aware GAN
Text to Image Generation with Semantic-Spatial Aware GAN
Kaiqin Hu
Wentong Liao
M. Yang
Bodo Rosenhahn
113
121
0
01 Apr 2021
DF^2AM: Dual-level Feature Fusion and Affinity Modeling for RGB-Infrared
  Cross-modality Person Re-identification
DF^2AM: Dual-level Feature Fusion and Affinity Modeling for RGB-Infrared Cross-modality Person Re-identification
Junhui Yin
Zhanyu Ma
Jiyang Xie
Shibo Nie
Kongming Liang
Jun Guo
65
2
0
01 Apr 2021
FANet: A Feedback Attention Network for Improved Biomedical Image
  Segmentation
FANet: A Feedback Attention Network for Improved Biomedical Image Segmentation
Nikhil Kumar Tomar
Debesh Jha
Michael A. Riegler
Haavard D. Johansen
Dag Johansen
J. Rittscher
Pål Halvorsen
Sharib Ali
MedIm
88
154
0
31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
128
198
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
133
61
0
31 Mar 2021
Diagonal Attention and Style-based GAN for Content-Style Disentanglement
  in Image Generation and Translation
Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation
Gihyun Kwon
Jong Chul Ye
118
53
0
30 Mar 2021
Human-like Controllable Image Captioning with Verb-specific Semantic
  Roles
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
Long Chen
Zhihong Jiang
Jun Xiao
Wei Liu
99
77
0
22 Mar 2021
Learning Calibrated-Guidance for Object Detection in Aerial Images
Learning Calibrated-Guidance for Object Detection in Aerial Images
Zongqi Wei
Dong Liang
Dong Zhang
Liyan Zhang
Qixiang Geng
Mingqiang Wei
Huiyu Zhou
87
35
0
21 Mar 2021
Multimodal End-to-End Sparse Model for Emotion Recognition
Multimodal End-to-End Sparse Model for Emotion Recognition
Wenliang Dai
Samuel Cahyawijaya
Zihan Liu
Pascale Fung
CVBM
86
84
0
17 Mar 2021
Bio-Inspired Representation Learning for Visual Attention Prediction
Bio-Inspired Representation Learning for Visual Attention Prediction
Yuan Yuan
Hailong Ning
Xiaoqiang Lu
61
30
0
09 Mar 2021
Over-sampling De-occlusion Attention Network for Prohibited Items
  Detection in Noisy X-ray Images
Over-sampling De-occlusion Attention Network for Prohibited Items Detection in Noisy X-ray Images
Renshuai Tao
Yanlu Wei
Hainan Li
Aishan Liu
Yifu Ding
Haotong Qin
Xianglong Liu
137
18
0
01 Mar 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for
  Image Captioning
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
166
227
0
20 Feb 2021
Previous
123456789
Next