ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.05594
  4. Cited By
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks
  for Image Captioning

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
ArXivPDFHTML

Papers citing "SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"

50 / 182 papers shown
Title
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
45
0
0
03 Apr 2025
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation
Jutika Borah
H. Singh
MedIm
45
0
0
14 Mar 2025
HASN: Hybrid Attention Separable Network for Efficient Image
  Super-resolution
HASN: Hybrid Attention Separable Network for Efficient Image Super-resolution
Weifeng Cao
Xiaoyan Lei
Jun Shi
Wanyong Liang
Jie Liu
Zongfei Bai
SupR
29
0
0
13 Oct 2024
FoundationGrasp: Generalizable Task-Oriented Grasping with Foundation
  Models
FoundationGrasp: Generalizable Task-Oriented Grasping with Foundation Models
Chao Tang
Dehao Huang
Wenlong Dong
Ruinian Xu
Hong Zhang
36
11
0
16 Apr 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques
  and Insights
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
51
7
0
28 Mar 2024
CompenHR: Efficient Full Compensation for High-resolution Projector
CompenHR: Efficient Full Compensation for High-resolution Projector
Yuxi Wang
H. Ling
Bingyao Huang
3DV
16
4
0
22 Nov 2023
Channel-Wise Contrastive Learning for Learning with Noisy Labels
Channel-Wise Contrastive Learning for Learning with Noisy Labels
Hui-Sung Kang
Sheng Liu
Huaxi Huang
Tongliang Liu
NoLa
42
0
0
14 Aug 2023
Dual Aggregation Transformer for Image Super-Resolution
Dual Aggregation Transformer for Image Super-Resolution
Zheng Chen
Yulun Zhang
Jinjin Gu
L. Kong
Xiaokang Yang
Feng Yu
ViT
25
167
0
07 Aug 2023
Transferable Decoding with Visual Entities for Zero-Shot Image
  Captioning
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
Junjie Fei
Teng Wang
Jinrui Zhang
Zhenyu He
Chengjie Wang
Feng Zheng
VLM
28
34
0
31 Jul 2023
Contextual Object Detection with Multimodal Large Language Models
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
32
78
0
29 May 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
38
11
0
08 Apr 2023
Multi-scale Hierarchical Vision Transformer with Cascaded Attention
  Decoding for Medical Image Segmentation
Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
Md Mostafijur Rahman
R. Marculescu
MedIm
ViT
24
44
0
29 Mar 2023
SiamTHN: Siamese Target Highlight Network for Visual Tracking
SiamTHN: Siamese Target Highlight Network for Visual Tracking
Jiahao Bao
Kaiqiang Chen
Xian Sun
Liangjin Zhao
Wenhui Diao
M. Yan
28
13
0
22 Mar 2023
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Jun Yang
Lizhi Bai
Yaoru Sun
Chunqi Tian
Maoyu Mao
Guorun Wang
SSeg
25
16
0
23 Feb 2023
Stacked Cross-modal Feature Consolidation Attention Networks for Image
  Captioning
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Moghaddam
M. Shamsfard
31
4
0
08 Feb 2023
Collaborative Perception in Autonomous Driving: Methods, Datasets and
  Challenges
Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges
Yushan Han
Hui Zhang
Huifang Li
Yi Jin
Congyan Lang
Yidong Li
34
100
0
16 Jan 2023
CAT: Learning to Collaborate Channel and Spatial Attention from
  Multi-Information Fusion
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion
Zizhang Wu
Man Wang
Weiwei Sun
Yuchen Li
Tianhao Xu
Fan Wang
Keke Huang
19
3
0
13 Dec 2022
Semiconductor Defect Pattern Classification by
  Self-Proliferation-and-Attention Neural Network
Semiconductor Defect Pattern Classification by Self-Proliferation-and-Attention Neural Network
Yuanfu Yang
Min Sun
38
6
0
01 Dec 2022
ExpNet: A unified network for Expert-Level Classification
ExpNet: A unified network for Expert-Level Classification
Junde Wu
Huihui Fang
Yehui Yang
Yu Zhang
Haoyi Xiong
Huazhu Fu
Yanwu Xu
27
0
0
29 Nov 2022
Conditioning Covert Geo-Location (CGL) Detection on Semantic Class
  Information
Conditioning Covert Geo-Location (CGL) Detection on Semantic Class Information
Binoy Saha
Sukhendu Das
27
0
0
27 Nov 2022
PKCAM: Previous Knowledge Channel Attention Module
PKCAM: Previous Knowledge Channel Attention Module
Eslam Mohamed Bakr
Ahmad El-Sallab
M. Rashwan
24
1
0
14 Nov 2022
Handwashing Action Detection System for an Autonomous Social Robot
Handwashing Action Detection System for an Autonomous Social Robot
Sreejith Sasidharan
P. Prabha
Devasena Pasupuleti
Anand M. Das
Chaitanya Kapoor
Gayathri Manikutty
Praveen Pankajakshan
Bhavani R. Rao
20
2
0
27 Oct 2022
Prophet Attention: Predicting Attention with Future Attention for Image
  Captioning
Prophet Attention: Predicting Attention with Future Attention for Image Captioning
Fenglin Liu
Xuancheng Ren
Xian Wu
Wei Fan
Yuexian Zou
Xu Sun
24
46
0
19 Oct 2022
DCANet: Differential Convolution Attention Network for RGB-D Semantic
  Segmentation
DCANet: Differential Convolution Attention Network for RGB-D Semantic Segmentation
Lizhi Bai
Jun Yang
Chunqi Tian
Yaoru Sun
Maoyu Mao
Yanjun Xu
Weirong Xu
24
9
0
13 Oct 2022
CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient
  Object Detection
CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient Object Detection
Runmin Cong
Qin Lin
Chen Zhang
Chongyi Li
Xiaochun Cao
Qingming Huang
Yao-Min Zhao
ObjD
38
124
0
06 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image
  Captioning
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
40
10
0
04 Oct 2022
SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image
  Retrieval
SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval
Yang Shen
Xuhao Sun
Xiu-Shen Wei
Qing-Yuan Jiang
Jian Yang
34
18
0
28 Sep 2022
Scale Attention for Learning Deep Face Representation: A Study Against
  Visual Scale Variation
Scale Attention for Learning Deep Face Representation: A Study Against Visual Scale Variation
Hailin Shi
Hang Du
Yibo Hu
Jun Wang
Dan Zeng
Ting Yao
CVBM
15
0
0
19 Sep 2022
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods
  and Results
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results
Ruicheng Feng
Chongyi Li
Shangchen Zhou
W. Sun
Qingpeng Zhu
Jun Jiang
Qingyu Yang
Chen Change Loy
Liang Feng
40
20
0
15 Sep 2022
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview
  Pedestrian Detection with Attention
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention
Jinwoo Hwang
Philipp Benz
Tae-Hoon Kim
ViT
31
3
0
19 Aug 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate
  Semantic Attention Refinement
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
16
34
0
18 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
18
0
0
09 Aug 2022
Integrating Object-aware and Interaction-aware Knowledge for Weakly
  Supervised Scene Graph Generation
Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation
Xingchen Li
Long Chen
Wenbo Ma
Yi Yang
Jun Xiao
18
26
0
03 Aug 2022
Explicit Image Caption Editing
Explicit Image Caption Editing
Zhen Wang
Long Chen
Wenbo Ma
G. Han
Yulei Niu
Jian Shao
Jun Xiao
25
12
0
20 Jul 2022
Trichomonas Vaginalis Segmentation in Microscope Images
Trichomonas Vaginalis Segmentation in Microscope Images
Lin Li
Jingyi Liu
Shuo Wang
Xun Wang
Tian-Zhu Xiang
37
7
0
03 Jul 2022
Dual Windows Are Significant: Learning from Mediastinal Window and
  Focusing on Lung Window
Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window
Qiuli Wang
Xin Tan
Chen Liu
23
0
0
08 Jun 2022
From Pixels to Objects: Cubic Visual Attention for Visual Question
  Answering
From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Jingkuan Song
Pengpeng Zeng
Lianli Gao
Heng Tao Shen
32
62
0
04 Jun 2022
Egocentric Video-Language Pretraining
Egocentric Video-Language Pretraining
Kevin Qinghong Lin
Alex Jinpeng Wang
Mattia Soldan
Michael Wray
Rui Yan
...
Hongfa Wang
Dima Damen
Guohao Li
Wei Liu
Mike Zheng Shou
VLM
EgoV
46
189
0
03 Jun 2022
WaveMix: A Resource-efficient Neural Network for Image Analysis
WaveMix: A Resource-efficient Neural Network for Image Analysis
Pranav Jeevan
Kavitha Viswanathan
S. AnanduA
A. Sethi
20
20
0
28 May 2022
A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural
  Network for Multisource Remote Sensing Data Classification
A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification
Hengchao Li
Wen-Shuai Hu
Wei Li
Jun Li
Q. Du
Antonio J. Plaza
31
96
0
09 Apr 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with
  Transformers
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
34
301
0
09 Mar 2022
Adaptive Cross-Layer Attention for Image Restoration
Adaptive Cross-Layer Attention for Image Restoration
Yancheng Wang
N. Xu
Yingzhen Yang
29
3
0
04 Mar 2022
Video Question Answering: Datasets, Algorithms and Challenges
Video Question Answering: Datasets, Algorithms and Challenges
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
27
85
0
02 Mar 2022
Sensing accident-prone features in urban scenes for proactive driving
  and accident prevention
Sensing accident-prone features in urban scenes for proactive driving and accident prevention
Sumit Mishra
Praveenbalaji Rajendran
L. Vecchietti
Dongsoo Har
19
13
0
25 Feb 2022
Visual Attention Network
Visual Attention Network
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViT
VLM
24
637
0
20 Feb 2022
Attention-Based Sensor Fusion for Human Activity Recognition Using IMU
  Signals
Attention-Based Sensor Fusion for Human Activity Recognition Using IMU Signals
Wenjin Tao
Haodong Chen
Md Moniruzzaman
M. C. Leu
Zhaozheng Yi
Ruwen Qin
16
10
0
20 Dec 2021
Event-guided Deblurring of Unknown Exposure Time Videos
Event-guided Deblurring of Unknown Exposure Time Videos
Taewoo Kim
Jungmin Lee
Lin Wang
Kuk-Jin Yoon
19
32
0
13 Dec 2021
Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Praveen Kandula
N. Rajagopalan.A.
35
0
0
12 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
31
3
0
10 Dec 2021
Classification-Then-Grounding: Reformulating Video Scene Graphs as
  Temporal Bipartite Graphs
Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Kaifeng Gao
Long Chen
Yulei Niu
Jian Shao
Jun Xiao
15
29
0
08 Dec 2021
1234
Next