Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.05594
Cited By
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"
50 / 182 papers shown
Title
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
45
0
0
03 Apr 2025
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation
Jutika Borah
H. Singh
MedIm
45
0
0
14 Mar 2025
HASN: Hybrid Attention Separable Network for Efficient Image Super-resolution
Weifeng Cao
Xiaoyan Lei
Jun Shi
Wanyong Liang
Jie Liu
Zongfei Bai
SupR
29
0
0
13 Oct 2024
FoundationGrasp: Generalizable Task-Oriented Grasping with Foundation Models
Chao Tang
Dehao Huang
Wenlong Dong
Ruinian Xu
Hong Zhang
36
11
0
16 Apr 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
51
7
0
28 Mar 2024
CompenHR: Efficient Full Compensation for High-resolution Projector
Yuxi Wang
H. Ling
Bingyao Huang
3DV
16
4
0
22 Nov 2023
Channel-Wise Contrastive Learning for Learning with Noisy Labels
Hui-Sung Kang
Sheng Liu
Huaxi Huang
Tongliang Liu
NoLa
42
0
0
14 Aug 2023
Dual Aggregation Transformer for Image Super-Resolution
Zheng Chen
Yulun Zhang
Jinjin Gu
L. Kong
Xiaokang Yang
Feng Yu
ViT
25
167
0
07 Aug 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
Junjie Fei
Teng Wang
Jinrui Zhang
Zhenyu He
Chengjie Wang
Feng Zheng
VLM
28
34
0
31 Jul 2023
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
32
78
0
29 May 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
38
11
0
08 Apr 2023
Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
Md Mostafijur Rahman
R. Marculescu
MedIm
ViT
24
44
0
29 Mar 2023
SiamTHN: Siamese Target Highlight Network for Visual Tracking
Jiahao Bao
Kaiqiang Chen
Xian Sun
Liangjin Zhao
Wenhui Diao
M. Yan
28
13
0
22 Mar 2023
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Jun Yang
Lizhi Bai
Yaoru Sun
Chunqi Tian
Maoyu Mao
Guorun Wang
SSeg
25
16
0
23 Feb 2023
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Moghaddam
M. Shamsfard
31
4
0
08 Feb 2023
Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges
Yushan Han
Hui Zhang
Huifang Li
Yi Jin
Congyan Lang
Yidong Li
34
100
0
16 Jan 2023
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion
Zizhang Wu
Man Wang
Weiwei Sun
Yuchen Li
Tianhao Xu
Fan Wang
Keke Huang
19
3
0
13 Dec 2022
Semiconductor Defect Pattern Classification by Self-Proliferation-and-Attention Neural Network
Yuanfu Yang
Min Sun
38
6
0
01 Dec 2022
ExpNet: A unified network for Expert-Level Classification
Junde Wu
Huihui Fang
Yehui Yang
Yu Zhang
Haoyi Xiong
Huazhu Fu
Yanwu Xu
27
0
0
29 Nov 2022
Conditioning Covert Geo-Location (CGL) Detection on Semantic Class Information
Binoy Saha
Sukhendu Das
27
0
0
27 Nov 2022
PKCAM: Previous Knowledge Channel Attention Module
Eslam Mohamed Bakr
Ahmad El-Sallab
M. Rashwan
24
1
0
14 Nov 2022
Handwashing Action Detection System for an Autonomous Social Robot
Sreejith Sasidharan
P. Prabha
Devasena Pasupuleti
Anand M. Das
Chaitanya Kapoor
Gayathri Manikutty
Praveen Pankajakshan
Bhavani R. Rao
20
2
0
27 Oct 2022
Prophet Attention: Predicting Attention with Future Attention for Image Captioning
Fenglin Liu
Xuancheng Ren
Xian Wu
Wei Fan
Yuexian Zou
Xu Sun
24
46
0
19 Oct 2022
DCANet: Differential Convolution Attention Network for RGB-D Semantic Segmentation
Lizhi Bai
Jun Yang
Chunqi Tian
Yaoru Sun
Maoyu Mao
Yanjun Xu
Weirong Xu
24
9
0
13 Oct 2022
CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient Object Detection
Runmin Cong
Qin Lin
Chen Zhang
Chongyi Li
Xiaochun Cao
Qingming Huang
Yao-Min Zhao
ObjD
38
124
0
06 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
40
10
0
04 Oct 2022
SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval
Yang Shen
Xuhao Sun
Xiu-Shen Wei
Qing-Yuan Jiang
Jian Yang
34
18
0
28 Sep 2022
Scale Attention for Learning Deep Face Representation: A Study Against Visual Scale Variation
Hailin Shi
Hang Du
Yibo Hu
Jun Wang
Dan Zeng
Ting Yao
CVBM
15
0
0
19 Sep 2022
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results
Ruicheng Feng
Chongyi Li
Shangchen Zhou
W. Sun
Qingpeng Zhu
Jun Jiang
Qingyu Yang
Chen Change Loy
Liang Feng
40
20
0
15 Sep 2022
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention
Jinwoo Hwang
Philipp Benz
Tae-Hoon Kim
ViT
31
3
0
19 Aug 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
16
34
0
18 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
18
0
0
09 Aug 2022
Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation
Xingchen Li
Long Chen
Wenbo Ma
Yi Yang
Jun Xiao
18
26
0
03 Aug 2022
Explicit Image Caption Editing
Zhen Wang
Long Chen
Wenbo Ma
G. Han
Yulei Niu
Jian Shao
Jun Xiao
25
12
0
20 Jul 2022
Trichomonas Vaginalis Segmentation in Microscope Images
Lin Li
Jingyi Liu
Shuo Wang
Xun Wang
Tian-Zhu Xiang
37
7
0
03 Jul 2022
Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window
Qiuli Wang
Xin Tan
Chen Liu
23
0
0
08 Jun 2022
From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Jingkuan Song
Pengpeng Zeng
Lianli Gao
Heng Tao Shen
32
62
0
04 Jun 2022
Egocentric Video-Language Pretraining
Kevin Qinghong Lin
Alex Jinpeng Wang
Mattia Soldan
Michael Wray
Rui Yan
...
Hongfa Wang
Dima Damen
Guohao Li
Wei Liu
Mike Zheng Shou
VLM
EgoV
46
189
0
03 Jun 2022
WaveMix: A Resource-efficient Neural Network for Image Analysis
Pranav Jeevan
Kavitha Viswanathan
S. AnanduA
A. Sethi
20
20
0
28 May 2022
A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification
Hengchao Li
Wen-Shuai Hu
Wei Li
Jun Li
Q. Du
Antonio J. Plaza
31
96
0
09 Apr 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
34
301
0
09 Mar 2022
Adaptive Cross-Layer Attention for Image Restoration
Yancheng Wang
N. Xu
Yingzhen Yang
29
3
0
04 Mar 2022
Video Question Answering: Datasets, Algorithms and Challenges
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
27
85
0
02 Mar 2022
Sensing accident-prone features in urban scenes for proactive driving and accident prevention
Sumit Mishra
Praveenbalaji Rajendran
L. Vecchietti
Dongsoo Har
19
13
0
25 Feb 2022
Visual Attention Network
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViT
VLM
24
637
0
20 Feb 2022
Attention-Based Sensor Fusion for Human Activity Recognition Using IMU Signals
Wenjin Tao
Haodong Chen
Md Moniruzzaman
M. C. Leu
Zhaozheng Yi
Ruwen Qin
16
10
0
20 Dec 2021
Event-guided Deblurring of Unknown Exposure Time Videos
Taewoo Kim
Jungmin Lee
Lin Wang
Kuk-Jin Yoon
19
32
0
13 Dec 2021
Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Praveen Kandula
N. Rajagopalan.A.
35
0
0
12 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
31
3
0
10 Dec 2021
Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Kaifeng Gao
Long Chen
Yulei Niu
Jian Shao
Jun Xiao
15
29
0
08 Dec 2021
1
2
3
4
Next