ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.05594
  4. Cited By
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks
  for Image Captioning
v1v2 (latest)

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
ArXiv (abs)PDFHTML

Papers citing "SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"

50 / 435 papers shown
Title
Egocentric Video-Language Pretraining
Egocentric Video-Language Pretraining
Kevin Qinghong Lin
Alex Jinpeng Wang
Mattia Soldan
Michael Wray
Rui Yan
...
Hongfa Wang
Dima Damen
Guohao Li
Wei Liu
Mike Zheng Shou
VLMEgoV
104
207
0
03 Jun 2022
WaveMix: A Resource-efficient Neural Network for Image Analysis
WaveMix: A Resource-efficient Neural Network for Image Analysis
Pranav Jeevan
Kavitha Viswanathan
S. AnanduA
A. Sethi
111
21
0
28 May 2022
Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual
  Context for Image Captioning
Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Chia-Wen Kuo
Z. Kira
97
56
0
09 May 2022
A survey on attention mechanisms for medical applications: are we moving
  towards better algorithms?
A survey on attention mechanisms for medical applications: are we moving towards better algorithms?
Tiago Gonçalves
Isabel Rio-Torto
Luís F. Teixeira
J. S. Cardoso
OODMedIm
103
41
0
26 Apr 2022
Attention in Reasoning: Dataset, Analysis, and Modeling
Attention in Reasoning: Dataset, Analysis, and Modeling
Shi Chen
Ming Jiang
Jinhui Yang
Qi Zhao
LRM
48
3
0
20 Apr 2022
Visual Attention Methods in Deep Learning: An In-Depth Survey
Visual Attention Methods in Deep Learning: An In-Depth Survey
Mohammed Hassanin
Saeed Anwar
Ibrahim Radwan
Fahad Shahbaz Khan
Ajmal Mian
136
166
0
16 Apr 2022
Image Captioning In the Transformer Age
Image Captioning In the Transformer Age
Yangliu Xu
Li Li
Haiyang Xu
Songfang Huang
Fei Huang
Jianfei Cai
ViT
59
6
0
15 Apr 2022
A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural
  Network for Multisource Remote Sensing Data Classification
A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification
Hengchao Li
Wen-Shuai Hu
Wei Li
Jun Li
Q. Du
Antonio J. Plaza
57
97
0
09 Apr 2022
Revisiting Near/Remote Sensing with Geospatial Attention
Revisiting Near/Remote Sensing with Geospatial Attention
Scott Workman
M. U. Rafique
Hunter Blanton
Nathan Jacobs
121
17
0
04 Apr 2022
Point-Unet: A Context-aware Point-based Neural Network for Volumetric
  Segmentation
Point-Unet: A Context-aware Point-based Neural Network for Volumetric Segmentation
Ngoc-Vuong Ho
Tan H. Nguyen
Gia-Han Diep
Ngan Le
Binh-Son Hua
3DPC
80
24
0
16 Mar 2022
Dynamic Instance Domain Adaptation
Dynamic Instance Domain Adaptation
Zhongying Deng
Kaiyang Zhou
Da Li
Junjun He
Yi-Zhe Song
Tao Xiang
OOD
99
34
0
09 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with
  Transformers
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
98
335
0
09 Mar 2022
Structure-Aware Flow Generation for Human Body Reshaping
Structure-Aware Flow Generation for Human Body Reshaping
Jianqiang Ren
Yuan Yao
Biwen Lei
Miaomiao Cui
Xuansong Xie
3DH
55
6
0
09 Mar 2022
Adaptive Cross-Layer Attention for Image Restoration
Adaptive Cross-Layer Attention for Image Restoration
Yancheng Wang
N. Xu
Yingzhen Yang
93
3
0
04 Mar 2022
ADVISE: ADaptive Feature Relevance and VISual Explanations for
  Convolutional Neural Networks
ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks
Mohammad Mahdi Dehshibi
Mona Ashtari-Majlan
Gereziher W. Adhane
David Masip
AAMLFAtt
52
3
0
02 Mar 2022
Video Question Answering: Datasets, Algorithms and Challenges
Video Question Answering: Datasets, Algorithms and Challenges
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
124
93
0
02 Mar 2022
Sensing accident-prone features in urban scenes for proactive driving
  and accident prevention
Sensing accident-prone features in urban scenes for proactive driving and accident prevention
Sumit Mishra
Praveenbalaji Rajendran
L. Vecchietti
Dongsoo Har
74
13
0
25 Feb 2022
Faithful learning with sure data for lung nodule diagnosis
Faithful learning with sure data for lung nodule diagnosis
Hanxiao Zhang
Liang Chen
Xiao Gu
Minghui Zhang
Yulei Qin
Feng Yao
Zhexin Wang
Yun Gu
Guangyao Yang
39
1
0
25 Feb 2022
Visual Attention Network
Visual Attention Network
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViTVLM
143
680
0
20 Feb 2022
A Frustratingly Simple Approach for End-to-End Image Captioning
A Frustratingly Simple Approach for End-to-End Image Captioning
Ziyang Luo
Yadong Xi
Rongsheng Zhang
Jing Ma
VLMMLLM
79
16
0
30 Jan 2022
Learning Spatially-Adaptive Squeeze-Excitation Networks for Image
  Synthesis and Image Recognition
Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image Recognition
Jianghao Shen
Tianfu Wu
ViT
49
0
0
29 Dec 2021
Associative Adversarial Learning Based on Selective Attack
Associative Adversarial Learning Based on Selective Attack
Runqi Wang
Xiaoyue Duan
Baochang Zhang
Shenjun Xue
Wentao Zhu
David Doermann
G. Guo
AAML
74
0
0
28 Dec 2021
DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant
  Brain Image Segmentation
DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant Brain Image Segmentation
Dinh-Hieu Hoang
Gia-Han Diep
Minh-Triet Tran
Ngan T. H Le
65
8
0
27 Dec 2021
Attention-Based Sensor Fusion for Human Activity Recognition Using IMU
  Signals
Attention-Based Sensor Fusion for Human Activity Recognition Using IMU Signals
Wenjin Tao
Haodong Chen
Md Moniruzzaman
M. C. Leu
Zhaozheng Yi
Ruwen Qin
49
11
0
20 Dec 2021
Improving Face-Based Age Estimation with Attention-Based Dynamic Patch
  Fusion
Improving Face-Based Age Estimation with Attention-Based Dynamic Patch Fusion
Haoyi Wang
Victor Sanchez
Chang-Tsun Li
CVBM
53
31
0
19 Dec 2021
Event-guided Deblurring of Unknown Exposure Time Videos
Event-guided Deblurring of Unknown Exposure Time Videos
Taewoo Kim
Jungmin Lee
Lin Wang
Kuk-Jin Yoon
89
32
0
13 Dec 2021
Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Praveen Kandula
N. Rajagopalan.A.
120
0
0
12 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
89
3
0
10 Dec 2021
Rethinking the Two-Stage Framework for Grounded Situation Recognition
Rethinking the Two-Stage Framework for Grounded Situation Recognition
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Tat-Seng Chua
91
31
0
10 Dec 2021
Classification-Then-Grounding: Reformulating Video Scene Graphs as
  Temporal Bipartite Graphs
Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Kaifeng Gao
Long Chen
Yulei Niu
Jian Shao
Jun Xiao
68
29
0
08 Dec 2021
ADD: Frequency Attention and Multi-View based Knowledge Distillation to
  Detect Low-Quality Compressed Deepfake Images
ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images
B. Le
Simon S. Woo
AAML
92
83
0
07 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
95
47
0
29 Nov 2021
TDAM: Top-Down Attention Module for Contextually Guided Feature
  Selection in CNNs
TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs
Shantanu Jaiswal
Basura Fernando
Cheston Tan
ViT
71
16
0
26 Nov 2021
ClipCap: CLIP Prefix for Image Captioning
ClipCap: CLIP Prefix for Image Captioning
Ron Mokady
Amir Hertz
Amit H. Bermano
CLIPVLM
81
684
0
18 Nov 2021
Image-specific Convolutional Kernel Modulation for Single Image
  Super-resolution
Image-specific Convolutional Kernel Modulation for Single Image Super-resolution
Yuanfei Huang
Jie Li
Yanting Hu
Xinbo Gao
Huan Huang
SupR
71
0
0
16 Nov 2021
Attention Mechanisms in Computer Vision: A Survey
Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
144
1,745
0
15 Nov 2021
Co-segmentation Inspired Attention Module for Video-based Computer
  Vision Tasks
Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks
Arulkumar Subramaniam
Jayesh Vaidya
Muhammed Ameen
Athira M. Nambiar
Anurag Mittal
71
7
0
14 Nov 2021
Local Multi-Head Channel Self-Attention for Facial Expression
  Recognition
Local Multi-Head Channel Self-Attention for Facial Expression Recognition
Roberto Pecoraro
Valerio Basile
Viviana Bono
Sara Gallo
ViT
139
52
0
14 Nov 2021
Explaining Face Presentation Attack Detection Using Natural Language
Explaining Face Presentation Attack Detection Using Natural Language
H. Mirzaalian
Mohamed E. Hussein
L. Spinoulas
Jonathan May
Wael AbdAlmageed
CVBMFAttAAML
66
5
0
08 Nov 2021
Dense Prediction with Attentive Feature Aggregation
Dense Prediction with Attentive Feature Aggregation
Yung-Hsu Yang
Thomas E. Huang
Min Sun
Samuel Rota Buló
Peter Kontschieder
Feng Yu
99
7
0
01 Nov 2021
ST-ABN: Visual Explanation Taking into Account Spatio-temporal
  Information for Video Recognition
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
54
1
0
29 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with
  Recurrent Layer Aggregation
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer Aggregation
Jingyu Zhao
Yanwen Fang
Guodong Li
69
24
0
22 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption
Topic Scene Graph Generation by Attention Distillation from Caption
Wenbin Wang
R. Wang
X. Chen
DiffM
94
14
0
12 Oct 2021
Recurrent Attention Models with Object-centric Capsule Representation
  for Multi-object Recognition
Recurrent Attention Models with Object-centric Capsule Representation for Multi-object Recognition
Hossein Adeli
Seoyoung Ahn
G. Zelinsky
OCL
58
3
0
11 Oct 2021
Counterfactual Samples Synthesizing and Training for Robust Visual
  Question Answering
Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
Long Chen
Yuhang Zheng
Yulei Niu
Hanwang Zhang
Jun Xiao
AAMLOOD
119
37
0
03 Oct 2021
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Ling Cheng
Wei Wei
Feida Zhu
Yong Liu
Chunyan Miao
ViT
47
3
0
29 Sep 2021
Contrastive Video-Language Segmentation
Contrastive Video-Language Segmentation
Chen Liang
Yawei Luo
Yu Wu
Yi Yang
VLMVOS
110
1
0
29 Sep 2021
Multi-Level Visual Similarity Based Personalized Tourist Attraction
  Recommendation Using Geo-Tagged Photos
Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos
Ling Chen
Dandan Lyu
Shanshan Yu
Gencai Chen
66
10
0
17 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for
  Image Captioning
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
Shikha Dubey
Farrukh Olimov
M. Rafique
Joonmo Kim
M. Jeon
ViT
84
43
0
16 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning
  approach
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach
Faisal Muhammad Shah
Mayeesha Humaira
Md Abidur Rahman Khan Jim
Amit Saha Ami
Shimul Paul
55
19
0
11 Sep 2021
Previous
123456789
Next