Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1603.01417
Cited By
Dynamic Memory Networks for Visual and Textual Question Answering
4 March 2016
Caiming Xiong
Stephen Merity
R. Socher
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dynamic Memory Networks for Visual and Textual Question Answering"
50 / 113 papers shown
Title
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
1
0
17 Apr 2025
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation
Qiming Bao
A. Peng
Tim Hartill
N. Tan
Zhenyun Deng
Michael Witbrock
Jiamou Liu
ReLM
OOD
NAI
LRM
37
13
0
28 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
27
1
0
20 Jul 2022
Seeing the forest and the tree: Building representations of both individual and collective dynamics with transformers
Ran Liu
Mehdi Azabou
M. Dabagia
Jingyun Xiao
Eva L. Dyer
AI4CE
32
19
0
10 Jun 2022
From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Jingkuan Song
Pengpeng Zeng
Lianli Gao
Heng Tao Shen
32
62
0
04 Jun 2022
Memory-enriched computation and learning in spiking neural networks through Hebbian plasticity
Thomas Limbacher
Ozan Özdenizci
Robert Legenstein
21
2
0
23 May 2022
Learning to Answer Visual Questions from Web Videos
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
ViT
37
33
0
10 May 2022
Attention Mechanism based Cognition-level Scene Understanding
Xuejiao Tang
Tai Le Quy
LRM
30
0
0
17 Apr 2022
MeMOT: Multi-Object Tracking with Memory
Jiarui Cai
Mingze Xu
Wei Li
Yuanjun Xiong
Wei Xia
Zhuowen Tu
Stefano Soatto
VOT
36
148
0
31 Mar 2022
SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering
Peixi Xiong
Quanzeng You
Pei Yu
Zicheng Liu
Ying Wu
24
5
0
25 Jan 2022
Memory-Guided Semantic Learning Network for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Xing Di
Yu Cheng
Zichuan Xu
Pan Zhou
33
58
0
03 Jan 2022
Zero-Shot Open-Book Question Answering
Sia Gholami
M. Noori
RALM
16
10
0
22 Nov 2021
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation
Fenglin Liu
Chenyu You
Xian Wu
Shen Ge
Sheng Wang
Xu Sun
MedIm
81
92
0
08 Nov 2021
A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis
Jiawei Liu
Kaisong Song
Yangyang Kang
Guoxiu He
Zhuoren Jiang
Changlong Sun
Wei Lu
Xiaozhong Liu
37
7
0
17 Sep 2021
Knowledge-based Embodied Question Answering
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
30
20
0
16 Sep 2021
Progressively Guide to Attend: An Iterative Alignment Framework for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Pan Zhou
18
46
0
14 Sep 2021
DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering
Jianyu Wang
Bingkun Bao
Changsheng Xu
19
75
0
10 Jul 2021
PEN4Rec: Preference Evolution Networks for Session-based Recommendation
Dou Hu
Lingwei Wei
Wei Zhou
X. Huai
Zhiqi Fang
Songlin Hu
19
4
0
17 Jun 2021
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
58
816
0
14 Jun 2021
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
167
100
0
29 Apr 2021
Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension
Preslav Nakov
Zhi Cui
Jiayi Zhang
Chen Wei
Jianwei Cui
Bin Wang
Dongyan Zhao
Rui Yan
25
14
0
14 Dec 2020
Dual ResGCN for Balanced Scene GraphGeneration
Jingyi Zhang
Yong Zhang
Baoyuan Wu
Yanbo Fan
Fumin Shen
Heng Tao Shen
28
12
0
09 Nov 2020
Learning to Respond with Your Favorite Stickers: A Framework of Unifying Multi-Modality and User Preference in Multi-Turn Dialog
Shen Gao
Preslav Nakov
Li Liu
Dongyan Zhao
Rui Yan
24
14
0
05 Nov 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering
Aisha Urooj Khan
Amir Mazaheri
N. Lobo
M. Shah
32
56
0
27 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Wei Chen
Weiping Wang
Li Liu
M. Lew
VLM
118
31
0
16 Oct 2020
VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles
Li Mingzhe
Preslav Nakov
Shen Gao
Zhangming Chan
Dongyan Zhao
Rui Yan
33
82
0
12 Oct 2020
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo Qin
Wanxiang Che
Yangming Li
Minheng Ni
Ting Liu
12
93
0
16 Aug 2020
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge
Peng Wang
Dongyang Liu
Hui Li
Qi Wu
ObjD
24
19
0
02 Jun 2020
Visual Relationship Detection using Scene Graphs: A Survey
Aniket Agarwal
Ayush Mangal
Vipul
GNN
25
20
0
16 May 2020
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Jie Lei
Liwei Wang
Yelong Shen
Dong Yu
Tamara L. Berg
Joey Tianyi Zhou
27
186
0
11 May 2020
Memorizing Comprehensively to Learn Adaptively: Unsupervised Cross-Domain Person Re-ID with Multi-level Memory
Xinyu Zhang
Dong Gong
Jiewei Cao
Chunhua Shen
32
6
0
13 Jan 2020
Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA
Badri N. Patro
Anupriy
Vinay P. Namboodiri
AAML
FAtt
48
26
0
19 Nov 2019
TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines
Jingxiang Lin
Unnat Jain
A. Schwing
LRM
ReLM
34
9
0
31 Oct 2019
Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection
Shumin Deng
Ningyu Zhang
Jiaojian Kang
Yichi Zhang
Wei Zhang
Huajun Chen
31
131
0
25 Oct 2019
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Wang
Hongzhi Li
29
38
0
11 Oct 2019
Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking
Jianguo Zhang
Kazuma Hashimoto
Chien-Sheng Wu
Yao Wan
Philip S. Yu
R. Socher
Caiming Xiong
50
167
0
08 Oct 2019
Multi-sense Definition Modeling using Word Sense Decompositions
Ruimin Zhu
Thanapon Noraset
Alisa Liu
Wenxin Jiang
Doug Downey
9
9
0
19 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question Answering
Hongyang Xue
Wenqing Chu
Zhou Zhao
Deng Cai
25
33
0
05 Sep 2019
Memorizing All for Implicit Discourse Relation Recognition
Hongxiao Bai
Hai Zhao
Junhan Zhao
19
10
0
29 Aug 2019
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Canmiao Fu
Wenjie Pei
Qiong Cao
Chaopeng Zhang
Yong Zhao
Xiaoyong Shen
Yu-Wing Tai
21
11
0
26 Aug 2019
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
Badri N. Patro
Mayank Lunayach
Shivansh Patel
Vinay P. Namboodiri
FAtt
UQCV
27
76
0
17 Aug 2019
A Road-map Towards Explainable Question Answering A Solution for Information Pollution
Zhilin Yang
William W. Cohen
19
0
0
04 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
8
75
0
30 Jun 2019
Adversarial Mahalanobis Distance-based Attentive Song Recommender for Automatic Playlist Continuation
Thanh-Binh Tran
Renee Sweeney
Kyumin Lee
36
32
0
08 Jun 2019
EKT: Exercise-aware Knowledge Tracing for Student Performance Prediction
Qi Liu
Zhenya Huang
Yu Yin
Enhong Chen
Hui Xiong
Yu Su
Guoping Hu
AI4Ed
19
379
0
07 Jun 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
24
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
27
69
0
11 Apr 2019
Episodic Memory Reader: Learning What to Remember for Question Answering from Streaming Data
Moonsu Han
Minki Kang
Hyunwoo Jung
Sung Ju Hwang
RALM
27
19
0
14 Mar 2019
Pedestrian Attribute Recognition: A Survey
Tianlin Li
Shaofei Zheng
Rui Yang
Aihua Zheng
Zhe Chen
Jin Tang
Bin Luo
CVBM
28
127
0
22 Jan 2019
1
2
3
Next