ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.01417
  4. Cited By
Dynamic Memory Networks for Visual and Textual Question Answering

Dynamic Memory Networks for Visual and Textual Question Answering

4 March 2016
Caiming Xiong
Stephen Merity
R. Socher
ArXivPDFHTML

Papers citing "Dynamic Memory Networks for Visual and Textual Question Answering"

50 / 113 papers shown
Title
Hadamard product in deep learning: Introduction, Advances and Challenges
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
1
0
17 Apr 2025
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation
Qiming Bao
A. Peng
Tim Hartill
N. Tan
Zhenyun Deng
Michael Witbrock
Jiamou Liu
ReLM
OOD
NAI
LRM
37
13
0
28 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
27
1
0
20 Jul 2022
Seeing the forest and the tree: Building representations of both
  individual and collective dynamics with transformers
Seeing the forest and the tree: Building representations of both individual and collective dynamics with transformers
Ran Liu
Mehdi Azabou
M. Dabagia
Jingyun Xiao
Eva L. Dyer
AI4CE
32
19
0
10 Jun 2022
From Pixels to Objects: Cubic Visual Attention for Visual Question
  Answering
From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Jingkuan Song
Pengpeng Zeng
Lianli Gao
Heng Tao Shen
32
62
0
04 Jun 2022
Memory-enriched computation and learning in spiking neural networks
  through Hebbian plasticity
Memory-enriched computation and learning in spiking neural networks through Hebbian plasticity
Thomas Limbacher
Ozan Özdenizci
Robert Legenstein
21
2
0
23 May 2022
Learning to Answer Visual Questions from Web Videos
Learning to Answer Visual Questions from Web Videos
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
ViT
37
33
0
10 May 2022
Attention Mechanism based Cognition-level Scene Understanding
Attention Mechanism based Cognition-level Scene Understanding
Xuejiao Tang
Tai Le Quy
LRM
30
0
0
17 Apr 2022
MeMOT: Multi-Object Tracking with Memory
MeMOT: Multi-Object Tracking with Memory
Jiarui Cai
Mingze Xu
Wei Li
Yuanjun Xiong
Wei Xia
Zhuowen Tu
Stefano Soatto
VOT
36
148
0
31 Mar 2022
SA-VQA: Structured Alignment of Visual and Semantic Representations for
  Visual Question Answering
SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering
Peixi Xiong
Quanzeng You
Pei Yu
Zicheng Liu
Ying Wu
24
5
0
25 Jan 2022
Memory-Guided Semantic Learning Network for Temporal Sentence Grounding
Memory-Guided Semantic Learning Network for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Xing Di
Yu Cheng
Zichuan Xu
Pan Zhou
33
58
0
03 Jan 2022
Zero-Shot Open-Book Question Answering
Zero-Shot Open-Book Question Answering
Sia Gholami
M. Noori
RALM
16
10
0
22 Nov 2021
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation
Fenglin Liu
Chenyu You
Xian Wu
Shen Ge
Sheng Wang
Xu Sun
MedIm
81
92
0
08 Nov 2021
A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff
  and Service Satisfaction Analysis
A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis
Jiawei Liu
Kaisong Song
Yangyang Kang
Guoxiu He
Zhuoren Jiang
Changlong Sun
Wei Lu
Xiaozhong Liu
37
7
0
17 Sep 2021
Knowledge-based Embodied Question Answering
Knowledge-based Embodied Question Answering
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
30
20
0
16 Sep 2021
Progressively Guide to Attend: An Iterative Alignment Framework for
  Temporal Sentence Grounding
Progressively Guide to Attend: An Iterative Alignment Framework for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Pan Zhou
18
46
0
14 Sep 2021
DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering
DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering
Jianyu Wang
Bingkun Bao
Changsheng Xu
19
75
0
10 Jul 2021
PEN4Rec: Preference Evolution Networks for Session-based Recommendation
PEN4Rec: Preference Evolution Networks for Session-based Recommendation
Dou Hu
Lingwei Wei
Wei Zhou
X. Huai
Zhiqi Fang
Songlin Hu
19
4
0
17 Jun 2021
Pre-Trained Models: Past, Present and Future
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
58
816
0
14 Jun 2021
Bridge to Answer: Structure-aware Graph Interaction Network for Video
  Question Answering
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
167
100
0
29 Apr 2021
Reasoning in Dialog: Improving Response Generation by Context Reading
  Comprehension
Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension
Preslav Nakov
Zhi Cui
Jiayi Zhang
Chen Wei
Jianwei Cui
Bin Wang
Dongyan Zhao
Rui Yan
25
14
0
14 Dec 2020
Dual ResGCN for Balanced Scene GraphGeneration
Dual ResGCN for Balanced Scene GraphGeneration
Jingyi Zhang
Yong Zhang
Baoyuan Wu
Yanbo Fan
Fumin Shen
Heng Tao Shen
28
12
0
09 Nov 2020
Learning to Respond with Your Favorite Stickers: A Framework of Unifying
  Multi-Modality and User Preference in Multi-Turn Dialog
Learning to Respond with Your Favorite Stickers: A Framework of Unifying Multi-Modality and User Preference in Multi-Turn Dialog
Shen Gao
Preslav Nakov
Li Liu
Dongyan Zhao
Rui Yan
24
14
0
05 Nov 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual
  Question Answering
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering
Aisha Urooj Khan
Amir Mazaheri
N. Lobo
M. Shah
32
56
0
27 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Wei Chen
Weiping Wang
Li Liu
M. Lew
VLM
118
31
0
16 Oct 2020
VMSMO: Learning to Generate Multimodal Summary for Video-based News
  Articles
VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles
Li Mingzhe
Preslav Nakov
Shen Gao
Zhangming Chan
Dongyan Zhao
Rui Yan
33
82
0
12 Oct 2020
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act
  Recognition and Sentiment Classification
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo Qin
Wanxiang Che
Yangming Li
Minheng Ni
Ting Liu
12
93
0
16 Aug 2020
Give Me Something to Eat: Referring Expression Comprehension with
  Commonsense Knowledge
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge
Peng Wang
Dongyang Liu
Hui Li
Qi Wu
ObjD
24
19
0
02 Jun 2020
Visual Relationship Detection using Scene Graphs: A Survey
Visual Relationship Detection using Scene Graphs: A Survey
Aniket Agarwal
Ayush Mangal
Vipul
GNN
25
20
0
16 May 2020
MART: Memory-Augmented Recurrent Transformer for Coherent Video
  Paragraph Captioning
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Jie Lei
Liwei Wang
Yelong Shen
Dong Yu
Tamara L. Berg
Joey Tianyi Zhou
27
186
0
11 May 2020
Memorizing Comprehensively to Learn Adaptively: Unsupervised
  Cross-Domain Person Re-ID with Multi-level Memory
Memorizing Comprehensively to Learn Adaptively: Unsupervised Cross-Domain Person Re-ID with Multi-level Memory
Xinyu Zhang
Dong Gong
Jiewei Cao
Chunhua Shen
32
6
0
13 Jan 2020
Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA
Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA
Badri N. Patro
Anupriy
Vinay P. Namboodiri
AAML
FAtt
48
26
0
19 Nov 2019
TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning
  Baselines
TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines
Jingxiang Lin
Unnat Jain
A. Schwing
LRM
ReLM
34
9
0
31 Oct 2019
Meta-Learning with Dynamic-Memory-Based Prototypical Network for
  Few-Shot Event Detection
Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection
Shumin Deng
Ningyu Zhang
Jiaojian Kang
Yichi Zhang
Wei Zhang
Huajun Chen
31
131
0
25 Oct 2019
Multi-modal Deep Analysis for Multimedia
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Wang
Hongzhi Li
29
38
0
11 Oct 2019
Find or Classify? Dual Strategy for Slot-Value Predictions on
  Multi-Domain Dialog State Tracking
Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking
Jianguo Zhang
Kazuma Hashimoto
Chien-Sheng Wu
Yao Wan
Philip S. Yu
R. Socher
Caiming Xiong
50
167
0
08 Oct 2019
Multi-sense Definition Modeling using Word Sense Decompositions
Multi-sense Definition Modeling using Word Sense Decompositions
Ruimin Zhu
Thanapon Noraset
Alisa Liu
Wenxin Jiang
Doug Downey
9
9
0
19 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question
  Answering
A Better Way to Attend: Attention with Trees for Video Question Answering
Hongyang Xue
Wenqing Chu
Zhou Zhao
Deng Cai
25
33
0
05 Sep 2019
Memorizing All for Implicit Discourse Relation Recognition
Memorizing All for Implicit Discourse Relation Recognition
Hongxiao Bai
Hai Zhao
Junhan Zhao
19
10
0
29 Aug 2019
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Canmiao Fu
Wenjie Pei
Qiong Cao
Chaopeng Zhang
Yong Zhao
Xiaoyong Shen
Yu-Wing Tai
21
11
0
26 Aug 2019
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
Badri N. Patro
Mayank Lunayach
Shivansh Patel
Vinay P. Namboodiri
FAtt
UQCV
27
76
0
17 Aug 2019
A Road-map Towards Explainable Question Answering A Solution for
  Information Pollution
A Road-map Towards Explainable Question Answering A Solution for Information Pollution
Zhilin Yang
William W. Cohen
19
0
0
04 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering
ICDAR 2019 Competition on Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
8
75
0
30 Jun 2019
Adversarial Mahalanobis Distance-based Attentive Song Recommender for
  Automatic Playlist Continuation
Adversarial Mahalanobis Distance-based Attentive Song Recommender for Automatic Playlist Continuation
Thanh-Binh Tran
Renee Sweeney
Kyumin Lee
36
32
0
08 Jun 2019
EKT: Exercise-aware Knowledge Tracing for Student Performance Prediction
EKT: Exercise-aware Knowledge Tracing for Student Performance Prediction
Qi Liu
Zhenya Huang
Yu Yin
Enhong Chen
Hui Xiong
Yu Su
Guoping Hu
AI4Ed
19
379
0
07 Jun 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
24
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
27
69
0
11 Apr 2019
Episodic Memory Reader: Learning What to Remember for Question Answering
  from Streaming Data
Episodic Memory Reader: Learning What to Remember for Question Answering from Streaming Data
Moonsu Han
Minki Kang
Hyunwoo Jung
Sung Ju Hwang
RALM
27
19
0
14 Mar 2019
Pedestrian Attribute Recognition: A Survey
Pedestrian Attribute Recognition: A Survey
Tianlin Li
Shaofei Zheng
Rui Yang
Aihua Zheng
Zhe Chen
Jin Tang
Bin Luo
CVBM
28
127
0
22 Jan 2019
123
Next