1605.09782
Adversarial Feature Learning
31 May 2016
Jeff Donahue
Philipp Krähenbühl
Trevor Darrell
GAN
Papers citing
"Adversarial Feature Learning"
50 / 642 papers shown
Title
RGB-D Salient Object Detection: A Survey
Tao Zhou
Deng-Ping Fan
Ming-Ming Cheng
Jianbing Shen
Ling Shao
40
248
0
01 Aug 2020
REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering
Siwen Luo
S. Han
Kaiyuan Sun
Josiah Poon
CoGe
LRM
ReLM
26
4
0
27 Jul 2020
Spatially Aware Multimodal Transformers for TextVQA
Yash Kant
Dhruv Batra
Peter Anderson
A. Schwing
Devi Parikh
Jiasen Lu
Harsh Agrawal
22
85
0
23 Jul 2020
Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering
Ruixue Tang
Chao Ma
W. Zhang
Qi Wu
Xiaokang Yang
OOD
31
48
0
19 Jul 2020
Kronecker Attention Networks
Hongyang Gao
Zhengyang Wang
Shuiwang Ji
24
33
0
16 Jul 2020
Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation
Guolei Sun
Wenguan Wang
Jifeng Dai
Luc Van Gool
21
308
0
03 Jul 2020
The Impact of Explanations on AI Competency Prediction in VQA
Kamran Alipour
Arijit Ray
Xiaoyu Lin
J. Schulze
Yi Yao
Giedrius Burachas
27
9
0
02 Jul 2020
Modality-Agnostic Attention Fusion for visual search with text feedback
Eric Dodds
Jack Culpepper
Simão Herdade
Yang Zhang
K. Boakye
EgoV
18
71
0
30 Jun 2020
Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering
C. Sur
10
9
0
25 Jun 2020
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
Zihao Zhu
Jiahao Yu
Yujing Wang
Yajing Sun
Yue Hu
Qi Wu
30
125
0
16 Jun 2020
ORD: Object Relationship Discovery for Visual Dialogue Generation
Ziwei Wang
Zi Huang
Yadan Luo
Huimin Lu
19
4
0
15 Jun 2020
Large-Scale Adversarial Training for Vision-and-Language Representation Learning
Zhe Gan
Yen-Chun Chen
Linjie Li
Chen Zhu
Yu Cheng
Jingjing Liu
ObjD
VLM
35
488
0
11 Jun 2020
Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Lluís Gómez
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Marçal Rusiñol
Ernest Valveny
Dimosthenis Karatzas
8
20
0
01 Jun 2020
Visual Interest Prediction with Attentive Multi-Task Transfer Learning
Deepanway Ghosal
M. Kolekar
29
1
0
26 May 2020
Visual Relationship Detection using Scene Graphs: A Survey
Aniket Agarwal
Ayush Mangal
Vipul
GNN
25
20
0
16 May 2020
Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA
Hyounghun Kim
Zineng Tang
Joey Tianyi Zhou
33
31
0
13 May 2020
Modeling Human Visual Search Performance on Realistic Webpages Using Analytical and Deep Learning Methods
Arianna Yuan
Yong Li
HAI
25
24
0
07 May 2020
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Arjun Majumdar
Ayush Shrivastava
Stefan Lee
Peter Anderson
Devi Parikh
Dhruv Batra
LM&Ro
47
230
0
30 Apr 2020
Towards Persona-Based Empathetic Conversational Models
Peixiang Zhong
Chen Zhang
Hao Wang
Yong Liu
Steven C. H. Hoi
27
103
0
26 Apr 2020
GCAN: Graph-aware Co-Attention Networks for Explainable Fake News Detection on Social Media
Yi-Ju Lu
Cheng-Te Li
GNN
11
388
0
24 Apr 2020
Are we pretraining it right? Digging deeper into visio-linguistic pretraining
Amanpreet Singh
Vedanuj Goswami
Devi Parikh
VLM
40
48
0
19 Apr 2020
An Entropy Clustering Approach for Assessing Visual Question Difficulty
K. Terao
Toru Tamaki
B. Raytchev
K. Kaneda
Shin'ichi Satoh
OOD
AAML
26
1
0
12 Apr 2020
Attentive One-Dimensional Heatmap Regression for Facial Landmark Detection and Tracking
Shi Yin
Shangfei Wang
Xiaoping Chen
Enhong Chen
CVBM
23
16
0
05 Apr 2020
Consistent Multiple Sequence Decoding
Bicheng Xu
Leonid Sigal
34
0
0
02 Apr 2020
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
Difei Gao
Ke Li
Ruiping Wang
Shiguang Shan
Xilin Chen
16
111
0
31 Mar 2020
Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
İlker Kesen
Ozan Arkan Can
Erkut Erdem
Aykut Erdem
Deniz Yuret
VLM
10
1
0
28 Mar 2020
VIOLIN: A Large-Scale Dataset for Video-and-Language Inference
J. Liu
Wenhu Chen
Yu Cheng
Zhe Gan
Licheng Yu
Yiming Yang
Jingjing Liu
MLLM
VGen
43
68
0
25 Mar 2020
Linguistically Driven Graph Capsule Network for Visual Question Reasoning
Qingxing Cao
Xiaodan Liang
Keze Wang
Liang Lin
GNN
18
3
0
23 Mar 2020
Visual Question Answering for Cultural Heritage
P. Bongini
Federico Becattini
Andrew D. Bagdanov
A. Bimbo
238
22
0
22 Mar 2020
Causal Interpretability for Machine Learning -- Problems, Methods and Evaluation
Raha Moraffah
Mansooreh Karami
Ruocheng Guo
A. Raglin
Huan Liu
CML
ELM
XAI
27
213
0
09 Mar 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching
Tianlang Chen
Jiajun Deng
Jiebo Luo
181
68
0
07 Mar 2020
A Study on Multimodal and Interactive Explanations for Visual Question Answering
Kamran Alipour
J. Schulze
Yi Yao
Avi Ziskind
Giedrius Burachas
32
27
0
01 Mar 2020
Stroke Constrained Attention Network for Online Handwritten Mathematical Expression Recognition
Jiaming Wang
Jun Du
Jianshu Zhang
27
24
0
20 Feb 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Tianlang Chen
Jiebo Luo
16
69
0
20 Feb 2020
CQ-VQA: Visual Question Answering on Categorized Questions
Aakansha Mishra
A. Anand
Prithwijit Guha
33
6
0
17 Feb 2020
Sparse and Structured Visual Attention
Pedro Henrique Martins
S. Becker
Zita Marinho
Michael Arens
32
8
0
13 Feb 2020
Component Analysis for Visual Question Answering Architectures
Camila Kolling
Jonatas Wehrmann
Rodrigo C. Barros
CoGe
15
2
0
12 Feb 2020
Simultaneous Enhancement and Super-Resolution of Underwater Imagery for Improved Visual Perception
M. Islam
Peigen Luo
Junaed Sattar
23
186
0
04 Feb 2020
Augmenting Visual Question Answering with Semantic Frame Information in a Multitask Learning Approach
Mehrdad Alizadeh
Barbara Maria Di Eugenio
11
3
0
31 Jan 2020
Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings
Mennatullah Siam
Naren Doraiswamy
Boris N. Oreshkin
Hengshuai Yao
Martin Jägersand
29
8
0
26 Jan 2020
Multimodal Data Fusion based on the Global Workspace Theory
C. Bao
Z. Fountas
Temitayo A. Olugbade
N. Bianchi-Berthouze
30
7
0
26 Jan 2020
Uncertainty based Class Activation Maps for Visual Question Answering
Badri N. Patro
Mayank Lunayach
Vinay P. Namboodiri
FAtt
UQCV
11
1
0
23 Jan 2020
Robust Explanations for Visual Question Answering
Badri N. Patro
Shivansh Pate
Vinay P. Namboodiri
OOD
AAML
25
20
0
23 Jan 2020
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
M. Farazi
Salman H. Khan
Nick Barnes
23
17
0
20 Jan 2020
See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks
Xiankai Lu
Wenguan Wang
Chao Ma
Jianbing Shen
Ling Shao
Fatih Porikli
VOS
19
461
0
19 Jan 2020
Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks
Wenguan Wang
Xiankai Lu
Jianbing Shen
David J. Crandall
Ling Shao
VOS
18
271
0
19 Jan 2020
Modality-Balanced Models for Visual Dialogue
Hyounghun Kim
Hao Tan
Joey Tianyi Zhou
30
27
0
17 Jan 2020
Multi-step Joint-Modality Attention Network for Scene-Aware Dialogue System
Yun-Wei Chu
Kuan-Yen Lin
Chao-Chun Hsu
Lun-Wei Ku
24
22
0
17 Jan 2020
Visual Question Answering on 360° Images
Shih-Han Chou
Wei-Lun Chao
Wei-Sheng Lai
Min Sun
Ming-Hsuan Yang
22
21
0
10 Jan 2020
Explainable Deep Relational Networks for Predicting Compound-Protein Affinities and Contacts
Mostafa Karimi
Di Wu
Zhangyang Wang
Yang Shen
27
46
0
29 Dec 2019