Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
v1
v2 (latest)
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,349 papers shown
Title
Learning from Implicit Information in Natural Language Instructions for Robotic Manipulations
Ozan Arkan Can
Pedro Zuidberg Dos Martires
Andreas Persson
Julian Gaal
Amy Loutfi
Luc de Raedt
Deniz Yuret
A. Saffiotti
LM&Ro
38
4
0
30 Apr 2019
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
Jiayuan Mao
Chuang Gan
Pushmeet Kohli
J. Tenenbaum
Jiajun Wu
NAI
171
706
0
26 Apr 2019
Inner-Imaging Networks: Put Lenses into Convolutional Structure
Yang Hu
Guihua Wen
Mingnan Luo
Dan Dai
Wenming Cao
Zhiwen Yu
Wendy Hall
33
2
0
22 Apr 2019
Compositional generalization in a deep seq2seq model by separating syntax and semantics
Jacob Russin
Jason Jo
R. C. O'Reilly
Yoshua Bengio
97
103
0
22 Apr 2019
Attentive Single-Tasking of Multiple Tasks
Kevis-Kokitsi Maninis
Ilija Radosavovic
Iasonas Kokkinos
204
251
0
18 Apr 2019
Question Guided Modular Routing Networks for Visual Question Answering
Yanze Wu
Qiang Sun
Jianqi Ma
Bin Li
Yanwei Fu
Yao Peng
Xiangyang Xue
69
1
0
17 Apr 2019
Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders
Adrien Bitton
P. Esling
Antoine Caillon
Martin Fouilleul
73
10
0
12 Apr 2019
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
88
254
0
11 Apr 2019
TAFE-Net: Task-Aware Feature Embeddings for Low Shot Learning
Xin Wang
Feng Yu
Ruth Wang
Trevor Darrell
Joseph E. Gonzalez
83
105
0
11 Apr 2019
Gating Mechanisms for Combining Character and Word-level Word Representations: An Empirical Study
Jorge A. Balazs
Y. Matsuo
AI4CE
NAI
49
3
0
11 Apr 2019
3D LiDAR and Stereo Fusion using Stereo Matching Network with Conditional Cost Volume Normalization
Tsun-Hsuan Wang
Hou-Ning Hu
Chieh Hubert Lin
Yi-Hsuan Tsai
Wei-Chen Chiu
Min Sun
3DV
87
37
0
05 Apr 2019
Neural Models of the Psychosemantics of `Most'
Lewis O'Sullivan
Shane Steinert-Threlkeld
28
1
0
04 Apr 2019
A Learned Representation for Scalable Vector Graphics
Raphael Gontijo-Lopes
David R Ha
Douglas Eck
Jonathon Shlens
GAN
OCL
76
118
0
04 Apr 2019
Stacked Semantic-Guided Network for Zero-Shot Sketch-Based Image Retrieval
Hao Wang
Cheng Deng
Xinxu Xu
Wen Liu
Xinbo Gao
Dacheng Tao
18
2
0
03 Apr 2019
C2AE: Class Conditioned Auto-Encoder for Open-set Recognition
Poojan Oza
Vishal M. Patel
124
328
0
02 Apr 2019
Dance with Flow: Two-in-One Stream Action Detection
Jiaojiao Zhao
Cees G. M. Snoek
112
84
0
01 Apr 2019
Disentangled Representation Learning in Cardiac Image Analysis
A. Chartsias
T. Joyce
G. Papanastasiou
M. Williams
D. Newby
R. Dharmakumar
Sotirios A. Tsaftaris
DRL
140
128
0
22 Mar 2019
Bilinear Representation for Language-based Image Editing Using Conditional Generative Adversarial Networks
Xiaofeng Mao
YueFeng Chen
Yuhong Li
T. Xiong
Yuan He
Hui Xue
GAN
81
21
0
18 Mar 2019
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
Chi Zhang
Feng Gao
Baoxiong Jia
Yixin Zhu
Song-Chun Zhu
AIMat
74
312
0
07 Mar 2019
Learning To Follow Directions in Street View
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
79
69
0
01 Mar 2019
Answer Them All! Toward Universal Visual Question Answering Models
Robik Shrestha
Kushal Kafle
Christopher Kanan
88
83
0
01 Mar 2019
From Visual to Acoustic Question Answering
Jerome Abdelnour
G. Salvi
Jean Rouat
70
3
0
28 Feb 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering
Rémi Cadène
H. Ben-younes
Matthieu Cord
Nicolas Thome
LRM
88
277
0
25 Feb 2019
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering
Ramakrishna Vedantam
Karan Desai
Stefan Lee
Marcus Rohrbach
Dhruv Batra
Devi Parikh
NAI
BDL
97
87
0
21 Feb 2019
From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following
Justin Fu
Anoop Korattikara Balan
Sergey Levine
S. Guadarrama
OffRL
LM&Ro
83
125
0
20 Feb 2019
Cycle-Consistency for Robust Visual Question Answering
Meet Shah
Xinlei Chen
Marcus Rohrbach
Devi Parikh
OOD
85
190
0
15 Feb 2019
Embodied Multimodal Multitask Learning
Devendra Singh Chaplot
Lisa Lee
Ruslan Salakhutdinov
Devi Parikh
Dhruv Batra
LM&Ro
96
24
0
04 Feb 2019
Parameter-Efficient Transfer Learning for NLP
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
240
4,553
0
02 Feb 2019
Pixelated Semantic Colorization
Jiaojiao Zhao
Jiawei Han
Ling Shao
Cees G. M. Snoek
135
80
0
27 Jan 2019
Spatial Broadcast Decoder: A Simple Architecture for Learning Disentangled Representations in VAEs
Nicholas Watters
Loic Matthey
Christopher P. Burgess
Alexander Lerchner
CoGe
111
169
0
21 Jan 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
127
327
0
20 Jan 2019
Variation Network: Learning High-level Attributes for Controlled Input Manipulation
Gaëtan Hadjeres
Frank Nielsen
55
2
0
11 Jan 2019
Robust Change Captioning
Dong Huk Park
Trevor Darrell
Anna Rohrbach
46
5
0
08 Jan 2019
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions
Runtao Liu
Chenxi Liu
Yutong Bai
Alan Yuille
NAI
ObjD
135
123
0
03 Jan 2019
Plugin Networks for Inference under Partial Evidence
Michal Koperski
Tomasz Konopczynski
Rafał Nowak
Piotr Semberecki
Tomasz Trzciñski
51
7
0
02 Jan 2019
The meaning of "most" for visual question answering models
A. Kuhnle
Ann A. Copestake
38
4
0
31 Dec 2018
Slimmable Neural Networks
Jiahui Yu
L. Yang
N. Xu
Jianchao Yang
Thomas Huang
95
559
0
21 Dec 2018
Toward Multimodal Model-Agnostic Meta-Learning
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
95
32
0
18 Dec 2018
Composing Text and Image for Image Retrieval - An Empirical Odyssey
Nam S. Vo
Lu Jiang
Chen Sun
Kevin Patrick Murphy
Li Li
Li Fei-Fei
James Hays
CoGe
76
370
0
18 Dec 2018
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
69
33
0
17 Dec 2018
Gold Seeker: Information Gain from Policy Distributions for Goal-oriented Vision-and-Langauge Reasoning
Ehsan Abbasnejad
Iman Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton Van Den Hengel
OffRL
87
5
0
16 Dec 2018
Spatial Knowledge Distillation to aid Visual Reasoning
Somak Aditya
Rudra Saha
Yezhou Yang
Chitta Baral
72
15
0
10 Dec 2018
Meta-Transfer Learning for Few-Shot Learning
Qianru Sun
Yaoyao Liu
Tat-Seng Chua
Bernt Schiele
227
1,077
0
06 Dec 2018
Explainable and Explicit Visual Reasoning over Scene Graphs
Jiaxin Shi
Hanwang Zhang
Juan-Zi Li
OCL
212
235
0
05 Dec 2018
Cross-Modulation Networks for Few-Shot Learning
Hugo Prol
Vincent Dumoulin
Luis Herranz
71
15
0
01 Dec 2018
CLEAR: A Dataset for Compositional Language and Elementary Acoustic Reasoning
Jerome Abdelnour
G. Salvi
Jean Rouat
45
14
0
26 Nov 2018
Deep Network Interpolation for Continuous Imagery Effect Transition
Xintao Wang
K. Yu
Chao Dong
Xiaoou Tang
Chen Change Loy
SupR
109
106
0
26 Nov 2018
Spatially Controllable Image Synthesis with Internal Representation Collaging
Ryohei Suzuki
Masanori Koyama
Takeru Miyato
Taizan Yonetsuji
Huachun Zhu
76
41
0
26 Nov 2018
Efficient Video Understanding via Layered Multi Frame-Rate Analysis
Ziyao Tang
Y. Lu
T. Javidi
24
0
0
24 Nov 2018
Adjustable Real-time Style Transfer
Mohammad Babaeizadeh
Golnaz Ghiasi
OOD
50
21
0
21 Nov 2018
Previous
1
2
3
...
25
26
27
Next