Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,315 papers shown
Title
Working Memory Graphs
Ricky Loynd
Roland Fernandez
Asli Celikyilmaz
Adith Swaminathan
Matthew J. Hausknecht
19
40
0
17 Nov 2019
Attention on Abstract Visual Reasoning
Lukas Hahne
Timo Lüddecke
F. Worgotter
David Kappel
GNN
25
23
0
14 Nov 2019
Disentangle, align and fuse for multimodal and semi-supervised image segmentation
A. Chartsias
G. Papanastasiou
Chengjia Wang
S. Semple
D. Newby
R. Dharmakumar
Sotirios A. Tsaftaris
24
13
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
35
324
0
10 Nov 2019
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
30
219
0
30 Oct 2019
Extending Event Detection to New Types with Learning from Keywords
Viet Dac Lai
Thien Huu Nguyen
ObjD
NAI
24
23
0
24 Oct 2019
Guided Image-to-Image Translation with Bi-Directional Feature Transformation
Badour Albahar
Jia-Bin Huang
36
92
0
24 Oct 2019
HIGhER : Improving instruction following with Hindsight Generation for Experience Replay
Geoffrey Cideron
Mathieu Seurin
Florian Strub
Olivier Pietquin
32
37
0
21 Oct 2019
RTFM: Generalising to Novel Environment Dynamics via Reading
Victor Zhong
Tim Rocktaschel
Edward Grefenstette
LLMAG
OffRL
AI4CE
19
54
0
18 Oct 2019
Audio-Conditioned U-Net for Position Estimation in Full Sheet Images
Florian Henkel
Rainer Kelz
Gerhard Widmer
27
4
0
16 Oct 2019
Meta Module Network for Compositional Visual Reasoning
Wenhu Chen
Zhe Gan
Linjie Li
Yu Cheng
Wei Wang
Jingjing Liu
LRM
25
68
0
08 Oct 2019
Meta-Transfer Learning through Hard Tasks
Qianru Sun
Yaoyao Liu
Zhaozheng Chen
Tat-Seng Chua
Bernt Schiele
14
98
0
07 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
Kexin Yi
Yuta Saito
Yunzhu Li
Pushmeet Kohli
Jiajun Wu
Antonio Torralba
J. Tenenbaum
NAI
43
457
0
03 Oct 2019
CoPhy: Counterfactual Learning of Physical Dynamics
Fabien Baradel
Natalia Neverova
J. Mille
Greg Mori
Christian Wolf
CML
AI4CE
25
97
0
26 Sep 2019
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation
Arna Ghosh
Richard Y. Zhang
P. Dokania
Oliver Wang
Alexei A. Efros
Philip Torr
Eli Shechtman
VLM
DiffM
24
130
0
24 Sep 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
13
0
23 Sep 2019
Meta-Neighborhoods
Siyuan Shan
Yang Li
Junier Oliva
27
13
0
18 Sep 2019
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations
Sawyer Birnbaum
Volodymyr Kuleshov
S. Enam
Pang Wei Koh
Stefano Ermon
AI4TS
24
68
0
14 Sep 2019
Hierarchical Scene Coordinate Classification and Regression for Visual Localization
Xiaotian Li
Shuzhe Wang
Yi Zhao
Jakob Verbeek
Arno Solin
82
127
0
13 Sep 2019
Finding Generalizable Evidence by Learning to Convince Q&A Models
Ethan Perez
Siddharth Karamcheti
Rob Fergus
Jason Weston
Douwe Kiela
Kyunghyun Cho
RALM
33
37
0
12 Sep 2019
Domain-Agnostic Few-Shot Classification by Learning Disparate Modulators
Yongseok Choi
Junyoung Park
Subin Yi
D.-Y. Cho
OOD
21
0
0
11 Sep 2019
Relationships from Entity Stream
Martin Andrews
Sam Witteveen
AI4TS
GNN
16
0
0
07 Sep 2019
Supervised Multimodal Bitransformers for Classifying Images and Text
Douwe Kiela
Suvrat Bhooshan
Hamed Firooz
Ethan Perez
Davide Testuggine
59
242
0
06 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
Yuchen Lu
Steven Bocco
Max O. Smith
Satya Ortiz-Gagné
Jonathan K. Kummerfeld
Satinder Singh
Joelle Pineau
Aaron Courville
33
57
0
04 Sep 2019
Meta-Learning with Warped Gradient Descent
Sebastian Flennerhag
Andrei A. Rusu
Razvan Pascanu
Francesco Visin
Hujun Yin
R. Hadsell
8
209
0
30 Aug 2019
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts
Sandro Pezzelle
Raquel Fernández
VLM
17
18
0
27 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Joey Tianyi Zhou
VLM
MLLM
96
2,456
0
20 Aug 2019
Probabilistic Reconstruction Networks for 3D Shape Inference from a Single Image
Roman Klokov
Jakob Verbeek
Edmond Boyer
3DV
33
15
0
20 Aug 2019
What is needed for simple spatial language capabilities in VQA?
A. Kuhnle
Ann A. Copestake
CoGe
23
1
0
17 Aug 2019
PHYRE: A New Benchmark for Physical Reasoning
A. Bakhtin
L. V. D. van der Maaten
Justin Johnson
Laura Gustafson
Ross B. Girshick
LRM
24
122
0
15 Aug 2019
Mastering emergent language: learning to guide in simulated navigation
Mathijs Mul
Diane Bouchacourt
Elia Bruni
LLMAG
27
9
0
14 Aug 2019
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering
Cătălina Cangea
Eugene Belilovsky
Pietro Lio
Aaron Courville
16
16
0
14 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
27
38
0
12 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
25
82
0
10 Aug 2019
Dynamic Scale Inference by Entropy Minimization
Dequan Wang
Evan Shelhamer
Bruno A. Olshausen
Trevor Darrell
27
7
0
08 Aug 2019
Answering Questions about Data Visualizations using Efficient Bimodal Fusion
Kushal Kafle
Robik Shrestha
Brian L. Price
Scott D. Cohen
Christopher Kanan
25
58
0
05 Aug 2019
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation
Vincent Michalski
Vikram S. Voleti
Samira Ebrahimi Kahou
Anthony Ortiz
Pascal Vincent
C. Pal
Doina Precup
BDL
22
6
0
31 Jul 2019
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Guan-Lin Chao
Abhinav Rastogi
Semih Yavuz
Dilek Z. Hakkani-Tür
Jindong Chen
Ian Lane
16
6
0
31 Jul 2019
Segmenting Objects in Day and Night:Edge-Conditioned CNN for Thermal Image Semantic Segmentation
Chenglong Li
W. Xia
Yan Yan
Bin Luo
Jin Tang
22
119
0
24 Jul 2019
Metalearned Neural Memory
Tsendsuren Munkhdalai
Alessandro Sordoni
Tong Wang
Adam Trischler
KELM
17
60
0
23 Jul 2019
Switchable Normalization for Learning-to-Normalize Deep Representation
Ping Luo
Ruimao Zhang
Jiamin Ren
Zhanglin Peng
Jingyu Li
30
73
0
22 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
25
132
0
22 Jul 2019
Neural Drum Machine : An Interactive System for Real-time Synthesis of Drum Sounds
Cyran Aouameur
P. Esling
Gaëtan Hadjeres
21
21
0
04 Jul 2019
Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations
Gabriel Meseguer-Brocal
Geoffroy Peeters
19
61
0
02 Jul 2019
GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation
Marc Brockschmidt
28
134
0
28 Jun 2019
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders
Yin-Jyun Luo
Kat R. Agres
Dorien Herremans
22
46
0
19 Jun 2019
Fast and Flexible Multi-Task Classification Using Conditional Neural Adaptive Processes
James Requeima
Jonathan Gordon
J. Bronskill
Sebastian Nowozin
Richard Turner
24
241
0
18 Jun 2019
Language as an Abstraction for Hierarchical Deep Reinforcement Learning
Yiding Jiang
S. Gu
Kevin Patrick Murphy
Chelsea Finn
OffRL
20
223
0
18 Jun 2019
Task-Aware Feature Generation for Zero-Shot Compositional Learning
Xin Wang
Feng Yu
Trevor Darrell
Joseph E. Gonzalez
VLM
CoGe
21
16
0
11 Jun 2019
Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering
Claudio Greco
Barbara Plank
Raquel Fernández
Raffaella Bernardi
CLL
KELM
17
48
0
10 Jun 2019
Previous
1
2
3
...
23
24
25
26
27
Next