ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,315 papers shown
Title
Cross-Modality Relevance for Reasoning on Language and Vision
Cross-Modality Relevance for Reasoning on Language and Vision
Chen Zheng
Quan Guo
Parisa Kordjamshidi
LRM
46
36
0
12 May 2020
Group Equivariant Generative Adversarial Networks
Group Equivariant Generative Adversarial Networks
Neel Dey
Antong Chen
S. Ghafurian
GAN
30
33
0
04 May 2020
Dynamic Language Binding in Relational Visual Reasoning
Dynamic Language Binding in Relational Visual Reasoning
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
NAI
26
19
0
30 Apr 2020
Unnatural Language Processing: Bridging the Gap Between Synthetic and
  Natural Language Data
Unnatural Language Processing: Bridging the Gap Between Synthetic and Natural Language Data
Alana Marzoev
Samuel Madden
M. Kaashoek
Michael Cafarella
Jacob Andreas
SyDa
27
32
0
28 Apr 2020
MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond
MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond
Duy-Kien Nguyen
Vedanuj Goswami
Xinlei Chen
39
23
0
24 Apr 2020
Music Gesture for Visual Sound Separation
Music Gesture for Visual Sound Separation
Chuang Gan
Deng Huang
Hang Zhao
J. Tenenbaum
Antonio Torralba
33
201
0
20 Apr 2020
Zero-Shot Compositional Policy Learning via Language Grounding
Zero-Shot Compositional Policy Learning via Language Grounding
Tianshi Cao
Jingkang Wang
Yining Zhang
S. Manivasagam
LM&Ro
34
1
0
15 Apr 2020
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised
  Learning
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning
Kangning Liu
Shuhang Gu
Andrés Romero
Radu Timofte
17
9
0
14 Apr 2020
Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors
Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors
Mateusz Michalkiewicz
Sarah Parisot
Stavros Tsogkas
Mahsa Baktash
Anders P. Eriksson
Eugene Belilovsky
3DV
27
25
0
14 Apr 2020
SHOP-VRB: A Visual Reasoning Benchmark for Object Perception
SHOP-VRB: A Visual Reasoning Benchmark for Object Perception
Michal Nazarczuk
K. Mikolajczyk
28
21
0
06 Apr 2020
FusedProp: Towards Efficient Training of Generative Adversarial Networks
FusedProp: Towards Efficient Training of Generative Adversarial Networks
Zachary Polizzi
Chuan-Yung Tsai
GAN
22
1
0
30 Mar 2020
Modulating Bottom-Up and Top-Down Visual Processing via
  Language-Conditional Filters
Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
.Ilker Kesen
Ozan Arkan Can
Erkut Erdem
Aykut Erdem
Deniz Yuret
VLM
16
1
0
28 Mar 2020
CurlingNet: Compositional Learning between Images and Text for Fashion
  IQ Data
CurlingNet: Compositional Learning between Images and Text for Fashion IQ Data
Youngjae Yu
Seunghwan Lee
Yuncheol Choi
Gunhee Kim
CoGe
19
37
0
27 Mar 2020
TRACER: A Framework for Facilitating Accurate and Interpretable
  Analytics for High Stakes Applications
TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes Applications
Kaiping Zheng
Shaofeng Cai
H. Chua
Wei Wang
K. Ngiam
Beng Chin Ooi
AI4TS
21
26
0
24 Mar 2020
Linguistically Driven Graph Capsule Network for Visual Question
  Reasoning
Linguistically Driven Graph Capsule Network for Visual Question Reasoning
Qingxing Cao
Xiaodan Liang
Keze Wang
Liang Lin
GNN
18
3
0
23 Mar 2020
Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using
  a View-based Representation
Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation
Sai Rajeswar
Fahim Mannan
Florian Golemo
Jérôme Parent-Lévesque
David Vazquez
Derek Nowrouzezahrai
Aaron Courville
3DV
42
11
0
23 Mar 2020
Selecting Relevant Features from a Multi-domain Representation for
  Few-shot Classification
Selecting Relevant Features from a Multi-domain Representation for Few-shot Classification
Nikita Dvornik
Cordelia Schmid
Julien Mairal
VLM
178
24
0
20 Mar 2020
XtarNet: Learning to Extract Task-Adaptive Representation for
  Incremental Few-Shot Learning
XtarNet: Learning to Extract Task-Adaptive Representation for Incremental Few-Shot Learning
Sung Whan Yoon
Do-Yeon Kim
Jun Seo
Jaekyun Moon
CLL
186
46
0
19 Mar 2020
Neural Pose Transfer by Spatially Adaptive Instance Normalization
Neural Pose Transfer by Spatially Adaptive Instance Normalization
Jiashun Wang
Chao Wen
Yanwei Fu
Haitao Lin
Tianyun Zou
Xiangyang Xue
Yinda Zhang
3DH
22
57
0
16 Mar 2020
Conditional Convolutions for Instance Segmentation
Conditional Convolutions for Instance Segmentation
Zhi Tian
Chunhua Shen
Hao Chen
ISeg
196
599
0
12 Mar 2020
Automatic segmentation of spinal multiple sclerosis lesions: How to
  generalize across MRI contrasts?
Automatic segmentation of spinal multiple sclerosis lesions: How to generalize across MRI contrasts?
Olivier Vincent
C. Gros
Joseph Paul Cohen
Julien Cohen-Adad
27
5
0
09 Mar 2020
Learning to Shadow Hand-drawn Sketches
Learning to Shadow Hand-drawn Sketches
Qingyuan Zheng
Zhuoru Li
Adam W. Bargteil
GAN
3DH
3DV
22
26
0
26 Feb 2020
Hierarchical Conditional Relation Networks for Video Question Answering
Hierarchical Conditional Relation Networks for Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
14
258
0
25 Feb 2020
Mnemonics Training: Multi-Class Incremental Learning without Forgetting
Mnemonics Training: Multi-Class Incremental Learning without Forgetting
Yaoyao Liu
Yuting Su
Anan Liu
Bernt Schiele
Qianru Sun
CLL
28
337
0
24 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
27
261
0
20 Feb 2020
Contextual Lensing of Universal Sentence Representations
Contextual Lensing of Universal Sentence Representations
J. Kiros
15
5
0
20 Feb 2020
BatchEnsemble: An Alternative Approach to Efficient Ensemble and
  Lifelong Learning
BatchEnsemble: An Alternative Approach to Efficient Ensemble and Lifelong Learning
Yeming Wen
Dustin Tran
Jimmy Ba
OOD
FedML
UQCV
32
483
0
17 Feb 2020
Meta-learning framework with applications to zero-shot time-series
  forecasting
Meta-learning framework with applications to zero-shot time-series forecasting
Boris N. Oreshkin
Dmitri Carpov
Nicolas Chapados
Yoshua Bengio
UQCV
AI4TS
AI4CE
41
106
0
07 Feb 2020
The Costs and Benefits of Goal-Directed Attention in Deep Convolutional
  Neural Networks
The Costs and Benefits of Goal-Directed Attention in Deep Convolutional Neural Networks
Xiaoliang Luo
Brett D. Roads
Bradley C. Love
31
18
0
06 Feb 2020
ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
C. Qi
Xinlei Chen
Or Litany
Leonidas J. Guibas
3DPC
197
249
0
29 Jan 2020
Gesticulator: A framework for semantically-aware speech-driven gesture
  generation
Gesticulator: A framework for semantically-aware speech-driven gesture generation
Taras Kucherenko
Patrik Jonell
S. V. Waveren
G. Henter
Simon Alexanderson
Iolanda Leite
Hedvig Kjellström
SLR
21
179
0
25 Jan 2020
Compositional properties of emergent languages in deep learning
Compositional properties of emergent languages in deep learning
Bence Keresztury
Elia Bruni
CoGe
24
6
0
23 Jan 2020
Exploiting Language Instructions for Interpretable and Compositional
  Reinforcement Learning
Exploiting Language Instructions for Interpretable and Compositional Reinforcement Learning
Michiel van der Meer
Matteo Pirotta
Elia Bruni
27
1
0
13 Jan 2020
Fine-grained Image-to-Image Transformation towards Visual Recognition
Fine-grained Image-to-Image Transformation towards Visual Recognition
Wei Xiong
Yutong He
Yixuan Zhang
Wenhan Luo
Lin Ma
Jiebo Luo
CVBM
35
26
0
12 Jan 2020
In Defense of Grid Features for Visual Question Answering
In Defense of Grid Features for Visual Question Answering
Huaizu Jiang
Ishan Misra
Marcus Rohrbach
Erik Learned-Miller
Xinlei Chen
OOD
ObjD
23
318
0
10 Jan 2020
Generalizing Emergent Communication
Generalizing Emergent Communication
Thomas A. Unger
Elia Bruni
20
1
0
06 Jan 2020
Automated Relational Meta-learning
Automated Relational Meta-learning
Huaxiu Yao
Xian Wu
Zhiqiang Tao
Yaliang Li
Bolin Ding
Ruirui Li
Z. Li
41
89
0
03 Jan 2020
Side-Tuning: A Baseline for Network Adaptation via Additive Side
  Networks
Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks
Jeffrey O. Zhang
Alexander Sax
Amir Zamir
Leonidas J. Guibas
Jitendra Malik
36
28
0
31 Dec 2019
Reward-Conditioned Policies
Reward-Conditioned Policies
Aviral Kumar
Xue Bin Peng
Sergey Levine
22
92
0
31 Dec 2019
Asymmetric GAN for Unpaired Image-to-image Translation
Asymmetric GAN for Unpaired Image-to-image Translation
Yu Li
Sheng Tang
Rui Zhang
Yongdong Zhang
Jintao Li
Shuicheng Yan
GAN
24
62
0
25 Dec 2019
Smart Home Appliances: Chat with Your Fridge
Smart Home Appliances: Chat with Your Fridge
Denis A. Gudovskiy
Gyuri Han
Takuya Yamaguchi
Sotaro Tsukizawa
LRM
13
3
0
19 Dec 2019
ManiGAN: Text-Guided Image Manipulation
ManiGAN: Text-Guided Image Manipulation
Bowen Li
Xiaojuan Qi
Thomas Lukasiewicz
Philip Torr
EGVM
61
284
0
12 Dec 2019
CLOSURE: Assessing Systematic Generalization of CLEVR Models
CLOSURE: Assessing Systematic Generalization of CLEVR Models
Dzmitry Bahdanau
H. D. Vries
Timothy J. O'Donnell
Shikhar Murty
Philippe Beaudoin
Yoshua Bengio
Aaron Courville
NAI
15
90
0
12 Dec 2019
Learning to Request Guidance in Emergent Communication
Learning to Request Guidance in Emergent Communication
Benjamin Kolb
Leon Lang
H. Bartsch
Arwin Gansekoele
Raymond Koopmanschap
Leonardo Romor
David Speck
Mathijs Mul
Elia Bruni
27
0
0
11 Dec 2019
Improved Few-Shot Visual Classification
Improved Few-Shot Visual Classification
Peyman Bateni
Raghav Goyal
Vaden Masrani
Frank Wood
Leonid Sigal
VLM
22
228
0
07 Dec 2019
Weak Supervision helps Emergence of Word-Object Alignment and improves
  Vision-Language Tasks
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
21
15
0
06 Dec 2019
Learning to synthesise the ageing brain without longitudinal data
Learning to synthesise the ageing brain without longitudinal data
Tian Xia
A. Chartsias
Chengjia Wang
Sotirios A. Tsaftaris
OOD
MedIm
17
53
0
04 Dec 2019
Deep Object Co-segmentation via Spatial-Semantic Network Modulation
Deep Object Co-segmentation via Spatial-Semantic Network Modulation
Kaihua Zhang
Jin Chen
Bo Liu
Qingshan Liu
30
40
0
29 Nov 2019
Transfer Learning in Visual and Relational Reasoning
Transfer Learning in Visual and Relational Reasoning
T. S. Jayram
Vincent Marois
Tomasz Kornuta
V. Albouy
Emre Sevgen
A. Ozcan
NAI
OOD
LRM
19
2
0
27 Nov 2019
Temporal Reasoning via Audio Question Answering
Temporal Reasoning via Audio Question Answering
Haytham M. Fayek
Justin Johnson
30
51
0
21 Nov 2019
Previous
123...222324252627
Next