arXiv: 1709.07871
FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
Harm de Vries
Vincent Dumoulin
Aaron Courville
FAtt, AIMat, OffRL, AI4CE

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,349 papers shown
Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes
Jake C. Snell
R. Zemel
111
63
0
20 Jul 2020
Discrete Point Flow Networks for Efficient Point Cloud Generation
Roman Klokov
Edmond Boyer
Jakob Verbeek
3DPC
92
107
0
20 Jul 2020
XingGAN for Person Image Generation
Hao Tang
S. Bai
Li Zhang
Philip Torr
N. Sebe
GAN
75
173
0
17 Jul 2020
Contextualizing Enhances Gradient Based Meta Learning
Evan Vogelbaum
Rumen Dangovski
L. Jing
Marin Soljacic
126
3
0
17 Jul 2020
RGB-D Salient Object Detection with Cross-Modality Modulation and Selection
Chongyi Li
Runmin Cong
Yongri Piao
Qianqian Xu
Chen Change Loy
84
134
0
14 Jul 2020
Visual Tracking by TridentAlign and Context Embedding
Janghoon Choi
Junseok Kwon
Kyoung Mu Lee
70
16
0
14 Jul 2020
Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder
K. Gouthaman
Anurag Mittal
98
79
0
13 Jul 2020
ID-Conditioned Auto-Encoder for Unsupervised Anomaly Detection
Slawomir Kapka
CML
148
30
0
10 Jul 2020
Meta Learning for Causal Direction
Jean-François Ton
Dino Sejdinovic
Kenji Fukumizu
CML, OOD
68
18
0
06 Jul 2020
A Competence-aware Curriculum for Visual Concepts Learning via Question Answering
Qing Li
Siyuan Huang
Yining Hong
Song-Chun Zhu
119
29
0
03 Jul 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
43
6
0
01 Jul 2020
Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering
Ben Bogin
Sanjay Subramanian
Matt Gardner
Jonathan Berant
ReLM, OOD, BDL, LRM
57
28
0
01 Jul 2020
Modality-Agnostic Attention Fusion for visual search with text feedback
Eric Dodds
Jack Culpepper
Simão Herdade
Yang Zhang
K. Boakye
EgoV
100
74
0
30 Jun 2020
Hybrid Models for Learning to Branch
Prateek Gupta
Maxime Gasse
Elias Boutros Khalil
M. P. Kumar
Andrea Lodi
Yoshua Bengio
GNN
193
127
0
26 Jun 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
70
21
0
25 Jun 2020
Hyperparameter Ensembles for Robustness and Uncertainty Quantification
F. Wenzel
Jasper Snoek
Dustin Tran
Rodolphe Jenatton
UQCV
117
212
0
24 Jun 2020
Crossmodal Language Grounding in an Embodied Neurocognitive Model
Stefan Heinrich
Yuan Yao
Tobias Hinz
Zhiyuan Liu
Thomas Hummel
Matthias Kerzel
C. Weber
S. Wermter
LM&Ro
67
19
0
24 Jun 2020
Flexible Image Denoising with Multi-layer Conditional Feature Modulation
Jiazhi Du
Xin Qiao
Zifei Yan
Hongzhi Zhang
W. Zuo
119
2
0
24 Jun 2020
Face-to-Music Translation Using a Distance-Preserving Generative Adversarial Network with an Auxiliary Discriminator
Chelhwon Kim
Andrew Allan Port
Mitesh Patel
CVBM
34
1
0
24 Jun 2020
A Universal Representation Transformer Layer for Few-Shot Image Classification
Lu Liu
William L. Hamilton
Guodong Long
Jing Jiang
Hugo Larochelle
ViT
105
127
0
21 Jun 2020
Compositional Learning of Image-Text Query for Image Retrieval
Muhammad Umer Anwaar
Egor Labintcev
M. Kleinsteuber
CoGe
111
96
0
19 Jun 2020
Tent: Fully Test-time Adaptation by Entropy Minimization
Dequan Wang
Evan Shelhamer
Shaoteng Liu
Bruno A. Olshausen
Trevor Darrell
OOD
140
53
0
18 Jun 2020
Language Guided Networks for Cross-modal Moment Retrieval
Kun Liu
Huadong Ma
Chuang Gan
30
2
0
18 Jun 2020
Enhancing Few-Shot Image Classification with Unlabelled Examples
Peyman Bateni
Jarred Barber
Jan-Willem van de Meent
Frank Wood
VLM, SSL
132
57
0
17 Jun 2020
Structure by Architecture: Structured Representations without Regularization
Felix Leeb
Giulia Lanzillotta
Yashas Annadani
M. Besserve
Stefan Bauer
Bernhard Schölkopf
OOD, CML
94
8
0
14 Jun 2020
GAN Memory with No Forgetting
Yulai Cong
Miaoyun Zhao
Jianqiao Li
Sijia Wang
Lawrence Carin
CLL
80
124
0
13 Jun 2020
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
Jeongun Ryu
Jaewoong Shin
Haebeom Lee
Sung Ju Hwang
AAML, OOD
50
8
0
13 Jun 2020
Attentive Feature Reuse for Multi Task Meta learning
Kiran Lekkala
Laurent Itti
OOD
71
7
0
12 Jun 2020
Gaussian Gated Linear Networks
David Budden
Adam H. Marblestone
Eren Sezener
Tor Lattimore
Greg Wayne
J. Veness
BDL, AI4CE
67
12
0
10 Jun 2020
Deep generative models for musical audio synthesis
M. Huzaifah
L. Wyse
210
20
0
10 Jun 2020
Realistic text replacement with non-uniform style conditioning
Arseny Nerinovsky
Igor Buzhinsky
Andrey Filchenkov
31
0
0
07 Jun 2020
Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning
Zhibo Yang
Lihan Huang
Yupei Chen
Zijun Wei
Seoyoung Ahn
G. Zelinsky
Dimitris Samaras
Minh Hoai
81
98
0
28 May 2020
Long-Term Cloth-Changing Person Re-identification
Xuelin Qian
Wenxuan Wang
Li Zhang
Fangrui Zhu
Yanwei Fu
Tao Xiang
Yu-Gang Jiang
Xiangyang Xue
88
159
0
26 May 2020
Customized Graph Neural Networks
Yiqi Wang
Yao Ma
Wei Jin
Chaozhuo Li
Charu C. Aggarwal
Jiliang Tang
94
3
0
22 May 2020
Ambient Sound Helps: Audiovisual Crowd Counting in Extreme Conditions
Di Hu
Lichao Mou
Qingzhong Wang
Junyu Gao
Yuansheng Hua
Dejing Dou
Xiaoxiang Zhu
72
31
0
14 May 2020
Cross-Modality Relevance for Reasoning on Language and Vision
Chen Zheng
Quan Guo
Parisa Kordjamshidi
LRM
88
36
0
12 May 2020
Group Equivariant Generative Adversarial Networks
Neel Dey
Antong Chen
S. Ghafurian
GAN
94
33
0
04 May 2020
Dynamic Language Binding in Relational Visual Reasoning
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
NAI
71
19
0
30 Apr 2020
Unnatural Language Processing: Bridging the Gap Between Synthetic and Natural Language Data
Alana Marzoev
Samuel Madden
M. Kaashoek
Michael Cafarella
Jacob Andreas
SyDa
75
33
0
28 Apr 2020
MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond
Duy-Kien Nguyen
Vedanuj Goswami
Xinlei Chen
71
23
0
24 Apr 2020
Music Gesture for Visual Sound Separation
Chuang Gan
Deng Huang
Hang Zhao
J. Tenenbaum
Antonio Torralba
97
205
0
20 Apr 2020
Zero-Shot Compositional Policy Learning via Language Grounding
Tianshi Cao
Jingkang Wang
Yining Zhang
S. Manivasagam
LM&Ro
62
2
0
15 Apr 2020
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning
Kangning Liu
Shuhang Gu
Andrés Romero
Radu Timofte
51
9
0
14 Apr 2020
Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors
Mateusz Michalkiewicz
Sarah Parisot
Stavros Tsogkas
Mahsa Baktash
Anders P. Eriksson
Eugene Belilovsky
3DV
69
25
0
14 Apr 2020
SHOP-VRB: A Visual Reasoning Benchmark for Object Perception
Michal Nazarczuk
K. Mikolajczyk
72
21
0
06 Apr 2020
FusedProp: Towards Efficient Training of Generative Adversarial Networks
Zachary Polizzi
Chuan-Yung Tsai
GAN
22
1
0
30 Mar 2020
Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
İlker Kesen
Ozan Arkan Can
Erkut Erdem
Aykut Erdem
Deniz Yuret
VLM
53
1
0
28 Mar 2020
CurlingNet: Compositional Learning between Images and Text for Fashion IQ Data
Youngjae Yu
Seunghwan Lee
Yuncheol Choi
Gunhee Kim
CoGe
70
37
0
27 Mar 2020
TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes Applications
Kaiping Zheng
Shaofeng Cai
H. Chua
Wei Wang
K. Ngiam
Beng Chin Ooi
AI4TS
65
26
0
24 Mar 2020
Linguistically Driven Graph Capsule Network for Visual Question Reasoning
Qingxing Cao
Xiaodan Liang
Keze Wang
Liang Lin
GNN
47
3
0
23 Mar 2020