FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
Harm de Vries
Vincent Dumoulin
Aaron Courville
FAtt, AIMat, OffRL, AI4CE

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,349 papers shown
Title
Meta-Learning via Classifier(-free) Diffusion Guidance
Elvis Nava
Seijin Kobayashi
Yifei Yin
Robert K. Katzschmann
Benjamin Grewe
VLM
75
6
0
17 Oct 2022
Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning
Dongze Lian
Daquan Zhou
Jiashi Feng
Xinchao Wang
122
264
0
17 Oct 2022
Neural Attentive Circuits
Nasim Rahaman
M. Weiß
Francesco Locatello
C. Pal
Yoshua Bengio
Bernhard Schölkopf
Erran L. Li
Nicolas Ballas
124
7
0
14 Oct 2022
SQA3D: Situated Question Answering in 3D Scenes
Xiaojian Ma
Silong Yong
Zilong Zheng
Qing Li
Yitao Liang
Song-Chun Zhu
Siyuan Huang
LM&Ro
93
160
0
14 Oct 2022
Few-Shot Visual Question Generation: A Novel Task and Benchmark Datasets
Anurag Roy
David Johnson Ekka
Saptarshi Ghosh
Abir Das
62
1
0
13 Oct 2022
Individualized Conditioning and Negative Distances for Speaker Separation
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
61
1
0
12 Oct 2022
Toward Sustainable Continual Learning: Detection and Knowledge Repurposing of Similar Tasks
Sijia Wang
Yoojin Choi
Junya Chen
Mostafa El-Khamy
Ricardo Henao
CLL
64
0
0
11 Oct 2022
Revisiting adapters with adversarial training
Sylvestre-Alvise Rebuffi
Francesco Croce
Sven Gowal
AAML
62
17
0
10 Oct 2022
Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks
Albert Yu
Raymond J. Mooney
LM&Ro
74
20
0
10 Oct 2022
Sequential Ensembling for Semantic Segmentation
Rawal Khirodkar
Brandon A. Smith
Siddhartha Chandra
Amit Agrawal
A. Criminisi
69
2
0
08 Oct 2022
Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach
Georgios Tziafas
Hamidreza Kasaei
LM&Ro
94
3
0
03 Oct 2022
Towards Multi-spatiotemporal-scale Generalized PDE Modeling
Jayesh K. Gupta
Johannes Brandstetter
AI4CE
139
136
0
30 Sep 2022
Continuous PDE Dynamics Forecasting with Implicit Neural Representations
Yuan Yin
Matthieu Kirchmeyer
Jean-Yves Franceschi
A. Rakotomamonjy
Patrick Gallinari
AI4CE
94
53
0
29 Sep 2022
Attention Beats Concatenation for Conditioning Neural Fields
Daniel Rebain
Mark J. Matthews
K. M. Yi
Gopal Sharma
Dmitry Lagun
Andrea Tagliasacchi
AI4CE
90
23
0
21 Sep 2022
Explainable AI for clinical and remote health applications: a survey on tabular and time series data
Flavio Di Martino
Franca Delmastro
AI4TS
66
99
0
14 Sep 2022
A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Tom O'Malley
A. Narayanan
Quan Wang
54
5
0
14 Sep 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
304
501
0
12 Sep 2022
Instruction-driven history-aware policies for robotic manipulations
Pierre-Louis Guhur
Shizhe Chen
Ricardo Garcia Pinel
Makarand Tapaswi
Ivan Laptev
Cordelia Schmid
LM&Ro
195
109
0
11 Sep 2022
Ask Before You Act: Generalising to Novel Environments by Asking Questions
Ross Murphy
S. Mosesov
Javier Leguina Peral
Thymo ter Doest
LRM
63
0
0
10 Sep 2022
Scalable Adversarial Online Continual Learning
T. Dam
Mahardhika Pratama
Md Meftahul Ferdaus
S. Anavatti
Hussein Abbas
CLL
86
3
0
04 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffM, MedIm
533
1,428
0
02 Sep 2022
The Neural Process Family: Survey, Applications and Perspectives
Saurav Jha
Dong Gong
Xuesong Wang
Richard Turner
L. Yao
BDL
166
24
0
01 Sep 2022
Light-weight probing of unsupervised representations for Reinforcement Learning
Wancong Zhang
Anthony GX-Chen
Vlad Sobal
Yann LeCun
Nicolas Carion
SSL, OffRL
76
14
0
25 Aug 2022
FS-BAN: Born-Again Networks for Domain Generalization Few-Shot Classification
Yunqing Zhao
Ngai-Man Cheung
BDL
76
13
0
23 Aug 2022
Neuro-Symbolic Visual Dialog
Adnen Abdessaied
Mihai Bâce
Andreas Bulling
NAI
55
3
0
22 Aug 2022
Structural Biases for Improving Transformers on Translation into Morphologically Rich Languages
Paul Soulos
Sudha Rao
Caitlin Smith
Eric Rosen
Asli Celikyilmaz
...
Coleman Haley
Roland Fernandez
Hamid Palangi
Jianfeng Gao
P. Smolensky
77
7
0
11 Aug 2022
Counterfactual Image Synthesis for Discovery of Personalized Predictive Image Markers
Amar Kumar
Anjun Hu
Brennan Nichyporuk
Jean-Pierre Falet
Douglas L. Arnold
Sotirios A. Tsaftaris
Tal Arbel
MedIm
69
10
0
03 Aug 2022
A New Probabilistic V-Net Model with Hierarchical Spatial Feature Transform for Efficient Abdominal Multi-Organ Segmentation
Minfeng Xu
Heng Guo
Jianfeng Zhang
K. Yan
Le Lu
42
5
0
02 Aug 2022
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning
Zhipeng Zhang
Zhimin Wei
Zhongzhen Huang
Rui Niu
Peng Wang
ObjD, LRM
72
9
0
31 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa, 3DGS
96
139
0
27 Jul 2022
Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis
Jeong-gi Kwak
Yuanming Li
Dongsik Yoon
Donghyeon Kim
D. Han
Hanseok Ko
90
25
0
21 Jul 2022
Style Transfer of Audio Effects with Differentiable Signal Processing
C. Steinmetz
Nicholas J. Bryan
Joshua D. Reiss
69
46
0
18 Jul 2022
Context-Consistent Semantic Image Editing with Style-Preserved Modulation
Wuyang Luo
Su Yang
Hong Wang
Bo Long
Weishan Zhang
59
11
0
13 Jul 2022
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval
Wenqiao Zhang
Jiannan Guo
Meng Li
Haochen Shi
Shengyu Zhang
Juncheng Li
Siliang Tang
Yueting Zhuang
88
6
0
09 Jul 2022
L$_0$onie: Compressing COINs with L$_0$-constraints
Juan Ramirez
Jose Gallego-Posada
75
5
0
08 Jul 2022
End-to-End Binaural Speech Synthesis
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
65
9
0
08 Jul 2022
Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning
Kyra Ahrens
Matthias Kerzel
Jae Hee Lee
C. Weber
S. Wermter
73
0
0
06 Jul 2022
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Sam Spilsbury
Alexander Ilin
76
8
0
06 Jul 2022
Learning to Accelerate Approximate Methods for Solving Integer Programming via Early Fixing
Longkang Li
Baoyuan Wu
79
3
0
05 Jul 2022
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers
Liumeng Xue
Shan Yang
Na Hu
Jane Polak Scowcroft
Linfu Xie
51
2
0
02 Jul 2022
SD-LayerNet: Semi-supervised retinal layer segmentation in OCT using disentangled representation with anatomical priors
Botond Fazekas
Guilherme Aresta
Dmitrii Lachinov
Sophie Riedl
Julia Mai
U. Schmidt-Erfurth
Hrvoje Bogunović
31
13
0
01 Jul 2022
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Xu Li
Shansong Liu
Ying Shan
76
13
0
28 Jun 2022
ContraReg: Contrastive Learning of Multi-modality Unsupervised Deformable Image Registration
Neel Dey
Jo Schlemper
S. Salehi
Bo Zhou
Guido Gerig
M. Sofka
MedIm
90
17
0
27 Jun 2022
Task-Adaptive Few-shot Node Classification
Song Wang
Kaize Ding
Chuxu Zhang
Chen Chen
Jundong Li
OffRL
103
51
0
23 Jun 2022
Jointist: Joint Learning for Multi-instrument Transcription and Its Applications
K. Cheuk
Keunwoo Choi
Qiuqiang Kong
Bochen Li
Minz Won
Amy Hung
Ju-Chiang Wang
Dorien Herremans
95
7
0
22 Jun 2022
Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification
Massimiliano Patacchiola
J. Bronskill
Aliaksandra Shysheya
Katja Hofmann
Sebastian Nowozin
Richard Turner
VLM
72
10
0
20 Jun 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Thomas Carta
Pierre-Yves Oudeyer
Olivier Sigaud
Sylvain Lamprier
OffRL
103
26
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard Turner
3DH, FedML
95
28
0
17 Jun 2022
Channel Importance Matters in Few-Shot Image Classification
Xu Luo
Jing Xu
Zenglin Xu
VLM
92
42
0
16 Jun 2022
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
178
139
0
14 Jun 2022