ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,315 papers shown
Title
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech
  Recognition in Multi-party Meetings
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings
Mohan Shi
Jie Zhang
Zhihao Du
Fan Yu
Qian Chen
Shiliang Zhang
Lirong Dai
51
4
0
01 Nov 2022
Modelling black-box audio effects with time-varying feature modulation
Modelling black-box audio effects with time-varying feature modulation
Marco Comunità
C. Steinmetz
Huy Phan
Joshua D. Reiss
52
14
0
01 Nov 2022
Adapting self-supervised models to multi-talker speech recognition using
  speaker embeddings
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Zili Huang
Desh Raj
Leibny Paola García-Perera
Sanjeev Khudanpur
101
24
0
01 Nov 2022
Rethinking Generalization: The Impact of Annotation Style on Medical
  Image Segmentation
Rethinking Generalization: The Impact of Annotation Style on Medical Image Segmentation
Brennan Nichyporuk
Jillian Cardinell
Justin Szeto
Raghav Mehta
Jean-Pierre Falet
Douglas L. Arnold
Sotirios A. Tsaftaris
Tal Arbel
24
7
0
31 Oct 2022
ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images
ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images
Matthew D. Hoffman
T. Le
Pavel Sountsov
Christopher Suter
Ben Lee
Vikash K. Mansinghka
Rif A. Saurous
BDL
41
12
0
27 Oct 2022
Deep Generative Models on 3D Representations: A Survey
Deep Generative Models on 3D Representations: A Survey
Zifan Shi
Sida Peng
Yinghao Xu
Andreas Geiger
Yiyi Liao
Yujun Shen
MedIm
3DV
52
0
0
27 Oct 2022
Language Control Diffusion: Efficiently Scaling through Space, Time, and
  Tasks
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
Edwin Zhang
Yujie Lu
William Wang
Amy Zhang
DiffM
LM&Ro
32
16
0
27 Oct 2022
CasNet: Investigating Channel Robustness for Speech Separation
CasNet: Investigating Channel Robustness for Speech Separation
Fan Wang
Yao-Fei Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
31
2
0
27 Oct 2022
Solving Audio Inverse Problems with a Diffusion Model
Solving Audio Inverse Problems with a Diffusion Model
Eloi Moliner
J. Lehtinen
Vesa Valimaki
DiffM
36
50
0
27 Oct 2022
Instruction-Following Agents with Multimodal Transformer
Instruction-Following Agents with Multimodal Transformer
Hao Liu
Lisa Lee
Kimin Lee
Pieter Abbeel
LM&Ro
48
10
0
24 Oct 2022
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker
  Embeddings for Target Speaker Separation
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation
Xiaoyu Liu
Xu Li
Joan Serrà
44
9
0
23 Oct 2022
PaCo: Parameter-Compositional Multi-Task Reinforcement Learning
PaCo: Parameter-Compositional Multi-Task Reinforcement Learning
Lingfeng Sun
Haichao Zhang
Wei Xu
Masayoshi Tomizuka
MoE
32
37
0
21 Oct 2022
Hypernetworks in Meta-Reinforcement Learning
Hypernetworks in Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Shimon Whiteson
OffRL
33
31
0
20 Oct 2022
ARAH: Animatable Volume Rendering of Articulated Human SDFs
ARAH: Animatable Volume Rendering of Articulated Human SDFs
Shaofei Wang
Katja Schwarz
Andreas Geiger
Siyu Tang
3DH
44
130
0
18 Oct 2022
CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging
  Top-Down Feedback
CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging Top-Down Feedback
Alexia Jolicoeur-Martineau
Alex Lamb
Vikas Verma
Aniket Didolkar
NoLa
18
0
0
18 Oct 2022
Meta-Learning via Classifier(-free) Diffusion Guidance
Meta-Learning via Classifier(-free) Diffusion Guidance
Elvis Nava
Seijin Kobayashi
Yifei Yin
Robert K. Katzschmann
Benjamin Grewe
VLM
27
6
0
17 Oct 2022
Scaling & Shifting Your Features: A New Baseline for Efficient Model
  Tuning
Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning
Dongze Lian
Daquan Zhou
Jiashi Feng
Xinchao Wang
38
250
0
17 Oct 2022
Neural Attentive Circuits
Neural Attentive Circuits
Nasim Rahaman
M. Weiß
Francesco Locatello
C. Pal
Yoshua Bengio
Bernhard Schölkopf
Erran L. Li
Nicolas Ballas
37
6
0
14 Oct 2022
SQA3D: Situated Question Answering in 3D Scenes
SQA3D: Situated Question Answering in 3D Scenes
Xiaojian Ma
Silong Yong
Zilong Zheng
Qing Li
Yitao Liang
Song-Chun Zhu
Siyuan Huang
LM&Ro
40
134
0
14 Oct 2022
Few-Shot Visual Question Generation: A Novel Task and Benchmark Datasets
Few-Shot Visual Question Generation: A Novel Task and Benchmark Datasets
Anurag Roy
David Johnson Ekka
Saptarshi Ghosh
Abir Das
23
1
0
13 Oct 2022
Individualized Conditioning and Negative Distances for Speaker
  Separation
Individualized Conditioning and Negative Distances for Speaker Separation
Tao Sun
Nidal Abuhajar
Shuyu Gong
Zhewei Wang
Charles D. Smith
Xianhui Wang
Li Xu
Jundong Liu
VLM
34
1
0
12 Oct 2022
Toward Sustainable Continual Learning: Detection and Knowledge
  Repurposing of Similar Tasks
Toward Sustainable Continual Learning: Detection and Knowledge Repurposing of Similar Tasks
Sijia Wang
Yoojin Choi
Junya Chen
Mostafa El-Khamy
Ricardo Henao
CLL
30
0
0
11 Oct 2022
Revisiting adapters with adversarial training
Revisiting adapters with adversarial training
Sylvestre-Alvise Rebuffi
Francesco Croce
Sven Gowal
AAML
41
16
0
10 Oct 2022
Using Both Demonstrations and Language Instructions to Efficiently Learn
  Robotic Tasks
Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks
Albert Yu
Raymond J. Mooney
LM&Ro
32
19
0
10 Oct 2022
Sequential Ensembling for Semantic Segmentation
Sequential Ensembling for Semantic Segmentation
Rawal Khirodkar
Brandon A. Smith
Siddhartha Chandra
Amit Agrawal
A. Criminisi
45
2
0
08 Oct 2022
Enhancing Interpretability and Interactivity in Robot Manipulation: A
  Neurosymbolic Approach
Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach
Georgios Tziafas
Hamidreza Kasaei
LM&Ro
20
3
0
03 Oct 2022
Towards Multi-spatiotemporal-scale Generalized PDE Modeling
Towards Multi-spatiotemporal-scale Generalized PDE Modeling
Jayesh K. Gupta
Johannes Brandstetter
AI4CE
65
120
0
30 Sep 2022
Continuous PDE Dynamics Forecasting with Implicit Neural Representations
Continuous PDE Dynamics Forecasting with Implicit Neural Representations
Yuan Yin
Matthieu Kirchmeyer
Jean-Yves Franceschi
A. Rakotomamonjy
Patrick Gallinari
AI4CE
35
49
0
29 Sep 2022
Attention Beats Concatenation for Conditioning Neural Fields
Attention Beats Concatenation for Conditioning Neural Fields
Daniel Rebain
Mark J. Matthews
K. M. Yi
Gopal Sharma
Dmitry Lagun
Andrea Tagliasacchi
AI4CE
48
22
0
21 Sep 2022
Explainable AI for clinical and remote health applications: a survey on
  tabular and time series data
Explainable AI for clinical and remote health applications: a survey on tabular and time series data
Flavio Di Martino
Franca Delmastro
AI4TS
33
91
0
14 Sep 2022
A Universally-Deployable ASR Frontend for Joint Acoustic Echo
  Cancellation, Speech Enhancement, and Voice Separation
A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Tom O'Malley
A. Narayanan
Quan Wang
27
5
0
14 Sep 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
175
468
0
12 Sep 2022
Instruction-driven history-aware policies for robotic manipulations
Instruction-driven history-aware policies for robotic manipulations
Pierre-Louis Guhur
Shizhe Chen
Ricardo Garcia Pinel
Makarand Tapaswi
Ivan Laptev
Cordelia Schmid
LM&Ro
113
102
0
11 Sep 2022
Ask Before You Act: Generalising to Novel Environments by Asking
  Questions
Ask Before You Act: Generalising to Novel Environments by Asking Questions
Ross Murphy
S. Mosesov
Javier Leguina Peral
Thymo ter Doest
LRM
32
0
0
10 Sep 2022
Scalable Adversarial Online Continual Learning
Scalable Adversarial Online Continual Learning
T. Dam
Mahardhika Pratama
Md Meftahul Ferdaus
S. Anavatti
Hussein Abbas
CLL
41
3
0
04 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Bin Cui
Ming-Hsuan Yang
DiffM
MedIm
226
1,320
0
02 Sep 2022
The Neural Process Family: Survey, Applications and Perspectives
The Neural Process Family: Survey, Applications and Perspectives
Saurav Jha
Dong Gong
Xuesong Wang
Richard Turner
L. Yao
BDL
87
24
0
01 Sep 2022
Light-weight probing of unsupervised representations for Reinforcement
  Learning
Light-weight probing of unsupervised representations for Reinforcement Learning
Wancong Zhang
Anthony GX-Chen
Vlad Sobal
Yann LeCun
Nicolas Carion
SSL
OffRL
46
13
0
25 Aug 2022
FS-BAN: Born-Again Networks for Domain Generalization Few-Shot
  Classification
FS-BAN: Born-Again Networks for Domain Generalization Few-Shot Classification
Yunqing Zhao
Ngai-man Cheung
BDL
27
12
0
23 Aug 2022
Neuro-Symbolic Visual Dialog
Neuro-Symbolic Visual Dialog
Adnen Abdessaied
Mihai Bâce
Andreas Bulling
NAI
21
3
0
22 Aug 2022
Structural Biases for Improving Transformers on Translation into
  Morphologically Rich Languages
Structural Biases for Improving Transformers on Translation into Morphologically Rich Languages
Paul Soulos
Sudha Rao
Caitlin Smith
Eric Rosen
Asli Celikyilmaz
...
Coleman Haley
Roland Fernandez
Hamid Palangi
Jianfeng Gao
P. Smolensky
34
6
0
11 Aug 2022
Counterfactual Image Synthesis for Discovery of Personalized Predictive
  Image Markers
Counterfactual Image Synthesis for Discovery of Personalized Predictive Image Markers
Amar Kumar
Anjun Hu
Brennan Nichyporuk
Jean-Pierre Falet
Douglas L. Arnold
Sotirios A. Tsaftaris
Tal Arbel
MedIm
32
10
0
03 Aug 2022
A New Probabilistic V-Net Model with Hierarchical Spatial Feature
  Transform for Efficient Abdominal Multi-Organ Segmentation
A New Probabilistic V-Net Model with Hierarchical Spatial Feature Transform for Efficient Abdominal Multi-Organ Segmentation
Minfeng Xu
Heng Guo
Jianfeng Zhang
K. Yan
Le Lu
32
5
0
02 Aug 2022
One for All: One-stage Referring Expression Comprehension with Dynamic
  Reasoning
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning
Zhipeng Zhang
Zhimin Wei
Zhongzhen Huang
Rui Niu
Peng Wang
ObjD
LRM
25
9
0
31 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
GAUDI: A Neural Architect for Immersive 3D Scene Generation
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa
3DGS
44
134
0
27 Jul 2022
Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for
  Editable Portrait Image Synthesis
Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis
Jeong-gi Kwak
Yuanming Li
Dongsik Yoon
Donghyeon Kim
D. Han
Hanseok Ko
26
25
0
21 Jul 2022
Style Transfer of Audio Effects with Differentiable Signal Processing
Style Transfer of Audio Effects with Differentiable Signal Processing
C. Steinmetz
Nicholas J. Bryan
Joshua D. Reiss
31
41
0
18 Jul 2022
Context-Consistent Semantic Image Editing with Style-Preserved
  Modulation
Context-Consistent Semantic Image Editing with Style-Preserved Modulation
Wuyang Luo
Su Yang
Hong Wang
Bo Long
Weishan Zhang
19
10
0
13 Jul 2022
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid
  Counterfactual Training for Robust Content-based Image Retrieval
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval
Wenqiao Zhang
Jiannan Guo
Meng Li
Haochen Shi
Shengyu Zhang
Juncheng Li
Siliang Tang
Yueting Zhuang
60
6
0
09 Jul 2022
L$_0$onie: Compressing COINs with L$_0$-constraints
L0_00​onie: Compressing COINs with L0_00​-constraints
Juan Ramirez
Jose Gallego-Posada
38
5
0
08 Jul 2022
Previous
123...141516...252627
Next