ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,315 papers shown
Title
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language
  Models
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
Zheyuan Liu
Cristian Rodriguez-Opazo
Damien Teney
Stephen Gould
VLM
22
191
0
09 Aug 2021
FiLMing Multimodal Sarcasm Detection with Attention
FiLMing Multimodal Sarcasm Detection with Attention
Sundesh Gupta
Aditya Shah
Miten Shah
Laribok Syiemlieh
Chandresh Kumar Maurya
23
12
0
09 Aug 2021
A Unified Model for Zero-shot Music Source Separation, Transcription and
  Synthesis
A Unified Model for Zero-shot Music Source Separation, Transcription and Synthesis
Liwei Lin
Qiuqiang Kong
Junyan Jiang
Gus Xia
25
26
0
07 Aug 2021
Multimodal Meta-Learning for Time Series Regression
Multimodal Meta-Learning for Time Series Regression
Sebastian Pineda-Arango
Felix Heinrich
Kiran Madhusudhanan
Lars Schmidt-Thieme
AI4TS
20
15
0
05 Aug 2021
Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive
  Speech Synthesis
Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech Synthesis
Julian Zaïdi
Hugo Seuté
Benjamin van Niekerk
M. Carbonneau
34
20
0
04 Aug 2021
Learn to Match: Automatic Matching Network Design for Visual Tracking
Learn to Match: Automatic Matching Network Design for Visual Tracking
Zhipeng Zhang
Yihao Liu
Tianlin Li
Bing Li
Weiming Hu
38
167
0
02 Aug 2021
Adaptive Denoising via GainTuning
Adaptive Denoising via GainTuning
S. Mohan
Joshua L. Vincent
R. Manzorro
Peter A Crozier
Eero P. Simoncelli
C. Fernandez‐Granda
28
24
0
27 Jul 2021
Greedy Gradient Ensemble for Robust Visual Question Answering
Greedy Gradient Ensemble for Robust Visual Question Answering
Xinzhe Han
Shuhui Wang
Chi Su
Qingming Huang
Q. Tian
26
75
0
27 Jul 2021
Towards the Unseen: Iterative Text Recognition by Distilling from Errors
Towards the Unseen: Iterative Text Recognition by Distilling from Errors
A. Bhunia
Pinaki Nath Chowdhury
Aneeshan Sain
Yi-Zhe Song
38
16
0
26 Jul 2021
ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback
ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback
Mike Wu
Noah D. Goodman
Chris Piech
Chelsea Finn
40
19
0
23 Jul 2021
Improving the Generalization of Meta-learning on Unseen Domains via
  Adversarial Shift
Improving the Generalization of Meta-learning on Unseen Domains via Adversarial Shift
Pinzhuo Tian
Yao Gao
OOD
17
1
0
23 Jul 2021
Neural Abstructions: Abstractions that Support Construction for Grounded
  Language Learning
Neural Abstructions: Abstractions that Support Construction for Grounded Language Learning
Kaylee Burns
Christopher D. Manning
Li Fei-Fei
32
0
0
20 Jul 2021
Filtered Noise Shaping for Time Domain Room Impulse Response Estimation
  From Reverberant Speech
Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech
C. Steinmetz
V. Ithapu
P. Calamia
50
39
0
15 Jul 2021
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
Paul Pu Liang
Yiwei Lyu
Xiang Fan
Zetian Wu
Yun Cheng
...
Peter Wu
Michelle A. Lee
Yuke Zhu
Ruslan Salakhutdinov
Louis-Philippe Morency
VLM
37
159
0
15 Jul 2021
Combining 3D Image and Tabular Data via the Dynamic Affine Feature Map
  Transform
Combining 3D Image and Tabular Data via the Dynamic Affine Feature Map Transform
Sebastian Polsterl
Tom Nuno Wolf
Christian Wachinger
MedIm
32
44
0
13 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
43
744
0
07 Jul 2021
Memory Efficient Meta-Learning with Large Images
Memory Efficient Meta-Learning with Large Images
J. Bronskill
Daniela Massiceti
Massimiliano Patacchiola
Katja Hofmann
Sebastian Nowozin
Richard Turner
VLM
27
20
0
02 Jul 2021
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement
  Learning Agents
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents
Grgur Kovač
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
ALM
32
6
0
02 Jul 2021
Cross-domain Few-shot Learning with Task-specific Adapters
Cross-domain Few-shot Learning with Task-specific Adapters
Weihong Li
Xialei Liu
Hakan Bilen
OOD
34
113
0
01 Jul 2021
Domain Conditional Predictors for Domain Adaptation
Domain Conditional Predictors for Domain Adaptation
João Monteiro
Xavier Gibert
Jianqiao Feng
Vincent Dumoulin
Dar-Shyang Lee
OOD
TTA
AI4CE
22
5
0
25 Jun 2021
Q-space Conditioned Translation Networks for Directional Synthesis of
  Diffusion Weighted Images from Multi-modal Structural MRI
Q-space Conditioned Translation Networks for Directional Synthesis of Diffusion Weighted Images from Multi-modal Structural MRI
Mengwei Ren
Heejong Kim
Neel Dey
Guido Gerig
DiffM
MedIm
44
16
0
24 Jun 2021
Learning to Predict Visual Attributes in the Wild
Learning to Predict Visual Attributes in the Wild
Khoi Pham
Kushal Kafle
Zhe Lin
Zhi Ding
Scott D. Cohen
Q. Tran
Abhinav Shrivastava
21
108
0
17 Jun 2021
Unsupervised Video Prediction from a Single Frame by Estimating 3D
  Dynamic Scene Structure
Unsupervised Video Prediction from a Single Frame by Estimating 3D Dynamic Scene Structure
Paul Henderson
Christoph H. Lampert
Bernd Bickel
VGen
20
7
0
16 Jun 2021
Grounding Spatio-Temporal Language with Transformers
Grounding Spatio-Temporal Language with Transformers
Tristan Karch
Laetitia Teodorescu
Katja Hofmann
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
27
11
0
16 Jun 2021
How Modular Should Neural Module Networks Be for Systematic
  Generalization?
How Modular Should Neural Module Networks Be for Systematic Generalization?
Vanessa D’Amario
Tomotake Sasaki
Xavier Boix
15
17
0
15 Jun 2021
CRASH: Raw Audio Score-based Generative Modeling for Controllable
  High-resolution Drum Sound Synthesis
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis
Simon Rouard
Gaëtan Hadjeres
DiffM
27
42
0
14 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
27
11
0
12 Jun 2021
Learning Compositional Shape Priors for Few-Shot 3D Reconstruction
Learning Compositional Shape Priors for Few-Shot 3D Reconstruction
Mateusz Michalkiewicz
Stavros Tsogkas
Sarah Parisot
Mahsa Baktash
Anders P. Eriksson
Eugene Belilovsky
3DV
3DPC
13
2
0
11 Jun 2021
ViT-Inception-GAN for Image Colourising
ViT-Inception-GAN for Image Colourising
Tejas Bana
Jatan Loya
Siddhant Kulkarni
ViT
21
1
0
11 Jun 2021
NAAQA: A Neural Architecture for Acoustic Question Answering
NAAQA: A Neural Architecture for Acoustic Question Answering
Jerome Abdelnour
Jean Rouat
G. Salvi
6
4
0
11 Jun 2021
Optimizing Reusable Knowledge for Continual Learning via Metalearning
Optimizing Reusable Knowledge for Continual Learning via Metalearning
J. Hurtado
Alain Raymond-Sáez
Alvaro Soto
CLL
34
37
0
09 Jun 2021
Geometry-Consistent Neural Shape Representation with Implicit
  Displacement Fields
Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields
Yifan Wang
Lukas Rahmann
O. Sorkine-Hornung
27
65
0
09 Jun 2021
Pretraining Representations for Data-Efficient Reinforcement Learning
Pretraining Representations for Data-Efficient Reinforcement Learning
Max Schwarzer
Nitarshan Rajkumar
Michael Noukhovitch
Ankesh Anand
Laurent Charlin
Devon Hjelm
Philip Bachman
Aaron Courville
OffRL
47
114
0
09 Jun 2021
Understanding top-down attention using task-oriented ablation design
Understanding top-down attention using task-oriented ablation design
Freddie Bickford-Smith
Brett D. Roads
Xiaoliang Luo
Bradley C. Love
46
1
0
08 Jun 2021
A Survey of Transformers
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
58
1,089
0
08 Jun 2021
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared
  Hypernetworks
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks
Rabeeh Karimi Mahabadi
Sebastian Ruder
Mostafa Dehghani
James Henderson
MoE
39
296
0
08 Jun 2021
DINs: Deep Interactive Networks for Neurofibroma Segmentation in
  Neurofibromatosis Type 1 on Whole-Body MRI
DINs: Deep Interactive Networks for Neurofibroma Segmentation in Neurofibromatosis Type 1 on Whole-Body MRI
Jian-Wei Zhang
Wei Chen
K. I. Ly
Xubin Zhang
Fan Yan
J. Jordan
G. Harris
S. Plotkin
Pengyi Hao
W. Cai
22
6
0
07 Jun 2021
Go with the Flows: Mixtures of Normalizing Flows for Point Cloud
  Generation and Reconstruction
Go with the Flows: Mixtures of Normalizing Flows for Point Cloud Generation and Reconstruction
Janis Postels
Mengya Liu
Riccardo Spezialetti
Luc Van Gool
Federico Tombari
AI4CE
3DPC
22
22
0
06 Jun 2021
Neural Implicit 3D Shapes from Single Images with Spatial Patterns
Neural Implicit 3D Shapes from Single Images with Spatial Patterns
Yixin Zhuang
Yunzhe Liu
Yujie Wang
Baoquan Chen
3DPC
22
0
0
06 Jun 2021
Light Field Networks: Neural Scene Representations with
  Single-Evaluation Rendering
Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering
Vincent Sitzmann
Semon Rezchikov
William T. Freeman
J. Tenenbaum
F. Durand
3DV
54
288
0
04 Jun 2021
Cross-Trajectory Representation Learning for Zero-Shot Generalization in
  RL
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Bogdan Mazoure
Ahmed M. Ahmed
Patrick MacAlpine
R. Devon Hjelm
Andrey Kolobov
35
27
0
04 Jun 2021
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal
  Numerical Reasoning
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning
Jiaqi Chen
Jianheng Tang
Jinghui Qin
Xiaodan Liang
Lingbo Liu
Eric Xing
Liang Lin
AIMat
22
160
0
30 May 2021
Fighting Gradients with Gradients: Dynamic Defenses against Adversarial
  Attacks
Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks
Dequan Wang
An Ju
Evan Shelhamer
David Wagner
Trevor Darrell
AAML
26
27
0
18 May 2021
Learning a Universal Template for Few-shot Dataset Generalization
Learning a Universal Template for Few-shot Dataset Generalization
Eleni Triantafillou
Hugo Larochelle
R. Zemel
Vincent Dumoulin
32
92
0
14 May 2021
Meta-Inductive Node Classification across Graphs
Meta-Inductive Node Classification across Graphs
Zhihao Wen
Yuan Fang
Zemin Liu
43
34
0
14 May 2021
BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch
  Whitening
BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening
Wenqi Shao
Hang Yu
Zhaoyang Zhang
Hang Xu
Zhenguo Li
Ping Luo
AAML
12
2
0
13 May 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
85
7,480
0
11 May 2021
Multi-modal Conditional Bounding Box Regression for Music Score
  Following
Multi-modal Conditional Bounding Box Regression for Music Score Following
Florian Henkel
Gerhard Widmer
14
4
0
10 May 2021
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language
  and Symbolic Reasoning
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning
Pan Lu
Ran Gong
Shibiao Jiang
Liang Qiu
Siyuan Huang
Xiaodan Liang
Song-Chun Zhu
AIMat
LRM
23
210
0
10 May 2021
Generative Adversarial Registration for Improved Conditional Deformable
  Templates
Generative Adversarial Registration for Improved Conditional Deformable Templates
Neel Dey
Mengwei Ren
Adrian Dalca
Guido Gerig
GAN
MedIm
30
34
0
07 May 2021
Previous
123...181920...252627
Next