ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
v1v2 (latest)

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAttAIMatOffRLAI4CE
ArXiv (abs)PDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,349 papers shown
Title
SALAD: Part-Level Latent Diffusion for 3D Shape Generation and
  Manipulation
SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation
Juil Koo
Seungwoo Yoo
Minh Hoai Nguyen
Minhyuk Sung
DiffM
75
45
0
21 Mar 2023
Efficient Feature Distillation for Zero-shot Annotation Object Detection
Efficient Feature Distillation for Zero-shot Annotation Object Detection
Zhuoming Liu
Xuefeng Hu
Ram Nevatia
VLMObjD
70
1
0
21 Mar 2023
CC3D: Layout-Conditioned Generation of Compositional 3D Scenes
CC3D: Layout-Conditioned Generation of Compositional 3D Scenes
Sherwin Bahmani
Jeong Joon Park
Despoina Paschalidou
Xingguang Yan
Gordon Wetzstein
Leonidas Guibas
Andrea Tagliasacchi
3DV
107
44
0
21 Mar 2023
IMF: Interactive Multimodal Fusion Model for Link Prediction
IMF: Interactive Multimodal Fusion Model for Link Prediction
Xinhang Li
Xiangyu Zhao
Jiaxing Xu
Yong Zhang
Chunxiao Xing
101
48
0
20 Mar 2023
SDF-3DGAN: A 3D Object Generative Method Based on Implicit Signed
  Distance Function
SDF-3DGAN: A 3D Object Generative Method Based on Implicit Signed Distance Function
Lutao Jiang
Ruyi Ji
Libo Zhang
3DV3DGS
108
6
0
13 Mar 2023
MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual
  Fine-Grained Learning
MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning
Ruize Xu
Ruoxuan Feng
Shi-Xiong Zhang
Di Hu
80
24
0
09 Mar 2023
Do Prosody Transfer Models Transfer Prosody?
Do Prosody Transfer Models Transfer Prosody?
A. Sigurgeirsson
Simon King
DiffM
65
8
0
07 Mar 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
352
1,247
0
07 Mar 2023
Patched Diffusion Models for Unsupervised Anomaly Detection in Brain MRI
Patched Diffusion Models for Unsupervised Anomaly Detection in Brain MRI
F. Behrendt
Debayan Bhattacharya
Julia Kruger
R. Opfer
Alexander Schlaefer
DiffMMedIm
75
41
0
07 Mar 2023
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised
  Speech and Text Representations
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
Yuma Koizumi
Heiga Zen
Shigeki Karita
Yifan Ding
Kohei Yatabe
Nobuyuki Morioka
Yu Zhang
Wei Han
Ankur Bapna
M. Bacchiani
94
29
0
03 Mar 2023
Open-World Object Manipulation using Pre-trained Vision-Language Models
Open-World Object Manipulation using Pre-trained Vision-Language Models
Austin Stone
Ted Xiao
Yao Lu
K. Gopalakrishnan
Kuang-Huei Lee
...
Sean Kirmani
Brianna Zitkovich
F. Xia
Chelsea Finn
Karol Hausman
LM&Ro
265
156
0
02 Mar 2023
Continuous descriptor-based control for deep audio synthesis
Continuous descriptor-based control for deep audio synthesis
Ninon Devis
Nils Demerlé
Sarah Nabi
David Genova
P. Esling
55
9
0
27 Feb 2023
Locale Encoding For Scalable Multilingual Keyword Spotting Models
Locale Encoding For Scalable Multilingual Keyword Spotting Models
Pai Zhu
Hyun Jin Park
Alex Park
Angelo Scorza Scarpati
Ignacio López Moreno
50
5
0
25 Feb 2023
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping
  in Clutter
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter
Kechun Xu
Shuqing Zhao
Zhongxiang Zhou
Zizhang Li
Huaijin Pi
Yifeng Zhu
Yue Wang
R. Xiong
86
49
0
24 Feb 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMeOOD
159
80
0
22 Feb 2023
Link Prediction on Latent Heterogeneous Graphs
Link Prediction on Latent Heterogeneous Graphs
Trung-Kien Nguyen
Zemin Liu
Yuan Fang
64
10
0
21 Feb 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image
  Synthesis
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
113
9
0
17 Feb 2023
Entity Aware Modelling: A Survey
Entity Aware Modelling: A Survey
Rahul Ghosh
Haoyu Yang
A. Khandelwal
Erhu He
Arvind Renganathan
Somya Sharma
X. Jia
Vipin Kumar
86
7
0
16 Feb 2023
Multi-Channel Target Speaker Extraction with Refinement: The WavLab
  Submission to the Second Clarity Enhancement Challenge
Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge
Samuele Cornell
Zhongqiu Wang
Yoshiki Masuyama
Shinji Watanabe
Manuel Pariente
Nobutaka Ono
81
12
0
15 Feb 2023
Self-Supervised Temporal Graph learning with Temporal and Structural
  Intensity Alignment
Self-Supervised Temporal Graph learning with Temporal and Structural Intensity Alignment
Meng Liu
K. Liang
Bin Xiao
Wenxuan Tu
Sihang Zhou
Xihong Yang
Xinwang Liu
Yue Liu
102
72
0
15 Feb 2023
Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch
  to Portrait Generation
Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation
Yasheng Sun
Qianyi Wu
Hang Zhou
Kaisiyuan Wang
Tianshu Hu
Chen-Chieh Liao
Shio Miyafuji
Ziwei Liu
Hideki Koike
3DH
68
3
0
14 Feb 2023
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
Yiheng Zhu
Jialun Wu
Chaowen Hu
Jiahuan Yan
Chang-Yu Hsieh
Tingjun Hou
Jian Wu
109
36
0
08 Feb 2023
On Generalized Degree Fairness in Graph Neural Networks
On Generalized Degree Fairness in Graph Neural Networks
Zemin Liu
Trung-Kien Nguyen
Yuan Fang
74
28
0
08 Feb 2023
HumanMAC: Masked Motion Completion for Human Motion Prediction
HumanMAC: Masked Motion Completion for Human Motion Prediction
Ling-Hao Chen
Jiawei Zhang
Ye-rong Li
Yiren Pang
Xiaobo Xia
Tongliang Liu
DiffMVGen
111
62
0
07 Feb 2023
Graph Generation with Diffusion Mixture
Graph Generation with Diffusion Mixture
Jaehyeong Jo
Dongki Kim
Sung Ju Hwang
DiffM
106
23
0
07 Feb 2023
Learning to Count Isomorphisms with Graph Neural Networks
Learning to Count Isomorphisms with Graph Neural Networks
Xingtong Yu
Zemin Liu
Yuan Fang
Xinming Zhang
GNN
98
15
0
07 Feb 2023
Relating EEG to continuous speech using deep neural networks: a review
Relating EEG to continuous speech using deep neural networks: a review
Corentin Puffay
Bernd Accou
Lies Bollens
Mohammad Jalilpour-Monesi
Jonas Vanthornhout
Hugo Van hamme
T. Francart
91
42
0
03 Feb 2023
On the Efficacy of Differentially Private Few-shot Image Classification
On the Efficacy of Differentially Private Few-shot Image Classification
Marlon Tobaben
Aliaksandra Shysheya
J. Bronskill
Andrew Paverd
Shruti Tople
Santiago Zanella Béguelin
Richard Turner
Antti Honkela
96
12
0
02 Feb 2023
Adaptive Siamese Tracking with a Compact Latent Network
Adaptive Siamese Tracking with a Compact Latent Network
Xingping Dong
Jianbing Shen
Fatih Porikli
Jiebo Luo
Ling Shao
92
30
0
02 Feb 2023
Jointist: Simultaneous Improvement of Multi-instrument Transcription and
  Music Source Separation via Joint Training
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training
K. Cheuk
Keunwoo Choi
Qiuqiang Kong
Bochen Li
Minz Won
Ju-Chiang Wang
Yun-Ning Hung
Dorien Herremans
92
6
0
01 Feb 2023
QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing
QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing
Grace Zhang
Ayush Jain
Injune Hwang
Shao-Hua Sun
Joseph J. Lim
75
5
0
01 Feb 2023
Anti-Exploration by Random Network Distillation
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
83
31
0
31 Jan 2023
A Comprehensive Survey of Continual Learning: Theory, Method and
  Application
A Comprehensive Survey of Continual Learning: Theory, Method and Application
Liyuan Wang
Xingxing Zhang
Hang Su
Jun Zhu
KELMCLL
238
716
0
31 Jan 2023
Hierarchical Imitation Learning with Vector Quantized Models
Hierarchical Imitation Learning with Vector Quantized Models
Kalle Kujanpää
Joni Pajarinen
Alexander Ilin
90
13
0
30 Jan 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Haohe Liu
Zehua Chen
Yiitan Yuan
Xinhao Mei
Xubo Liu
Danilo Mandic
Wenwu Wang
Mark D. Plumbley
DiffM
177
510
0
29 Jan 2023
Diffusion Models as Artists: Are we Closing the Gap between Humans and
  Machines?
Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?
Victor Boutin
Thomas Fel
Lakshya Singhal
Rishav Mukherji
Akash Nagaraj
Julien Colin
Thomas Serre
DiffM
74
6
0
27 Jan 2023
PLay: Parametrically Conditioned Layout Generation using Latent
  Diffusion
PLay: Parametrically Conditioned Layout Generation using Latent Diffusion
Ching-Yi Cheng
Forrest Huang
Gang Li
Yang Li
DiffM
72
29
0
27 Jan 2023
Modality-Agnostic Variational Compression of Implicit Neural
  Representations
Modality-Agnostic Variational Compression of Implicit Neural Representations
Jonathan Richard Schwarz
Jihoon Tack
Yee Whye Teh
Jaeho Lee
Jinwoo Shin
86
26
0
23 Jan 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning
  and Adaptive Horizon Prediction
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Shaofei Cai
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
126
42
0
21 Jan 2023
Robust Scheduling with GFlowNets
Robust Scheduling with GFlowNets
David W. Zhang
Corrado Rainone
M. Peschl
Roberto Bondesan
132
57
0
17 Jan 2023
"No, to the Right" -- Online Language Corrections for Robotic
  Manipulation via Shared Autonomy
"No, to the Right" -- Online Language Corrections for Robotic Manipulation via Shared Autonomy
Yuchen Cui
Siddharth Karamcheti
Raj Palleti
Nidhya Shivakumar
Percy Liang
Dorsa Sadigh
LM&Ro
115
83
0
06 Jan 2023
Exploring Efficient Few-shot Adaptation for Vision Transformers
Exploring Efficient Few-shot Adaptation for Vision Transformers
C. Xu
Siqian Yang
Yabiao Wang
Zhanxiong Wang
Yanwei Fu
Xiangyang Xue
97
17
0
06 Jan 2023
Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ
  Trade-Off Between Accuracy and Robustness of DNNs
Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ Trade-Off Between Accuracy and Robustness of DNNs
Souvik Kundu
Sairam Sundaresan
S. N. Sridhar
Shunlin Lu
Han Tang
Peter A. Beerel
AAMLMoE
110
4
0
27 Dec 2022
VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and
  Challenges
VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges
R. Zakari
Jim Wilson Owusu
Hailin Wang
Ke Qin
Zaharaddeen Karami Lawal
Yue-hong Dong
LRM
75
16
0
26 Dec 2022
A Survey of Deep Learning for Mathematical Reasoning
A Survey of Deep Learning for Mathematical Reasoning
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLMLRM
133
150
0
20 Dec 2022
Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know
  How to Reason?
Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason?
Monika Wysoczañska
Tom Monnier
Tomasz Trzciñski
David Picard
ReLMOCL
73
1
0
20 Dec 2022
Scalable Diffusion Models with Transformers
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
175
2,440
0
19 Dec 2022
Correspondence Distillation from NeRF-based GAN
Correspondence Distillation from NeRF-based GAN
Yushi Lan
Chen Change Loy
Bo Dai
75
9
0
19 Dec 2022
MetaPortrait: Identity-Preserving Talking Head Generation with Fast
  Personalized Adaptation
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Bo Zhang
Chenyang Qi
Pan Zhang
Bo Zhang
Hsiang-Tao Wu
Dong Chen
Qifeng Chen
Yong Wang
Fang Wen
117
59
0
15 Dec 2022
Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion
Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion
Yushi Lan
Xuyi Meng
Shuai Yang
Chen Change Loy
Bo Dai
3DH3DV
99
31
0
14 Dec 2022
Previous
123...131415...252627
Next