Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
v1
v2 (latest)
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,349 papers shown
Title
Learning Control of Neural Sound Effects Synthesis from Physically Inspired Models
Yisu Zong
Joshua Reiss
79
1
0
13 Mar 2025
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation
Qi Lv
Hao Li
Xiang Deng
Rui Shao
Yinchuan Li
Haifeng Zhang
Longxiang Gao
Michael Yu Wang
Liqiang Nie
118
2
0
13 Mar 2025
IQPFR: An Image Quality Prior for Blind Face Restoration and Beyond
Peng Hu
Chunming He
Lei Xu
Jingduo Tian
Sina Farsiu
Yize Zhang
Pei Liu
Xiu Li
109
0
0
12 Mar 2025
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
166
1
0
12 Mar 2025
DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning
Chengxuan Qian
Kai Han
Jing Wang
Zhenlong Yuan
Rui Qian
Chongwen Lyu
Jun Chen
103
3
0
09 Mar 2025
VLA Model-Expert Collaboration for Bi-directional Manipulation Learning
Tian-Yu Xiang
Ao-Qun Jin
Xiao-Hu Zhou
Mei-Jiang Gui
Xiao-Liang Xie
...
Shuang-Yi Wang
Sheng-Bin Duang
Si-Cheng Wang
Zheng Lei
Z. Hou
112
2
0
06 Mar 2025
RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning
Xi Ye
Rui Heng Yang
Jun Jin
Yiming Li
Amir Rasouli
75
0
0
06 Mar 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
184
10
0
05 Mar 2025
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
Han Xue
Jieji Ren
Wendi Chen
Gu Zhang
Yuan Fang
Guoying Gu
Huazhe Xu
Cewu Lu
99
12
0
04 Mar 2025
ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation
Yufei Wang
Ziyu Wang
Mino Nakura
Pratik Bhowal
Chia-Liang Kuo
Yi-Ting Chen
Zackory M. Erickson
David Held
149
0
0
04 Mar 2025
Robustness to Geographic Distribution Shift using Location Encoders
Ruth Crasto
OOD
121
0
0
03 Mar 2025
Boolean-aware Attention for Dense Retrieval
Quan Mai
Susan Gauch
Douglas Adams
67
1
0
03 Mar 2025
XIRVIO: Critic-guided Iterative Refinement for Visual-Inertial Odometry with Explainable Adaptive Weighting
Chit Yuen Lam
Ronald Clark
Basaran Bahadir Kocer
VGen
160
0
0
01 Mar 2025
Deep Learning of the Evolution Operator Enables Forecasting of Out-of-Training Dynamics in Chaotic Systems
Ira J. S. Shokar
Peter H. Haynes
R. Kerswell
AI4TS
86
1
0
28 Feb 2025
Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport
Jingru Fu
Yuqi Zheng
Neel Dey
D. Ferreira
R. Moreno
MedIm
76
0
0
28 Feb 2025
DGFM: Full Body Dance Generation Driven by Music Foundation Models
Xinran Liu
Zhenhua Feng
Diptesh Kanojia
Wenwu Wang
DiffM
154
1
0
27 Feb 2025
On the Interpolation Effect of Score Smoothing
Zhengdao Chen
DiffM
148
1
0
26 Feb 2025
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music
Xinran Liu
Xu Dong
Diptesh Kanojia
Wenwu Wang
Zhenhua Feng
DiffM
96
0
0
25 Feb 2025
Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments
Shitong Xu
Yiyuan Yang
Niki Trigoni
Andrew Markham
76
0
0
23 Feb 2025
MedFuncta: Modality-Agnostic Representations Based on Efficient Neural Fields
Paul Friedrich
Florentin Bieder
P. Cattin
MedIm
167
0
0
20 Feb 2025
Towards Fusing Point Cloud and Visual Representations for Imitation Learning
Atalay Donat
Xiaogang Jia
Xi Huang
Aleksandar Taranovic
Denis Blessing
Ge Li
Hongyi Zhou
Hanyi Zhang
Rudolf Lioutikov
Gerhard Neumann
3DPC
SSL
147
1
0
20 Feb 2025
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
Zhiyuan Liu
Yanchen Luo
Han Huang
Enzhi Zhang
Changhao Nai
Sihang Li
Yaorui Shi
Xiang Wang
Kenji Kawaguchi
Tat-Seng Chua
201
4
0
18 Feb 2025
Responsive Noise-Relaying Diffusion Policy: Responsive and Efficient Visuomotor Control
Zhuoqun Chen
Xiu Yuan
Tongzhou Mu
Hao Su
111
1
0
18 Feb 2025
Predicate Hierarchies Improve Few-Shot State Classification
Emily Jin
Joy Hsu
Jiajun Wu
OffRL
148
0
0
18 Feb 2025
UPCMR: A Universal Prompt-guided Model for Random Sampling Cardiac MRI Reconstruction
Donghang Lyu
Chinmay Rao
Marius Staring
M. Osch
M. Doneva
Hildo J. Lamb
Nicola Pezzotti
77
1
0
18 Feb 2025
CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning
Quanmin Wei
Penglin Dai
Wei Li
Bingyi Liu
Xiao-Jun Wu
95
1
0
15 Feb 2025
CDM: Contact Diffusion Model for Multi-Contact Point Localization
Seo Wook Han
Min Jun Kim
DiffM
60
0
0
10 Feb 2025
History-Guided Video Diffusion
Kiwhan Song
Boyuan Chen
Max Simchowitz
Yilun Du
Russ Tedrake
Vincent Sitzmann
VGen
212
18
0
10 Feb 2025
HOG-Diff: Higher-Order Guided Diffusion for Graph Generation
Yiming Huang
Tolga Birdal
DiffM
123
0
0
06 Feb 2025
Reinforcement Learning of Flexible Policies for Symbolic Instructions with Adjustable Mapping Specifications
Wataru Hatanaka
R. Yamashina
Takamitsu Matsubara
237
0
0
31 Jan 2025
Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning
Negin Hashemi Dijujin
Seyed Roozbeh Razavi Rohani
Mohammad Samiei
M. Baghshah
106
0
0
28 Jan 2025
UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images
Tatiana Taís Schein
Gustavo Pereira de Almeira
Stephanie Loi Brião
Rodrigo Andrade de Bem
Felipe Gomes de Oliveira
Paulo L. J. Drews-Jr
89
1
0
28 Jan 2025
BiFold: Bimanual Cloth Folding with Language Guidance
Oriol Barbany
Adrià Colomé
Carme Torras
52
1
0
27 Jan 2025
Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond
Weiyu Chen
Xiaoyuan Zhang
Baijiong Lin
Xi Lin
Han Zhao
Qingfu Zhang
James T. Kwok
172
5
0
19 Jan 2025
Control-ITRA: Controlling the Behavior of a Driving Model
Vasileios Lioutas
Adam Scibior
Matthew Niedoba
Berend Zwartsenberg
Frank Wood
411
0
0
17 Jan 2025
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
Riccardo Simionato
Stefano Fasciani
149
1
0
17 Jan 2025
Enhanced Multi-Scale Cross-Attention for Person Image Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
DiffM
145
0
0
15 Jan 2025
Score-based 3D molecule generation with neural fields
Matthieu Kirchmeyer
Pedro H. O. Pinheiro
Saeed Saremi
DiffM
135
2
0
15 Jan 2025
Multi-subject Open-set Personalization in Video Generation
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Yuwei Fang
Kwot Sin Lee
Ivan Skorokhodov
Kfir Aberman
Jun-Yan Zhu
Ming-Hsuan Yang
Sergey Tulyakov
DiffM
VGen
194
13
0
10 Jan 2025
Data-Driven Radio Propagation Modeling using Graph Neural Networks
Adrien Bufort
Laurent Lebocq
Stefan Cathabard
GNN
87
3
0
08 Jan 2025
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
H. S. Bovbjerg
Jan Østergaard
Jesper Jensen
Zheng-Hua Tan
118
0
0
06 Jan 2025
S-Diff: An Anisotropic Diffusion Model for Collaborative Filtering in Spectral Domain
Rui Xia
Yanhua Cheng
Yongxiang Tang
Xiaocheng Liu
Xialong Liu
Lisong Wang
Peng Jiang
DiffM
104
1
0
03 Jan 2025
JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling
Haorui Ji
Rong Wang
Taojun Lin
Hongdong Li
3DH
92
1
0
31 Dec 2024
Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
Tornike Karchkhadze
M. Izadi
Shlomo Dubnov
DiffM
86
5
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Zhaoxiang Zhang
465
11
0
24 Dec 2024
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
Andi Xu
Hongsong Wang
Pinle Ding
Jie Gui
DiffM
VGen
211
2
0
23 Dec 2024
HyperCLIP: Adapting Vision-Language models with Hypernetworks
Victor Akinwande
Mohammad Sadegh Norouzzadeh
Devin Willmott
Anna Bair
Madan Ravi Ganesh
J. Zico Kolter
CLIP
VLM
159
0
0
21 Dec 2024
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Ho Kei Cheng
Masato Ishii
Akio Hayakawa
Takashi Shibuya
Alex Schwing
Yuki Mitsufuji
VGen
294
18
0
19 Dec 2024
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Xiu Yuan
Tongzhou Mu
Stone Tao
Yunhao Fang
Mengke Zhang
H. Su
OffRL
139
8
0
18 Dec 2024
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
Moritz Reuss
Jyothish Pari
Pulkit Agrawal
Rudolf Lioutikov
DiffM
MoE
143
8
0
17 Dec 2024
Previous
1
2
3
4
5
6
...
25
26
27
Next