ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,310 papers shown
Title
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning
Yan Zhang
Yao Feng
Alpár Cseke
Nitin Saini
Nathan Bajandas
Nicolas Heron
M. Black
DiffM
VGen
66
0
0
21 Mar 2025
DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation
DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation
Jiangran Lyu
Ziming Li
Xuesong Shi
Chaoyi Xu
Yizhou Wang
He Wang
49
0
0
21 Mar 2025
SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer
SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer
Hongda Liu
Longguang Wang
Ye Zhang
Ziru Yu
Yulan Guo
Mamba
70
0
0
20 Mar 2025
Diffusion-augmented Graph Contrastive Learning for Collaborative Filter
Diffusion-augmented Graph Contrastive Learning for Collaborative Filter
Fan Huang
Wei Wang
DiffM
69
0
0
20 Mar 2025
Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling
Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling
Yanchen Luo
Zhiyuan Liu
Yi Zhao
Sihang Li
Kenji Kawaguchi
Tat-Seng Chua
Xuben Wang
MedIm
69
0
0
19 Mar 2025
Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering
Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering
Thanh-Son Nguyen
Hong Yang
Tzeh Yuan Neoh
Hao Zhang
Ee Yeo Keat
Basura Fernando
NAI
59
0
0
19 Mar 2025
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
A. Kazerouni
Soroush Mehraban
Michael Brudno
Babak Taati
46
0
0
19 Mar 2025
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
Seokhyeon Hong
Chaelin Kim
Serin Yoon
Junghyun Nam
Sihun Cha
Junyong Noh
DiffM
VGen
73
1
0
18 Mar 2025
Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation
Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation
Wupeng Wang
Zexu Pan
Jingru Lin
Shuai Wang
Haizhou Li
53
0
0
16 Mar 2025
Image-Goal Navigation Using Refined Feature Guidance and Scene Graph Enhancement
Zhicheng Feng
Xieyuanli Chen
Chenghao Shi
Lun Luo
Z. Chen
Yun Liu
Huimin Lu
48
0
0
14 Mar 2025
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
Hejia Chen
Haoxian Zhang
Shoulong Zhang
Xiaoqiang Liu
Sisi Zhuang
Yuan Zhang
Pengfei Wan
Di Zhang
Shuai Li
59
1
0
14 Mar 2025
Learning Control of Neural Sound Effects Synthesis from Physically Inspired Models
Yisu Zong
Joshua Reiss
51
0
0
13 Mar 2025
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation
Qi Lv
Hao Li
Xiang Deng
Rui Shao
Yinchuan Li
Jianye Hao
Longxiang Gao
Michael Yu Wang
Liqiang Nie
46
0
0
13 Mar 2025
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
57
0
0
12 Mar 2025
IQPFR: An Image Quality Prior for Blind Face Restoration and Beyond
Peng Hu
Chunming He
Lei Xu
Jingduo Tian
Sina Farsiu
Yuyao Zhang
Pei Liu
Xiu Li
63
0
0
12 Mar 2025
DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning
Chengxuan Qian
Kai Han
J. Wang
Zhenlong Yuan
Rui Qian
Chongwen Lyu
Jun Chen
51
1
0
09 Mar 2025
RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning
Xi Ye
Rui Heng Yang
Jun Jin
Y. K. Li
Amir Rasouli
53
0
0
06 Mar 2025
VLA Model-Expert Collaboration for Bi-directional Manipulation Learning
Tian-Yu Xiang
Ao-Qun Jin
Xiao-Hu Zhou
Mei-Jiang Gui
Xiao-Liang Xie
...
Shuang-Yi Wang
Sheng-Bin Duang
Si-Cheng Wang
Zheng Lei
Z. Hou
58
1
0
06 Mar 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
85
6
0
05 Mar 2025
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
Han Xue
Jieji Ren
Wendi Chen
Gu Zhang
Yuan Fang
Guoying Gu
Huazhe Xu
Cewu Lu
44
5
0
04 Mar 2025
ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation
ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation
Yufei Wang
Ziyu Wang
Mino Nakura
Pratik Bhowal
Chia-Liang Kuo
Yi-Ting Chen
Zackory M. Erickson
David Held
66
0
0
04 Mar 2025
Robustness to Geographic Distribution Shift using Location Encoders
Ruth Crasto
OOD
81
0
0
03 Mar 2025
Boolean-aware Attention for Dense Retrieval
Quan Mai
Susan Gauch
Douglas Adams
34
1
0
03 Mar 2025
XIRVIO: Critic-guided Iterative Refinement for Visual-Inertial Odometry with Explainable Adaptive Weighting
Chit Yuen Lam
Ronald Clark
Basaran Bahadir Kocer
VGen
71
0
0
01 Mar 2025
Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport
Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport
Jingru Fu
Yuqi Zheng
Neel Dey
D. Ferreira
R. Moreno
MedIm
29
0
0
28 Feb 2025
Deep Learning of the Evolution Operator Enables Forecasting of Out-of-Training Dynamics in Chaotic Systems
Deep Learning of the Evolution Operator Enables Forecasting of Out-of-Training Dynamics in Chaotic Systems
Ira J. S. Shokar
Peter H. Haynes
R. Kerswell
AI4TS
35
1
0
28 Feb 2025
DGFM: Full Body Dance Generation Driven by Music Foundation Models
DGFM: Full Body Dance Generation Driven by Music Foundation Models
Xinran Liu
Zhenhua Feng
Diptesh Kanojia
Wenwu Wang
DiffM
66
1
0
27 Feb 2025
On the Interpolation Effect of Score Smoothing
On the Interpolation Effect of Score Smoothing
Zhengdao Chen
DiffM
83
0
0
26 Feb 2025
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music
Xinran Liu
Xu Dong
Diptesh Kanojia
Wenwu Wang
Zhenhua Feng
DiffM
62
0
0
25 Feb 2025
Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments
Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments
Shitong Xu
Yiyuan Yang
Niki Trigoni
Andrew Markham
34
0
0
23 Feb 2025
MedFuncta: Modality-Agnostic Representations Based on Efficient Neural Fields
MedFuncta: Modality-Agnostic Representations Based on Efficient Neural Fields
Paul Friedrich
Florentin Bieder
P. Cattin
MedIm
62
0
0
20 Feb 2025
Towards Fusing Point Cloud and Visual Representations for Imitation Learning
Towards Fusing Point Cloud and Visual Representations for Imitation Learning
Atalay Donat
Xiaogang Jia
Xi Huang
Aleksandar Taranovic
Denis Blessing
Ge Li
Hongyi Zhou
Hanyi Zhang
Rudolf Lioutikov
Gerhard Neumann
3DPC
SSL
73
1
0
20 Feb 2025
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
Zhiyuan Liu
Yanchen Luo
Han Huang
Enzhi Zhang
Sihang Li
Fan Zhang
Yaorui Shi
Xuben Wang
Kenji Kawaguchi
Tat-Seng Chua
100
3
0
18 Feb 2025
UPCMR: A Universal Prompt-guided Model for Random Sampling Cardiac MRI Reconstruction
UPCMR: A Universal Prompt-guided Model for Random Sampling Cardiac MRI Reconstruction
Donghang Lyu
Chinmay Rao
Marius Staring
M. Osch
M. Doneva
Hildo J. Lamb
Nicola Pezzotti
46
1
0
18 Feb 2025
Predicate Hierarchies Improve Few-Shot State Classification
Predicate Hierarchies Improve Few-Shot State Classification
Emily Jin
Joy Hsu
Jiajun Wu
OffRL
79
0
0
18 Feb 2025
CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning
CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning
Quanmin Wei
Penglin Dai
Wei Li
Bingyi Liu
Xiao-Jun Wu
46
0
0
15 Feb 2025
CDM: Contact Diffusion Model for Multi-Contact Point Localization
Seo Wook Han
Min Jun Kim
DiffM
35
0
0
10 Feb 2025
History-Guided Video Diffusion
Kiwhan Song
Boyuan Chen
Max Simchowitz
Yilun Du
Russ Tedrake
Vincent Sitzmann
VGen
117
7
0
10 Feb 2025
HOG-Diff: Higher-Order Guided Diffusion for Graph Generation
HOG-Diff: Higher-Order Guided Diffusion for Graph Generation
Yiming Huang
Tolga Birdal
DiffM
78
0
0
06 Feb 2025
Reinforcement Learning of Flexible Policies for Symbolic Instructions with Adjustable Mapping Specifications
Reinforcement Learning of Flexible Policies for Symbolic Instructions with Adjustable Mapping Specifications
Wataru Hatanaka
R. Yamashina
Takamitsu Matsubara
108
0
0
31 Jan 2025
UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images
Tatiana Taís Schein
Gustavo Pereira de Almeira
Stephanie Loi Brião
Rodrigo Andrade de Bem
Felipe Gomes de Oliveira
Paulo L. J. Drews-Jr
51
0
0
28 Jan 2025
Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning
Negin Hashemi Dijujin
Seyed Roozbeh Razavi Rohani
Mohammad Samiei
M. Baghshah
58
0
0
28 Jan 2025
Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond
Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond
Weiyu Chen
Xiaoyuan Zhang
Baijiong Lin
Xi Lin
Han Zhao
Qingfu Zhang
James T. Kwok
75
2
0
19 Jan 2025
Control-ITRA: Controlling the Behavior of a Driving Model
Control-ITRA: Controlling the Behavior of a Driving Model
Vasileios Lioutas
Adam Scibior
Matthew Niedoba
Berend Zwartsenberg
Frank D. Wood
137
0
0
17 Jan 2025
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
Riccardo Simionato
Stefano Fasciani
78
1
0
17 Jan 2025
Enhanced Multi-Scale Cross-Attention for Person Image Generation
Enhanced Multi-Scale Cross-Attention for Person Image Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
DiffM
70
0
0
15 Jan 2025
Score-based 3D molecule generation with neural fields
Score-based 3D molecule generation with neural fields
Matthieu Kirchmeyer
Pedro H. O. Pinheiro
Saeed Saremi
DiffM
48
0
0
15 Jan 2025
Multi-subject Open-set Personalization in Video Generation
Multi-subject Open-set Personalization in Video Generation
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Yuwei Fang
Kwot Sin Lee
Ivan Skorokhodov
Kfir Aberman
Jun-Yan Zhu
Ming Yang
Sergey Tulyakov
DiffM
VGen
69
7
0
10 Jan 2025
Data-Driven Radio Propagation Modeling using Graph Neural Networks
Data-Driven Radio Propagation Modeling using Graph Neural Networks
Adrien Bufort
Laurent Lebocq
Stefan Cathabard
GNN
46
3
0
08 Jan 2025
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
H. S. Bovbjerg
Jan Østergaard
Jesper Jensen
Zheng-Hua Tan
38
0
0
06 Jan 2025
Previous
12345...252627
Next