Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
v1
v2 (latest)
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,349 papers shown
Title
Interpretable Multi-task Learning with Shared Variable Embeddings
Maciej Żelaszczyk
Jacek Mańdziuk
65
0
0
10 May 2024
Temporal and Heterogeneous Graph Neural Network for Remaining Useful Life Prediction
Zhihao Wen
Yuan Fang
Pengcheng Wei
Fayao Liu
Zhenghua Chen
Min-man Wu
AI4CE
75
2
0
07 May 2024
Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model
Joo Young Choi
Jaesung R. Park
Inkyu Park
Jaewoong Cho
Albert No
Ernest K. Ryu
AI4CE
114
5
0
07 May 2024
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
107
4
0
06 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
176
48
0
06 May 2024
Matten: Video Generation with Mamba-Attention
Yu Gao
Jiancheng Huang
Xiaopeng Sun
Zequn Jie
Yujie Zhong
Lin Ma
162
17
0
05 May 2024
Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning
F. Sarfraz
Bahram Zonooz
Elahe Arani
CLL
72
3
0
04 May 2024
CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning
Kota Kondo
Andrea Tagliabue
Xiaoyi Cai
Claudius T. Tewari
Olivia Garcia
Marcos Espitia-Alvarez
Jonathan P. How
109
9
0
02 May 2024
TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms
Yueyuan Sui
Minghui Zhao
Junxi Xia
Xiaofan Jiang
S. Xia
Mamba
93
11
0
02 May 2024
Any-Quantile Probabilistic Forecasting of Short-Term Electricity Demand
Slawek Smyl
Boris N. Oreshkin
Paweł Pełka
Grzegorz Dudek
AI4TS
79
0
0
26 Apr 2024
Latent Modulated Function for Computational Optimal Continuous Image Representation
Zongyao He
Zhi Jin
SupR
109
12
0
25 Apr 2024
Leveraging Large Language Models for Multimodal Search
Oriol Barbany
Michael Huang
Xinliang Zhu
Arnab Dhua
97
10
0
24 Apr 2024
Audio Anti-Spoofing Detection: A Survey
Menglu Li
Yasaman Ahmadiadli
Xiao-Ping Zhang
104
25
0
22 Apr 2024
TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
70
1
0
19 Apr 2024
MCM: Multi-condition Motion Synthesis Framework
Zeyu Ling
Bo Han
Yongkang Wang
Han Lin
Mohan Kankanhalli
Weidong Geng
72
0
0
19 Apr 2024
Retrieval-Augmented Embodied Agents
Yichen Zhu
Zhicai Ou
Xiaofeng Mou
Jian Tang
106
20
0
17 Apr 2024
Neural Shrödinger Bridge Matching for Pansharpening
Zihan Cao
Xiao Wu
Liang-Jian Deng
DiffM
104
2
0
17 Apr 2024
FoundationGrasp: Generalizable Task-Oriented Grasping with Foundation Models
Chao Tang
Dehao Huang
Wenlong Dong
Ruinian Xu
Kuanqi Cai
92
13
0
16 Apr 2024
Find The Gap: Knowledge Base Reasoning For Visual Question Answering
Elham J. Barezi
Parisa Kordjamshidi
63
1
0
16 Apr 2024
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation
Yu-Ju Tsai
Jin-Cheng Jhang
Jingjing Zheng
Wei Wang
Albert Y. C. Chen
Min Sun
Cheng-Hao Kuo
Ming-Hsuan Yang
3DV
77
4
0
15 Apr 2024
RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion
Guoxuan Chi
Zheng Yang
Chenshu Wu
Jingao Xu
Yuchong Gao
Yunhao Liu
Tony Xiao Han
DiffM
88
35
0
14 Apr 2024
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts
Övgü Özdemir
Erdem Akagündüz
108
11
0
12 Apr 2024
Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks
Yang Yang
Hongpeng Pan
Qingjun Jiang
Yi Tian Xu
Jinghui Tang
64
6
0
12 Apr 2024
Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models
Taegyun Kwon
Dasaem Jeong
Juhan Nam
385
3
0
10 Apr 2024
Domain Generalisation via Imprecise Learning
Anurag Singh
Siu Lun Chau
S. Bouabid
Krikamol Muandet
AI4CE
OOD
100
10
0
06 Apr 2024
Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Junshi Huang
108
26
0
06 Apr 2024
Implicit Assimilation of Sparse In Situ Data for Dense & Global Storm Surge Forecasting
Patrick Ebel
Brandon Victor
Peter Naylor
Gabriele Meoni
Federico Serva
Rochelle Schneider
40
0
0
05 Apr 2024
Joint-Task Regularization for Partially Labeled Multi-Task Learning
Kento Nishi
Junsik Kim
Wanhua Li
Hanspeter Pfister
112
2
0
02 Apr 2024
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
Xu He
Qiaochu Huang
Zhensong Zhang
Zhiwei Lin
Zhiyong Wu
Sicheng Yang
Minglei Li
Zhiyi Chen
Songcen Xu
Xiaofei Wu
75
16
0
02 Apr 2024
Video Interpolation with Diffusion Models
Siddhant Jain
Daniel Watson
Eric Tabellion
Aleksander Holyñski
Ben Poole
Janne Kontkanen
SupR
VGen
DiffM
108
41
0
01 Apr 2024
Condition-Aware Neural Network for Controlled Image Generation
Han Cai
Zhekai Zhang
Zhuoyang Zhang
Qinsheng Zhang
Ming-Yu Liu
Song Han
DiffM
58
8
0
01 Apr 2024
SineNet: Learning Temporal Dynamics in Time-Dependent Partial Differential Equations
Xuan Zhang
Jacob Helwig
Yu-Ching Lin
Yaochen Xie
Cong Fu
Stephan Wojtowytsch
Shuiwang Ji
AI4CE
103
8
0
28 Mar 2024
Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos
Akshay Paruchuri
S. Ehrenstein
Shuxian Wang
Inbar Fried
Stephen M. Pizer
Marc Niethammer
Roni Sengupta
MDE
98
6
0
26 Mar 2024
ReMamber: Referring Image Segmentation with Mamba Twister
Yu-Hao Yang
Chaofan Ma
Jiangchao Yao
Zhun Zhong
Ya Zhang
Yanfeng Wang
Mamba
108
24
0
26 Mar 2024
Diffusion-based Negative Sampling on Graphs for Link Prediction
Trung-Kien Nguyen
Yuan Fang
DiffM
76
11
0
25 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
120
26
0
25 Mar 2024
Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models
Hanzhi Yin
Gang Cheng
Christian J. Steinmetz
Ruibin Yuan
Richard M. Stern
Roger B. Dannenberg
51
6
0
24 Mar 2024
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffM
110
19
0
22 Mar 2024
GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot
Wenxuan Song
Han Zhao
Pengxiang Ding
Can Cui
Shangke Lyu
Yaning Fan
Donglin Wang
OffRL
118
14
0
20 Mar 2024
HyperFusion: A Hypernetwork Approach to Multimodal Integration of Tabular and Medical Imaging Data for Predictive Modeling
Daniel Duenias
Brennan Nichyporuk
Tal Arbel
Tammy Riklin-Raviv
95
7
0
20 Mar 2024
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Elaine Sui
Xiaohan Wang
Serena Yeung-Levy
VLM
90
5
0
19 Mar 2024
Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Vidhi Jain
Maria Attarian
Nikhil J. Joshi
Ayzaan Wahid
Danny Driess
...
Stefan Welker
Christine Chan
Igor Gilitschenski
Yonatan Bisk
Debidatta Dwibedi
136
32
0
19 Mar 2024
Graph Neural Networks for Learning Equivariant Representations of Neural Networks
Miltiadis Kofinas
Boris Knyazev
Yan Zhang
Yunlu Chen
Gertjan J. Burghouts
E. Gavves
Cees G. M. Snoek
David W. Zhang
114
37
0
18 Mar 2024
Efficient Trajectory Forecasting and Generation with Conditional Flow Matching
Sean Ye
Matthew C. Gombolay
91
3
0
16 Mar 2024
Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Ronghui Li
YuXiang Zhang
Yachao Zhang
Hongwen Zhang
Jie Guo
Yan Zhang
Yebin Liu
Xiu Li
DiffM
108
35
0
15 Mar 2024
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
Yizhe Xiong
Hui Chen
Tianxiang Hao
Zijia Lin
Jungong Han
Yuesong Zhang
Guoxin Wang
Yongjun Bao
Guiguang Ding
97
18
0
14 Mar 2024
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
Shengzhuang Chen
Jihoon Tack
Yunqiao Yang
Yee Whye Teh
Jonathan Richard Schwarz
Ying Wei
MoE
123
4
0
13 Mar 2024
NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
Ran Xu
Yan Shen
Xiaoqi Li
Ruihai Wu
Hao Dong
LM&Ro
82
10
0
13 Mar 2024
Semantic Residual Prompts for Continual Learning
Martin Menabue
Emanuele Frascaroli
Matteo Boschini
E. Sangineto
Lorenzo Bonicelli
Angelo Porrello
Simone Calderara
CLL
VLM
123
11
0
11 Mar 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
124
103
0
08 Mar 2024
Previous
1
2
3
...
7
8
9
...
25
26
27
Next