1709.07871
Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
Harm de Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,315 papers shown
Title
Differentiable Modelling of Percussive Audio with Transient and Spectral Synthesis
Jordie Shier
Franco Caspe
Andrew Robertson
Mark Sandler
C. Saitis
Andrew McPherson
39
3
0
13 Sep 2023
Enhancing multimodal cooperation via sample-level modality valuation
Yake Wei
Ruoxuan Feng
Zihe Wang
Di Hu
38
11
0
12 Sep 2023
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram
Zhifeng Kong
Ming-Yu Liu
Ambrish Dantrey
Bryan Catanzaro
27
7
0
12 Sep 2023
Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models
Sumeet Singh
Stephen Tu
Vikas Sindhwani
DiffM
20
8
0
11 Sep 2023
DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields
Junzhe Zhang
Yushi Lan
Shuai Yang
Fangzhou Hong
Quan Wang
C. Yeo
Ziwei Liu
Chen Change Loy
3DH
AI4CE
43
12
0
08 Sep 2023
A Two-Stage Training Framework for Joint Speech Compression and Enhancement
Jiayi Huang
Zeyu Yan
Wenbin Jiang
Fei Wen
27
0
0
08 Sep 2023
From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao
Qi Yang
Feng Zhou
Changshui Zhang
36
17
0
08 Sep 2023
CoNeS: Conditional neural fields with shift modulation for multi-sequence MRI translation
Yunjie Chen
Marius Staring
O. M. Neve
Stephan R. Romeijn
Erik F. Hensen
Berit M. Verbist
J. Wolterink
Qian Tao
DiffM
MedIm
27
3
0
06 Sep 2023
CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra
Andres Potapczynski
Marc Finzi
Geoff Pleiss
Andrew Gordon Wilson
20
7
0
06 Sep 2023
MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
Zeyu Ling
Bo Han
Yongkang Wong
Mohan Kankanhalli
Weidong Geng
DiffM
23
6
0
06 Sep 2023
PromptTTS 2: Describing and Generating Voices with Text Prompt
Yichong Leng
Zhifang Guo
Kai Shen
Xu Tan
Zeqian Ju
...
Lei He
Xiang-Yang Li
Sheng Zhao
Tao Qin
Jiang Bian
VLM
DiffM
52
41
0
05 Sep 2023
A Survey on Interpretable Cross-modal Reasoning
Dizhan Xue
Shengsheng Qian
Zuyi Zhou
Changsheng Xu
LRM
29
4
0
05 Sep 2023
RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
Homanga Bharadhwaj
Jay Vakil
Mohit Sharma
Abhi Gupta
Shubham Tulsiani
Vikash Kumar
LM&Ro
21
116
0
05 Sep 2023
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Wen Wang
Dongchao Yang
Qichen Ye
Bowen Cao
Yuexian Zou
DiffM
40
3
0
03 Sep 2023
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding
Cheng Shi
Sibei Yang
LRM
29
6
0
03 Sep 2023
Improving Few-shot Image Generation by Structural Discrimination and Textural Modulation
Mengping Yang
Zhe Wang
Wenyi Feng
Qian Zhang
Tingzhe Xiao
DiffM
44
3
0
30 Aug 2023
RetroBridge: Modeling Retrosynthesis with Markov Bridges
Ilia Igashov
Arne Schneuing
Marwin H. S. Segler
Michael M. Bronstein
B. Correia
34
14
0
30 Aug 2023
PolarRec: Radio Interferometric Data Reconstruction with Polar Coordinate Representation
Ruoqing Wang
Zhu-xue Chen
Jiayi Zhu
Qiong Luo
Feng Wang
26
0
0
28 Aug 2023
InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules
Yanqi Bao
Tian Ding
Jing Huo
Wenbin Li
Yuxin Li
Yang Gao
AI4CE
40
7
0
26 Aug 2023
A Survey of Imbalanced Learning on Graphs: Problems, Techniques, and Future Directions
Zemin Liu
Yuan N. Li
Nan-Fang Chen
Qian Wang
Bryan Hooi
Bin He
FaML
14
13
0
26 Aug 2023
BridgeData V2: A Dataset for Robot Learning at Scale
Homer Walke
Kevin Black
Abraham Lee
Moo Jin Kim
Maximilian Du
...
Andre Wang He
Vivek Myers
Kuan Fang
Chelsea Finn
Sergey Levine
32
210
0
24 Aug 2023
TAI-GAN: Temporally and Anatomically Informed GAN for early-to-late frame conversion in dynamic cardiac PET motion correction
Xueqi Guo
Luyao Shi
Xiongchao Chen
Bo Zhou
Qiong Liu
...
Edward J. Miller
Albert J Sinusas
Bruce Spottiswoode
Chi Liu
Nicha Dvornek
MedIm
33
1
0
23 Aug 2023
Vision Transformer Adapters for Generalizable Multitask Learning
Deblina Bhattacharjee
Sabine Süsstrunk
Mathieu Salzmann
ViT
21
8
0
23 Aug 2023
Dance with You: The Diversity Controllable Dancer Generation via Diffusion Models
Siyue Yao
Mingjie Sun
Bingliang Li
Fengyu Yang
Junle Wang
Ruimao Zhang
DiffM
42
18
0
23 Aug 2023
LongDanceDiff: Long-term Dance Generation with Conditional Diffusion Model
Siqi Yang
Zejun Yang
Zhisheng Wang
25
12
0
23 Aug 2023
LFS-GAN: Lifelong Few-Shot Image Generation
Juwon Seo
Jingwen Kang
Gyeong-Moon Park
CLL
41
11
0
23 Aug 2023
Metadata Improves Segmentation Through Multitasking Elicitation
Iaroslav Plutenko
Mikhail Papkov
K. Palo
L. Parts
D. Fishman
16
0
0
18 Aug 2023
Learning the meanings of function words from grounded language using a visual question answering model
Eva Portelance
Michael C. Frank
Dan Jurafsky
NAI
33
7
0
16 Aug 2023
Ranking-aware Uncertainty for Text-guided Image Retrieval
Junyang Chen
Hanjiang Lai
24
7
0
16 Aug 2023
Boosting Multi-modal Model Performance with Adaptive Gradient Modulation
Hong Li
Xingyu Li
Pengbo Hu
Yinuo Lei
Chunxiao Li
Yi Zhou
42
22
0
15 Aug 2023
CLE Diffusion: Controllable Light Enhancement Diffusion Model
Yuyang Yin
Dejia Xu
Chuangchuang Tan
P. Liu
Yao-Min Zhao
Yunchao Wei
DiffM
37
41
0
13 Aug 2023
PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers
Phillip Lippe
Bastiaan S. Veeling
P. Perdikaris
Richard Turner
Johannes Brandstetter
DiffM
AI4CE
33
78
0
10 Aug 2023
Separate Anything You Describe
Xubo Liu
Qiuqiang Kong
Yan Zhao
Haohe Liu
Yi Yuan
Yuzhuo Liu
Rui Xia
Yuxuan Wang
Mark D. Plumbley
Wenwu Wang
VLM
33
43
0
09 Aug 2023
The Five-Dollar Model: Generating Game Maps and Sprites from Sentence Embeddings
Timothy Merino
Roman Negri
Dipika Rajesh
M. Charity
Julian Togelius
DiffM
VLM
30
15
0
08 Aug 2023
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies
K. Chen
Yusong Wu
Haohe Liu
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
DiffM
44
75
0
03 Aug 2023
A supervised hybrid quantum machine learning solution to the emergency escape routing problem
Nathan Haboury
Mohammad Kordzanganeh
Sebastian Schmitt
Ayushma Joshi
Igor Tokarev
Lukas Abdallah
Andrii Kurkin
Basil Kyriacou
A. Melnikov
28
7
0
28 Jul 2023
ETHER: Aligning Emergent Communication for Hindsight Experience Replay
Kevin Denamganai
Daniel Hernández
Ozan Vardal
S. Missaoui
James Alfred Walker
31
0
0
28 Jul 2023
Provable Guarantees for Generative Behavior Cloning: Bridging Low-Level Stability and High-Level Behavior
Adam Block
Ali Jadbabaie
Daniel Pfrommer
Max Simchowitz
Russ Tedrake
DiffM
47
22
0
27 Jul 2023
Complete and separate: Conditional separation with missing target source attribute completion
Dimitrios Bralios
Efthymios Tzinis
Paris Smaragdis
35
0
0
27 Jul 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review
Maxime Fontana
Michael W. Spratling
Miaojing Shi
56
6
0
25 Jul 2023
INFINITY: Neural Field Modeling for Reynolds-Averaged Navier-Stokes Equations
Louis Serrano
Léon Migus
Yuan Yin
Jocelyn Ahmed Mazari
Patrick Gallinari
AI4CE
16
4
0
25 Jul 2023
Neural Image Compression: Generalization, Robustness, and Spectral Biases
Kelsey Lieberman
James Diffenderfer
Charles Godfrey
B. Kailkhura
32
4
0
17 Jul 2023
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
21
19
0
15 Jul 2023
Diffusion Models for Multi-target Adversarial Tracking
Sean Ye
Manisha Natarajan
Zixuan Wu
Matthew C. Gombolay
DiffM
39
3
0
12 Jul 2023
FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction
Gang Wang
Peng Zhang
Jun Xiong
Fei Yang
Wei Huang
Yufei Zha
CVBM
25
1
0
08 Jul 2023
Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning
Seungyong Moon
Junyoung Yeom
Bumsoo Park
Hyun Oh Song
OffRL
29
3
0
07 Jul 2023
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
Xiang Li
Varun Belagali
Jinghuan Shang
Michael S. Ryoo
43
28
0
04 Jul 2023
Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations
Anil Osman Tur
Nicola Dall’Asen
Cigdem Beyan
Elisa Ricci
DiffM
VGen
27
14
0
04 Jul 2023
Bidirectional Temporal Diffusion Model for Temporally Consistent Human Animation
Tserendorj Adiya
Jae Shin Yoon
Jungeun Lee
Sang-hyeon Kim
Hwasup Lim
DiffM
31
0
0
02 Jul 2023
MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation
Jun Chen
Wei Rao
Zehao Wang
Jiuxin Lin
Yukai Ju
Shulin He
Yannan Wang
Zhiyong Wu
16
10
0
28 Jun 2023