Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,315 papers shown
Title
Attribute Diversity Determines the Systematicity Gap in VQA
Ian Berlot-Attwell
Kumar Krishna Agrawal
A. M. Carrell
Yash Sharma
Naomi Saphra
31
1
0
15 Nov 2023
Large Language Models for Robotics: A Survey
Fanlong Zeng
Wensheng Gan
Yongheng Wang
Ning Liu
Philip S. Yu
LM&Ro
124
127
0
13 Nov 2023
Personalizing Keyword Spotting with Speaker Information
Beltrán Labrador
Pai Zhu
Guanlong Zhao
Angelo Scorza Scarpati
Quan Wang
Alicia Lozano-Diez
Alex Park
Ignacio López Moreno
26
1
0
06 Nov 2023
Sparse Training of Discrete Diffusion Models for Graph Generation
Yiming Qin
Clément Vignac
Pascal Frossard
27
12
0
03 Nov 2023
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos
Te-Lin Wu
Zi-Yi Dou
Qingyuan Hu
Yu Hou
Nischal Reddy Chandra
Marjorie Freedman
R. Weischedel
Nanyun Peng
39
5
0
02 Nov 2023
E3 TTS: Easy End-to-End Diffusion-based Text to Speech
Yuan Gao
Nobuyuki Morioka
Yu Zhang
Nanxin Chen
DiffM
31
27
0
02 Nov 2023
Adaptive Latent Diffusion Model for 3D Medical Image to Image Translation: Multi-modal Magnetic Resonance Imaging Study
Jonghun Kim
Hyunjin Park
MedIm
21
33
0
01 Nov 2023
Latent Field Discovery In Interacting Dynamical Systems With Neural Fields
Miltiadis Kofinas
Erik J. Bekkers
N. S. Nagaraja
E. Gavves
AI4CE
39
7
0
31 Oct 2023
Sim2Real for Environmental Neural Processes
Jonas Scholz
Tom R. Andersson
Anna Vaughan
James Requeima
Richard Turner
26
3
0
30 Oct 2023
A Survey on Knowledge Editing of Neural Networks
Vittorio Mazzia
Alessandro Pedrani
Andrea Caciolai
Kay Rottmann
Davide Bernardi
KELM
20
24
0
30 Oct 2023
Generative Neural Fields by Mixtures of Neural Implicit Functions
Tackgeun You
Mijeong Kim
Jungtaek Kim
Bohyung Han
DiffM
22
5
0
30 Oct 2023
Controllable Group Choreography using Contrastive Diffusion
Nhat Le
Tuong Khanh Long Do
Khoa Do
Hien Nguyen
Erman Tjiputra
Quang-Dieu Tran
Anh Nguyen
48
11
0
29 Oct 2023
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
Xingrui Wang
Wufei Ma
Zhuowan Li
Adam Kortylewski
Alan Yuille
CoGe
27
12
0
27 Oct 2023
HyperFields: Towards Zero-Shot Generation of NeRFs from Text
Sudarshan Babu
Richard Liu
Avery Zhou
Michael Maire
Greg Shakhnarovich
Rana Hanocka
AI4CE
19
11
0
26 Oct 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
26
10
0
25 Oct 2023
Visually Grounded Continual Language Learning with Selective Specialization
Kyra Ahrens
Lennart Bengtson
Jae Hee Lee
Stefan Wermter
29
0
0
24 Oct 2023
Inferring Relational Potentials in Interacting Systems
Armand Comas Massagu´e
Yilun Du
Christian Fernández
S. Ghimire
Octavia Camps
J. Tenenbaum
Mario Sznaier
34
4
0
23 Oct 2023
Towards a General Framework for Continual Learning with Pre-training
Liyuan Wang
Jingyi Xie
Xingxing Zhang
Hang Su
Jun Zhu
CLL
37
3
0
21 Oct 2023
Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers
Osman Batur .Ince
Tanin Zeraati
Semih Yagcioglu
Yadollah Yaghoobzadeh
Erkut Erdem
Aykut Erdem
24
1
0
18 Oct 2023
BUT CHiME-7 system description
M. Karafiát
Karel Veselý
Igor Szöke
Ladislav Mošner
Karel Beneš
Marcin Witkowski
Germán Barchi
L. Pepino
35
1
0
18 Oct 2023
Blind estimation of audio effects using an auto-encoder approach and differentiable digital signal processing
Come Peladeau
Geoffroy Peeters
35
6
0
18 Oct 2023
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models
Kevin Black
Mitsuhiko Nakamoto
P. Atreya
Homer Walke
Chelsea Finn
Aviral Kumar
Sergey Levine
DiffM
LM&Ro
35
132
0
16 Oct 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
39
14
0
16 Oct 2023
Subspace Adaptation Prior for Few-Shot Learning
Mike Huisman
Aske Plaat
Jan N. van Rijn
VLM
37
2
0
13 Oct 2023
Adaptivity and Modularity for Efficient Generalization Over Task Complexity
Samira Abnar
Omid Saremi
Laurent Dinh
Shantel Wilson
Miguel Angel Bautista
...
Vimal Thilak
Etai Littwin
Jiatao Gu
Josh Susskind
Samy Bengio
41
5
0
13 Oct 2023
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Open X-Embodiment Collaboration
Abby OÑeill
Abdul Rehman
Abhinav Gupta
Abhiram Maddukuri
...
Zhuo Xu
Zichen Jeff Cui
Zichen Zhang
Zipeng Fu
Zipeng Lin
LM&Ro
56
467
0
13 Oct 2023
In-Context Learning for Few-Shot Molecular Property Prediction
Christopher Fifty
J. Leskovec
Sebastian Thrun
36
5
0
13 Oct 2023
A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction
Kohei Saijo
Wangyou Zhang
Zhong-Qiu Wang
Shinji Watanabe
Tetsunori Kobayashi
Tetsuji Ogawa
VLM
28
6
0
12 Oct 2023
DiPPeR: Diffusion-based 2D Path Planner applied on Legged Robots
Jianwei Liu
Maria Stamatopoulou
Dimitrios Kanoulas
10
17
0
11 Oct 2023
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation
Xinyu Sun
Peihao Chen
Jugang Fan
Thomas H. Li
Jian Chen
Mingkui Tan
32
12
0
11 Oct 2023
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models
Zuxin Liu
Jesse Zhang
Kavosh Asadi
Yao Liu
Ding Zhao
Shoham Sabach
Rasool Fakoor
ALM
AI4CE
23
26
0
09 Oct 2023
Learning to Predict Structural Vibrations
J. V. Delden
Julius Schultz
Christopher Blech
Sabine C. Langer
Timo Luddecke
AI4CE
29
1
0
09 Oct 2023
Task Aware Modulation using Representation Learning: An Approach for Few Shot Learning in Heterogeneous Systems
Arvind Renganathan
Rahul Ghosh
A. Khandelwal
Vipin Kumar
25
0
0
07 Oct 2023
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Jiacheng Zhu
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
28
12
0
05 Oct 2023
Pose-Free Generalizable Rendering Transformer
Zhiwen Fan
Panwang Pan
Peihao Wang
Yi Ding
Hanwen Jiang
Dejia Xu
Zehao Zhu
Dilin Wang
Zhangyang Wang
32
2
0
05 Oct 2023
PACIA: Parameter-Efficient Adapter for Few-Shot Molecular Property Prediction
Shiguang Wu
Yaqing Wang
Quanming Yao
27
4
0
01 Oct 2023
PixArt-
α
α
α
: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
39
392
0
30 Sep 2023
RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations
Carlos Rombaldo Junior
Ingolf Becker
Zongyu Guo
Shane Johnson
11
12
0
29 Sep 2023
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture
Zixuan Chen
Ze Ji
Shuyang Liu
Jing Huo
Yiyu Chen
Yang Gao
13
1
0
28 Sep 2023
Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback
Teresa Yeo
Oğuzhan Fatih Kar
Zahra Sodagar
Amir Zamir
TTA
OOD
31
3
0
27 Sep 2023
Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription
Frank Cwitkowitz
K. Cheuk
Woosung Choi
Marco A. Martínez-Ramírez
Keisuke Toyama
Wei-Hsiang Liao
Yuki Mitsufuji
42
5
0
27 Sep 2023
NoSENSE: Learned unrolled cardiac MRI reconstruction without explicit sensitivity maps
F. Zimmermann
A. Kofler
35
1
0
27 Sep 2023
SPGM: Prioritizing Local Features for enhanced speech separation performance
J. Yip
Shengkui Zhao
Yukun Ma
Chongjia Ni
Chong Zhang
...
Trung Hieu Nguyen
Kun Zhou
Dianwen Ng
Eng Siong Chng
B. Ma
MoE
VLM
33
4
0
22 Sep 2023
Learning to Drive Anywhere
Ruizhao Zhu
Peng Huang
Eshed Ohn-Bar
Venkatesh Saligrama
45
6
0
21 Sep 2023
Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis
Ben Maman
Johannes Zeitler
Meinard Muller
Amit H. Bermano
DiffM
22
4
0
21 Sep 2023
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
Yevgen Chebotar
Q. Vuong
A. Irpan
Karol Hausman
F. Xia
...
Brianna Zitkovich
Tomas Jackson
Kanishka Rao
Chelsea Finn
Sergey Levine
OffRL
131
81
0
18 Sep 2023
Speeding Up Speech Synthesis In Diffusion Models By Reducing Data Distribution Recovery Steps Via Content Transfer
Peter Ochieng
DiffM
30
0
0
18 Sep 2023
Latent assimilation with implicit neural representations for unknown dynamics
Zhuoyuan Li
Bin Dong
Pingwen Zhang
AI4CE
24
3
0
18 Sep 2023
Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents
Carson Yu Liu
Gelareh Mohammadi
Yang Song
W. Johal
15
2
0
17 Sep 2023
D3: Data Diversity Design for Systematic Generalization in Visual Question Answering
Amir Rahimi
Vanessa D’Amario
Moyuru Yamada
Kentaro Takemoto
Tomotake Sasaki
Xavier Boix
41
1
0
15 Sep 2023
Previous
1
2
3
...
9
10
11
...
25
26
27
Next