Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,315 papers shown
Title
Few and Fewer: Learning Better from Few Examples Using Fewer Base Classes
Raphael Lafargue
Yassir Bendou
Bastien Pasdeloup
J. Diguet
Ian Reid
Vincent Gripon
Jack Valmadre
39
0
0
29 Jan 2024
MResT: Multi-Resolution Sensing for Real-Time Control with Vision-Language Models
Saumya Saxena
Mohit Sharma
Oliver Kroemer
34
4
0
25 Jan 2024
Diverse and Lifespan Facial Age Transformation Synthesis with Identity Variation Rationality Metric
Jiucheng Xie
Jun Yang
Wenqing Wang
Feng Xu
Jiang Xiong
Hao Gao
40
2
0
25 Jan 2024
Dynamic Long-Term Time-Series Forecasting via Meta Transformer Networks
M. A. Ma'sum
MD Rasel Sarkar
Mahardhika Pratama
Savitha Ramasamy
S. Anavatti
Lin Liu
Habibullah Habibullah
Ryszard Kowalczyk
AI4TS
32
0
0
25 Jan 2024
Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor
Younglo Lee
Shukjae Choi
Byeonghak Kim
Zhong-Qiu Wang
Shinji Watanabe
MoE
16
9
0
23 Jan 2024
MINT: A wrapper to make multi-modal and multi-image AI models interactive
Jan Freyberg
Abhijit Guha Roy
Terry Spitz
Beverly Freeman
M. Schaekermann
...
D. Webster
Alan Karthikesalingam
Yun-Hui Liu
Krishnamurthy Dvijotham
Umesh Telang
36
0
0
22 Jan 2024
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis
Yoonjin Chung
Junwon Lee
Juhan Nam
48
13
0
17 Jan 2024
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning
Sabariswaran Mani
Sreyas Venkataraman
Abhranil Chandra
Adyan Rizvi
Yash Sirvi
Soumojit Bhattacharya
Aritra Hazra
OffRL
34
1
0
17 Jan 2024
FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning
Jianlan Luo
Charles Xu
Fangchen Liu
Liam Tan
Zipeng Lin
Jeffrey Wu
Pieter Abbeel
Sergey Levine
37
27
0
16 Jan 2024
Multi-task real-robot data with gaze attention for dual-arm fine manipulation
Heecheol Kim
Y. Ohmura
Y. Kuniyoshi
38
2
0
15 Jan 2024
Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model
Taehee Kim
Yeongjae Cho
Heejun Shin
Yohan Jo
Dongmyung Shin
37
4
0
12 Jan 2024
Neural Population Learning beyond Symmetric Zero-sum Games
Siqi Liu
Luke Marris
Marc Lanctot
Georgios Piliouras
Joel Z Leibo
N. Heess
MLT
69
3
0
10 Jan 2024
Language-Conditioned Robotic Manipulation with Fast and Slow Thinking
Minjie Zhu
Yichen Zhu
Jinming Li
Junjie Wen
Zhiyuan Xu
...
Chaomin Shen
Yaxin Peng
Dong Liu
Feifei Feng
Jian Tang
LM&Ro
35
15
0
08 Jan 2024
StreamVC: Real-Time Low-Latency Voice Conversion
Yang Yang
Y. Kartynnik
Yunpeng Li
Jiuqiang Tang
Xing Li
George Sung
Matthias Grundmann
30
12
0
05 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
125
239
0
05 Jan 2024
aMUSEd: An Open MUSE Reproduction
Suraj Patil
William Berman
Robin Rombach
Patrick von Platen
VLM
25
18
0
03 Jan 2024
Balanced Multi-modal Federated Learning via Cross-Modal Infiltration
Yunfeng Fan
Wenchao Xu
Yining Qi
Jiaqi Zhu
Song Guo
34
0
0
31 Dec 2023
Classifier-free graph diffusion for molecular property targeting
Matteo Ninniri
Marco Podda
Davide Bacciu
40
5
0
28 Dec 2023
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
Rongyu Zhang
Yulin Luo
Jiaming Liu
Huanrui Yang
Zhen Dong
...
Tomoyuki Okuno
Yohei Nakata
Kurt Keutzer
Yuan Du
Shanghang Zhang
MoMe
MoE
40
3
0
27 Dec 2023
Active Third-Person Imitation Learning
Timo Klein
Susanna Weinberger
Adish Singla
Sebastian Tschiatschek
28
1
0
27 Dec 2023
Personalized Federated Learning with Contextual Modulation and Meta-Learning
Anna Vettoruzzo
Mohamed-Rafik Bouguelia
Thorsteinn Rögnvaldsson
FedML
25
1
0
23 Dec 2023
Towards End-to-End Structure Solutions from Information-Compromised Diffraction Data via Generative Deep Learning
Gabriel Guo
Judah Goldfeder
Ling Lan
Aniv Ray
Albert Hanming Yang
Boyuan Chen
S. Billinge
Hod Lipson
30
3
0
23 Dec 2023
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
Pengxiang Ding
Han Zhao
Wenxuan Song
Zhitao Wang
Zhenyu Wei
Shangke Lyu
Ningxi Yang
Donglin Wang
34
19
0
22 Dec 2023
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
Saurabh Saxena
Junhwa Hur
Charles Herrmann
Deqing Sun
David J. Fleet
DiffM
41
26
0
20 Dec 2023
Diffusion Models With Learned Adaptive Noise
S. Sahoo
Aaron Gokaslan
Christopher De Sa
Volodymyr Kuleshov
DiffM
36
8
0
20 Dec 2023
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Stanislaw Szymanowicz
Christian Rupprecht
Andrea Vedaldi
3DGS
48
176
0
20 Dec 2023
Leveraging Normalization Layer in Adapters With Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning
Yongjin Yang
Taehyeon Kim
SeYoung Yun
35
4
0
18 Dec 2023
GraspLDM: Generative 6-DoF Grasp Synthesis using Latent Diffusion Models
K. R. Barad
Andrej Orsula
Antoine Richard
Jan Dentler
Miguel Olivares-Mendez
Carol Martinez
34
16
0
18 Dec 2023
How to Train Neural Field Representations: A Comprehensive Study and Benchmark
Samuele Papa
Riccardo Valperga
David M. Knigge
Miltiadis Kofinas
Phillip Lippe
J. Sonke
E. Gavves
31
7
0
16 Dec 2023
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Shivangi Aneja
Justus Thies
Angela Dai
Matthias Nießner
DiffM
VGen
40
29
0
13 Dec 2023
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI
Kai Huang
Boyuan Yang
Wei Gao
37
1
0
13 Dec 2023
More than Vanilla Fusion: a Simple, Decoupling-free, Attention Module for Multimodal Fusion Based on Signal Theory
Peiwen Sun
Yifan Zhang
Zishan Liu
Donghao Chen
Honggang Zhang
24
0
0
12 Dec 2023
One-Step Diffusion Distillation via Deep Equilibrium Models
Zhengyang Geng
Ashwini Pokle
Trevor Killeen
34
28
0
12 Dec 2023
Photorealistic Video Generation with Diffusion Models
Agrim Gupta
Lijun Yu
Kihyuk Sohn
Xiuye Gu
Meera Hahn
Fei-Fei Li
Irfan Essa
Lu Jiang
José Lezama
VGen
59
177
0
11 Dec 2023
Spatial and Temporal Hierarchy for Autonomous Navigation using Active Inference in Minigrid Environment
Daria de Tinguy
Toon Van de Maele
Tim Verbelen
Bart Dhoedt
38
6
0
08 Dec 2023
Neural Concatenative Singing Voice Conversion: Rethinking Concatenation-Based Approach for One-Shot Singing Voice Conversion
Binzhu Sha
Xu Li
Zhiyong Wu
Yin Shan
Helen M. Meng
23
7
0
08 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
39
38
0
07 Dec 2023
Guided Reconstruction with Conditioned Diffusion Models for Unsupervised Anomaly Detection in Brain MRIs
F. Behrendt
Debayan Bhattacharya
R. Mieling
Lennart Maack
Julia Kruger
R. Opfer
Alexander Schlaefer
DiffM
MedIm
25
10
0
07 Dec 2023
Scaling transformer neural networks for skillful and reliable medium-range weather forecasting
Tung Nguyen
Rohan Shah
Hritik Bansal
T. Arcomano
Sandeep Madireddy
R. Maulik
V. Kotamarthi
Ian Foster
Aditya Grover
AI4TS
19
58
0
06 Dec 2023
C3: High-performance and low-complexity neural compression from a single image or video
Hyunjik Kim
Matthias Bauer
Lucas Theis
Jonathan Richard Schwarz
Emilien Dupont
VGen
25
24
0
05 Dec 2023
DiffiT: Diffusion Vision Transformers for Image Generation
Ali Hatamizadeh
Jiaming Song
Guilin Liu
Jan Kautz
Arash Vahdat
39
67
0
04 Dec 2023
Diffusion Models Without Attention
Jing Nathan Yan
Jiatao Gu
Alexander M. Rush
32
61
0
30 Nov 2023
Transfer Learning in Robotics: An Upcoming Breakthrough? A Review of Promises and Challenges
Noémie Jaquier
Michael C. Welle
A. Gams
Kunpeng Yao
Bernardo Fichera
A. Billard
Aleš Ude
Tamim Asfour
Danica Kragic
35
14
0
29 Nov 2023
SODA: Bottleneck Diffusion Models for Representation Learning
Drew A. Hudson
Daniel Zoran
Mateusz Malinowski
Andrew Kyle Lampinen
Andrew Jaegle
James L. McClelland
Loic Matthey
Felix Hill
Alexander Lerchner
DiffM
30
48
0
29 Nov 2023
Task adaption by biologically inspired stochastic comodulation
Gauthier Boeshertz
Caroline Haimerl
Cristina Savin
38
0
0
25 Nov 2023
Coordinate-Aware Modulation for Neural Fields
J. Lee
Daniel Rho
Seungtae Nam
Jong Hwan Ko
Eunbyung Park
24
5
0
25 Nov 2023
GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar
Berna Kabadayi
Wojciech Zielonka
Bharat Lal Bhatnagar
Gerard Pons-Moll
Justus Thies
3DH
40
7
0
22 Nov 2023
Self-Supervised Music Source Separation Using Vector-Quantized Source Category Estimates
Marco Pasini
Stefan Lattner
George Fazekas
35
1
0
21 Nov 2023
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
Ahmed Hendawy
Jan Peters
Carlo DÉramo
MoE
33
15
0
19 Nov 2023
Multimodal Representation Learning by Alternating Unimodal Adaptation
Xiaohui Zhang
Jaehong Yoon
Mohit Bansal
Huaxiu Yao
34
22
0
17 Nov 2023
Previous
1
2
3
...
8
9
10
...
25
26
27
Next