v1v2 (latest)

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017

Aaron Courville

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,349 papers shown

Title
Advances in Neural Rendering A. Tewari Justus Thies B. Mildenhall P. Srinivasan E. Tretschk ... S. Fanello Jun Zhu Gordon Wetzstein Michael Zollhoefer D. B. Goldman 3DH AI4CE 170 457 0 10 Nov 2021
Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images Jianfei Guo Zhiyuan Yang Xi Lin Qingfu Zhang 3DH 96 5 0 08 Nov 2021
LILA: Language-Informed Latent Actions Siddharth Karamcheti Megha Srivastava Percy Liang Dorsa Sadigh LM&Ro 103 32 0 05 Nov 2021
Unsupervised Learning of Compositional Energy Concepts Yilun Du Shuang Li Yash Sharma J. Tenenbaum Igor Mordatch CoGe OCL 93 81 0 04 Nov 2021
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition Midia Yousefi John H.L. Hanse 28 5 0 30 Oct 2021
Cross-attention conformer for context modeling in speech enhancement for ASR A. Narayanan Chung-Cheng Chiu Tom O'Malley Quan Wang Yanzhang He 70 14 0 30 Oct 2021
A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis Xingang Pan Xudong Xu Chen Change Loy Christian Theobalt Bo Dai 3DH 112 89 0 29 Oct 2021
Node-wise Localization of Graph Neural Networks Zemin Liu Yuan Fang Chenghao Liu Guosheng Lin 88 25 0 27 Oct 2021
Revisit Multimodal Meta-Learning through the Lens of Multi-Task Learning Milad Abdollahzadeh Touba Malekzadeh Ngai-Man Cheung 67 29 0 27 Oct 2021
A Unified Survey on Anomaly, Novelty, Open-Set, and Out-of-Distribution Detection: Solutions and Future Challenges Mohammadreza Salehi Hossein Mirzaei Dan Hendrycks Yixuan Li M. Rohban Mohammad Sabokrou OOD 167 199 0 26 Oct 2021
Identifying and Benchmarking Natural Out-of-Context Prediction Problems David Madras D. Psaltis CML OOD 114 4 0 25 Oct 2021
SCHA-VAE: Hierarchical Context Aggregation for Few-Shot Generation Giorgio Giannone Fattane Pourakpour DRL BDL 57 10 0 23 Oct 2021
Fast Model Editing at Scale E. Mitchell Charles Lin Antoine Bosselut Chelsea Finn Christopher D. Manning KELM 363 379 0 21 Oct 2021
SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark Victor Zhong Austin W. Hanjie Sida Wang Karthik Narasimhan Luke Zettlemoyer 32 12 0 20 Oct 2021
CIPS-3D: A 3D-Aware Generator of GANs Based on Conditionally-Independent Pixel Synthesis Peng Zhou Lingxi Xie Bingbing Ni Qi Tian 92 175 0 19 Oct 2021
Towards Language-guided Visual Recognition via Dynamic Convolutions Gen Luo Yiyi Zhou Xiaoshuai Sun Yongjian Wu Yue Gao Rongrong Ji ObjD 98 19 0 17 Oct 2021
Truthful AI: Developing and governing AI that does not lie Owain Evans Owen Cotton-Barratt Lukas Finnveden Adam Bales Avital Balwit Peter Wills Luca Righetti William Saunders HILM 304 117 0 13 Oct 2021
Multistage linguistic conditioning of convolutional layers for speech emotion recognition Andreas Triantafyllopoulos U. Reichel Shuo Liu Simon Huber F. Eyben Björn W. Schuller 97 11 0 13 Oct 2021
Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning Sungyong Baik Janghoon Choi Heewon Kim Dohee Cho Jaesik Min Kyoung Mu Lee 87 104 0 08 Oct 2021
Self-Evolutionary Optimization for Pareto Front Learning Simyung Chang Kiyoon Yoo Jiho Jang Nojun Kwak 77 4 0 07 Oct 2021
Top-N: Equivariant set and graph generation without exchangeability Clément Vignac P. Frossard BDL 148 35 0 05 Oct 2021
DualNet: Continual Learning, Fast and Slow Quang Pham Chenghao Liu Guosheng Lin CLL 147 45 0 01 Oct 2021
Visually Grounded Concept Composition Bowen Zhang Hexiang Hu Linlu Qiu Peter Shaw Fei Sha CoGe 122 6 0 29 Sep 2021
Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II Michal Opanowicz 30 0 0 26 Sep 2021
Systematic Generalization on gSCAN: What is Nearly Solved and What is Next? Linlu Qiu Hexiang Hu Bowen Zhang Peter Shaw Fei Sha 86 21 0 25 Sep 2021
Recent Advances of Continual Learning in Computer Vision: An Overview Haoxuan Qu Hossein Rahmani Li Xu Bryan M. Williams Jun Liu VLM CLL 136 78 0 23 Sep 2021
Improving Scheduled Sampling with Elastic Weight Consolidation for Neural Machine Translation Michalis Korakakis Andreas Vlachos CLL 62 2 0 13 Sep 2021
Infusing Future Information into Monotonic Attention Through Language Models Mohd Abbas Zaidi S. Indurthi Beomseok Lee Nikhil Kumar Lakumarapu Sangha Kim 60 2 0 07 Sep 2021
Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention Katsuyuki Nakamura Hiroki Ohashi Mitsuhiro Okada EgoV 94 13 0 07 Sep 2021
Edge-featured Graph Neural Architecture Search Shaofei Cai Liang Li Xinzhe Han Zhengjun Zha Qingming Huang 54 7 0 03 Sep 2021
Learning Disentangled Representations in the Imaging Domain Xiao Liu Pedro Sanchez Spyridon Thermos Alison Q. OÑeil Sotirios A. Tsaftaris OOD DRL 198 72 0 26 Aug 2021
Self-Attention for Audio Super-Resolution Nathanaël Carraz Rakotonirina SupR 69 24 0 26 Aug 2021
CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation Xidong Feng Chong Chen Dong Li Mengchen Zhao Jianye Hao Jun Wang OffRL 74 24 0 24 Aug 2021
Unsupervised Image Generation with Infinite Generative Adversarial Networks Hui Ying He Wang Tianjia Shao Yin Yang Kun Zhou GAN 56 2 0 18 Aug 2021
Conditional Temporal Variational AutoEncoder for Action Video Prediction Xiaogang Xu Yi Wang Liwei Wang Bei Yu Jiaya Jia VGen 80 5 0 12 Aug 2021
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models Zheyuan Liu Cristian Rodriguez-Opazo Damien Teney Stephen Gould VLM 79 207 0 09 Aug 2021
FiLMing Multimodal Sarcasm Detection with Attention Sundesh Gupta Aditya Shah Miten Shah Laribok Syiemlieh Chandresh Kumar Maurya 59 13 0 09 Aug 2021
A Unified Model for Zero-shot Music Source Separation, Transcription and Synthesis Liwei Lin Qiuqiang Kong Junyan Jiang Gus Xia 66 26 0 07 Aug 2021
Multimodal Meta-Learning for Time Series Regression Sebastian Pineda-Arango Felix Heinrich Kiran Madhusudhanan Lars Schmidt-Thieme AI4TS 69 15 0 05 Aug 2021
Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech Synthesis Julian Zaïdi Hugo Seuté Benjamin van Niekerk M. Carbonneau 61 21 0 04 Aug 2021
Learn to Match: Automatic Matching Network Design for Visual Tracking Zhipeng Zhang Yihao Liu Tianlin Li Bing Li Weiming Hu 86 173 0 02 Aug 2021
Adaptive Denoising via GainTuning S. Mohan Joshua L. Vincent R. Manzorro Peter A Crozier Eero P. Simoncelli C. Fernandez‐Granda 74 24 0 27 Jul 2021
Greedy Gradient Ensemble for Robust Visual Question Answering Xinzhe Han Shuhui Wang Chi Su Qingming Huang Q. Tian 65 78 0 27 Jul 2021
Towards the Unseen: Iterative Text Recognition by Distilling from Errors A. Bhunia Pinaki Nath Chowdhury Aneeshan Sain Yi-Zhe Song 79 16 0 26 Jul 2021
ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback Mike Wu Noah D. Goodman Chris Piech Chelsea Finn 89 19 0 23 Jul 2021
Improving the Generalization of Meta-learning on Unseen Domains via Adversarial Shift Pinzhuo Tian Yao Gao OOD 51 1 0 23 Jul 2021
Neural Abstructions: Abstractions that Support Construction for Grounded Language Learning Kaylee Burns Christopher D. Manning Li Fei-Fei 53 0 0 20 Jul 2021
Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech C. Steinmetz V. Ithapu P. Calamia 83 40 0 15 Jul 2021
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning Paul Pu Liang Yiwei Lyu Xiang Fan Zetian Wu Yun Cheng ... Peter Wu Michelle A. Lee Yuke Zhu Ruslan Salakhutdinov Louis-Philippe Morency VLM 111 172 0 15 Jul 2021
Combining 3D Image and Tabular Data via the Dynamic Affine Feature Map Transform Sebastian Polsterl Tom Nuno Wolf Christian Wachinger MedIm 75 45 0 13 Jul 2021