ResearchTrend.AI
FiLM: Visual Reasoning with a General Conditioning Layer


22 September 2017
Ethan Perez
Florian Strub
Harm de Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,315 papers shown
Title
End-to-End Binaural Speech Synthesis
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
32
8
0
08 Jul 2022
Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning
Kyra Ahrens
Matthias Kerzel
Jae Hee Lee
C. Weber
S. Wermter
21
0
0
06 Jul 2022
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Sam Spilsbury
Alexander Ilin
21
7
0
06 Jul 2022
Learning to Accelerate Approximate Methods for Solving Integer Programming via Early Fixing
Longkang Li
Baoyuan Wu
26
3
0
05 Jul 2022
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers
Liumeng Xue
Shan Yang
Na Hu
Dan Su
Linfu Xie
37
2
0
02 Jul 2022
SD-LayerNet: Semi-supervised retinal layer segmentation in OCT using disentangled representation with anatomical priors
Botond Fazekas
Guilherme Aresta
Dmitrii Lachinov
Sophie Riedl
Julia Mai
U. Schmidt-Erfurth
Hrvoje Bogunović
13
13
0
01 Jul 2022
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Xu Li
Shansong Liu
Ying Shan
37
13
0
28 Jun 2022
ContraReg: Contrastive Learning of Multi-modality Unsupervised Deformable Image Registration
Neel Dey
Jo Schlemper
S. Salehi
Bo Zhou
Guido Gerig
M. Sofka
MedIm
39
17
0
27 Jun 2022
Task-Adaptive Few-shot Node Classification
Song Wang
Kaize Ding
Chuxu Zhang
Chen Chen
Jundong Li
OffRL
38
48
0
23 Jun 2022
Jointist: Joint Learning for Multi-instrument Transcription and Its Applications
K. Cheuk
Keunwoo Choi
Qiuqiang Kong
Bochen Li
Minz Won
Amy Hung
Ju-Chiang Wang
Dorien Herremans
29
6
0
22 Jun 2022
Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification
Massimiliano Patacchiola
J. Bronskill
Aliaksandra Shysheya
Katja Hofmann
Sebastian Nowozin
Richard Turner
VLM
32
9
0
20 Jun 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Thomas Carta
Pierre-Yves Oudeyer
Olivier Sigaud
Sylvain Lamprier
OffRL
33
24
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard Turner
3DH
FedML
46
27
0
17 Jun 2022
Channel Importance Matters in Few-Shot Image Classification
Xu Luo
Jing Xu
Zenglin Xu
VLM
30
42
0
16 Jun 2022
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
55
128
0
14 Jun 2022
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
Yanpeng Sun
Qiang Chen
Xiangyu He
Jian Wang
Haocheng Feng
Junyu Han
Errui Ding
Jian Cheng
Zechao Li
Jingdong Wang
40
56
0
13 Jun 2022
AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields
Takuhiro Kaneko
37
14
0
13 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
25
49
0
11 Jun 2022
Feature-informed Embedding Space Regularization For Audio Classification
Yun-Ning Hung
Alexander Lerch
30
5
0
10 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
30
97
0
07 Jun 2022
FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation
Mehmet Özgür Türkoglu
Alexander Becker
H. Gündüz
Mina Rezaei
Bernd Bischl
Rodrigo Caye Daudt
Stefano D'Aronco
Jan Dirk Wegner
Konrad Schindler
FedML
UQCV
48
25
0
31 May 2022
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
186
49
0
30 May 2022
A Continuous Time Framework for Discrete Denoising Models
Andrew Campbell
Joe Benton
Valentin De Bortoli
Tom Rainforth
George Deligiannidis
Arnaud Doucet
DiffM
197
137
0
30 May 2022
Visual Superordinate Abstraction for Robust Concept Learning
Qinjie Zheng
Chaoyue Wang
Dadong Wang
Dacheng Tao
VLM
28
2
0
28 May 2022
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Meng Yu
Yong-mei Xu
Chunlei Zhang
Shizhong Zhang
Dong Yu
28
11
0
20 May 2022
Meta-Learning Sparse Compression Networks
Jonathan Richard Schwarz
Yee Whye Teh
65
26
0
18 May 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language
Iou-Jen Liu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
Alex Schwing
RALM
26
12
0
12 May 2022
Feature Extractor Stacking for Cross-domain Few-shot Learning
Hongyu Wang
Eibe Frank
Bernhard Pfahringer
Michael Mayo
G. Holmes
21
4
0
12 May 2022
A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Sanghyun Yoo
Inchul Song
Yoshua Bengio
27
28
0
06 May 2022
What is Right for Me is Not Yet Right for You: A Dataset for Grounding Relative Directions via Multi-Task Learning
Jae Hee Lee
Matthias Kerzel
Kyra Ahrens
C. Weber
S. Wermter
45
9
0
05 May 2022
Few-Shot Musical Source Separation
Yu Wang
Daniel Stoller
Rachel M. Bittner
J. P. Bello
21
14
0
03 May 2022
Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Yida Zhao
Yuqing Song
Qin Jin
8
29
0
24 Apr 2022
Training and challenging models for text-guided fashion image retrieval
Eric Dodds
Jack Culpepper
Gaurav Srivastava
26
8
0
23 Apr 2022
KALA: Knowledge-Augmented Language Model Adaptation
Minki Kang
Jinheon Baek
Sung Ju Hwang
VLM
KELM
36
34
0
22 Apr 2022
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Yan Wang
Liujuan Cao
Yongjian Wu
Feiyue Huang
Rongrong Ji
ViT
22
43
0
16 Apr 2022
Sound Event Triage: Detecting Sound Events Considering Priority of Classes
Noriyuki Tonami
Keisuke Imoto
32
1
0
13 Apr 2022
Probabilistic Compositional Embeddings for Multimodal Image Retrieval
Andrei Neculai
Yanbei Chen
Zeynep Akata
CoGe
27
31
0
12 Apr 2022
Text-Driven Separation of Arbitrary Sounds
Kevin Kilgour
Beat Gfeller
Qingqing Huang
A. Jansen
Scott Wisdom
Marco Tagliasacchi
30
30
0
12 Apr 2022
Pareto Conditioned Networks
Mathieu Reymond
Eugenio Bargiacchi
Ann Nowé
4
16
0
11 Apr 2022
Canonical Mean Filter for Almost Zero-Shot Multi-Task classification
Yong Li
Heng Wang
Xiang Ye
17
0
0
08 Apr 2022
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Shaojin Ding
R. Rikhye
Qiao Liang
Yanzhang He
Quan Wang
A. Narayanan
Tom O'Malley
Ian McGraw
29
27
0
08 Apr 2022
Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
Kalyan Vasudev Alwala
Abhinav Gupta
Shubham Tulsiani
32
30
0
07 Apr 2022
Heterogeneous Target Speech Separation
Hyunjae Cho
Wonbin Jung
Junhyeok Lee
Paris Smaragdis
Sanghyun Woo
51
26
0
07 Apr 2022
Demonstrate Once, Imitate Immediately (DOME): Learning Visual Servoing for One-Shot Imitation Learning
Eugene Valassakis
Georgios Papagiannis
Norman Di Palo
Edward Johns
32
41
0
06 Apr 2022
Global HRTF Interpolation via Learned Affine Transformation of Hyper-conditioned Features
Jingeun Lee
Sungho Lee
Kyogu Lee
24
8
0
06 Apr 2022
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
Leonard Salewski
A. Sophia Koepke
Hendrik P. A. Lensch
Zeynep Akata
LRM
NAI
35
20
0
05 Apr 2022
A Survey on Graph Representation Learning Methods
Shima Khoshraftar
A. An
GNN
AI4TS
49
109
0
04 Apr 2022
Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis
Fan Wang
Po-Chun Hsu
Da-Rong Liu
Hung-yi Lee
18
0
0
01 Apr 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
29
13
0
31 Mar 2022
Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images
A. Tewari
Mallikarjun B R
Xingang Pan
Ohad Fried
Maneesh Agrawala
Christian Theobalt
CoGe
3DV
DRL
23
51
0
29 Mar 2022