Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07871
Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FiLM: Visual Reasoning with a General Conditioning Layer"
50 / 1,315 papers shown
Title
End-to-End Binaural Speech Synthesis
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
32
8
0
08 Jul 2022
Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning
Kyra Ahrens
Matthias Kerzel
Jae Hee Lee
C. Weber
S. Wermter
21
0
0
06 Jul 2022
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Sam Spilsbury
Alexander Ilin
21
7
0
06 Jul 2022
Learning to Accelerate Approximate Methods for Solving Integer Programming via Early Fixing
Longkang Li
Baoyuan Wu
26
3
0
05 Jul 2022
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers
Liumeng Xue
Shan Yang
Na Hu
Dan Su
Linfu Xie
37
2
0
02 Jul 2022
SD-LayerNet: Semi-supervised retinal layer segmentation in OCT using disentangled representation with anatomical priors
Botond Fazekas
Guilherme Aresta
Dmitrii Lachinov
Sophie Riedl
Julia Mai
U. Schmidt-Erfurth
Hrvoje Bogunović
13
13
0
01 Jul 2022
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Xu Li
Shansong Liu
Ying Shan
37
13
0
28 Jun 2022
ContraReg: Contrastive Learning of Multi-modality Unsupervised Deformable Image Registration
Neel Dey
Jo Schlemper
S. Salehi
Bo Zhou
Guido Gerig
M. Sofka
MedIm
39
17
0
27 Jun 2022
Task-Adaptive Few-shot Node Classification
Song Wang
Kaize Ding
Chuxu Zhang
Chen Chen
Jundong Li
OffRL
38
48
0
23 Jun 2022
Jointist: Joint Learning for Multi-instrument Transcription and Its Applications
K. Cheuk
Keunwoo Choi
Qiuqiang Kong
Bochen Li
Minz Won
Amy Hung
Ju-Chiang Wang
Dorien Herremans
29
6
0
22 Jun 2022
Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification
Massimiliano Patacchiola
J. Bronskill
Aliaksandra Shysheya
Katja Hofmann
Sebastian Nowozin
Richard Turner
VLM
32
9
0
20 Jun 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Thomas Carta
Pierre-Yves Oudeyer
Olivier Sigaud
Sylvain Lamprier
OffRL
33
24
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard Turner
3DH
FedML
46
27
0
17 Jun 2022
Channel Importance Matters in Few-Shot Image Classification
Xu Luo
Jing Xu
Zenglin Xu
VLM
30
42
0
16 Jun 2022
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
55
128
0
14 Jun 2022
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
Yanpeng Sun
Qiang Chen
Xiangyu He
Jian Wang
Haocheng Feng
Junyu Han
Errui Ding
Jian Cheng
Zechao Li
Jingdong Wang
40
56
0
13 Jun 2022
AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields
Takuhiro Kaneko
37
14
0
13 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
25
49
0
11 Jun 2022
Feature-informed Embedding Space Regularization For Audio Classification
Yun-Ning Hung
Alexander Lerch
30
5
0
10 Jun 2022
Universal Speech Enhancement with Score-based Diffusion
Joan Serrà
Santiago Pascual
Jordi Pons
R. O. Araz
D. Scaini
DiffM
30
97
0
07 Jun 2022
FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation
Mehmet Özgür Türkoglu
Alexander Becker
H. Gündüz
Mina Rezaei
Bernd Bischl
Rodrigo Caye Daudt
Stefano Dáronco
Jan Dirk Wegner
Konrad Schindler
FedML
UQCV
48
25
0
31 May 2022
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
186
49
0
30 May 2022
A Continuous Time Framework for Discrete Denoising Models
Andrew Campbell
Joe Benton
Valentin De Bortoli
Tom Rainforth
George Deligiannidis
Arnaud Doucet
DiffM
197
137
0
30 May 2022
Visual Superordinate Abstraction for Robust Concept Learning
Qinjie Zheng
Chaoyue Wang
Dadong Wang
Dacheng Tao
VLM
28
2
0
28 May 2022
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Meng Yu
Yong-mei Xu
Chunlei Zhang
Shizhong Zhang
Dong Yu
28
11
0
20 May 2022
Meta-Learning Sparse Compression Networks
Jonathan Richard Schwarz
Yee Whye Teh
65
26
0
18 May 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language
Iou-Jen Liu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
Alex Schwing
RALM
26
12
0
12 May 2022
Feature Extractor Stacking for Cross-domain Few-shot Learning
Hongyu Wang
Eibe Frank
Bernhard Pfahringer
Michael Mayo
G. Holmes
21
4
0
12 May 2022
A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Sanghyun Yoo
Inchul Song
Yoshua Bengio
27
28
0
06 May 2022
What is Right for Me is Not Yet Right for You: A Dataset for Grounding Relative Directions via Multi-Task Learning
Jae Hee Lee
Matthias Kerzel
Kyra Ahrens
C. Weber
S. Wermter
45
9
0
05 May 2022
Few-Shot Musical Source Separation
Yu Wang
Daniel Stoller
Rachel M. Bittner
J. P. Bello
21
14
0
03 May 2022
Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Yida Zhao
Yuqing Song
Qin Jin
8
29
0
24 Apr 2022
Training and challenging models for text-guided fashion image retrieval
Eric Dodds
Jack Culpepper
Gaurav Srivastava
26
8
0
23 Apr 2022
KALA: Knowledge-Augmented Language Model Adaptation
Minki Kang
Jinheon Baek
Sung Ju Hwang
VLM
KELM
36
34
0
22 Apr 2022
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Yan Wang
Liujuan Cao
Yongjian Wu
Feiyue Huang
Rongrong Ji
ViT
22
43
0
16 Apr 2022
Sound Event Triage: Detecting Sound Events Considering Priority of Classes
Noriyuki Tonami
Keisuke Imoto
32
1
0
13 Apr 2022
Probabilistic Compositional Embeddings for Multimodal Image Retrieval
Andrei Neculai
Yanbei Chen
Zeynep Akata
CoGe
27
31
0
12 Apr 2022
Text-Driven Separation of Arbitrary Sounds
Kevin Kilgour
Beat Gfeller
Qingqing Huang
A. Jansen
Scott Wisdom
Marco Tagliasacchi
30
30
0
12 Apr 2022
Pareto Conditioned Networks
Mathieu Reymond
Eugenio Bargiacchi
Ann Nowé
4
16
0
11 Apr 2022
Canonical Mean Filter for Almost Zero-Shot Multi-Task classification
Yong Li
Heng Wang
Xiang Ye
17
0
0
08 Apr 2022
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Shaojin Ding
R. Rikhye
Qiao Liang
Yanzhang He
Quan Wang
A. Narayanan
Tom O'Malley
Ian McGraw
29
27
0
08 Apr 2022
Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
Kalyan Vasudev Alwala
Abhinav Gupta
Shubham Tulsiani
32
30
0
07 Apr 2022
Heterogeneous Target Speech Separation
Hyunjae Cho
Wonbin Jung
Junhyeok Lee
Paris Smaragdis
Sanghyun Woo
51
26
0
07 Apr 2022
Demonstrate Once, Imitate Immediately (DOME): Learning Visual Servoing for One-Shot Imitation Learning
Eugene Valassakis
Georgios Papagiannis
Norman Di Palo
Edward Johns
32
41
0
06 Apr 2022
Global HRTF Interpolation via Learned Affine Transformation of Hyper-conditioned Features
Jingeun Lee
Sungho Lee
Kyogu Lee
24
8
0
06 Apr 2022
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
Leonard Salewski
A. Sophia Koepke
Hendrik P. A. Lensch
Zeynep Akata
LRM
NAI
35
20
0
05 Apr 2022
A Survey on Graph Representation Learning Methods
Shima Khoshraftar
A. An
GNN
AI4TS
49
109
0
04 Apr 2022
Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis
Fan Wang
Po-Chun Hsu
Da-Rong Liu
Hung-yi Lee
18
0
0
01 Apr 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
29
13
0
31 Mar 2022
Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images
A. Tewari
R. MallikarjunB.
Xingang Pan
Ohad Fried
Maneesh Agrawala
Christian Theobalt
CoGe
3DV
DRL
23
51
0
29 Mar 2022
Previous
1
2
3
...
15
16
17
...
25
26
27
Next