ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,313 papers shown
Title
Modeling Analog Dynamic Range Compressors using Deep Learning and
  State-space Models
Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models
Hanzhi Yin
Gang Cheng
Christian J. Steinmetz
Ruibin Yuan
Richard M. Stern
Roger B. Dannenberg
32
6
0
24 Mar 2024
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffM
40
18
0
22 Mar 2024
GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped
  Robot
GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot
Wenxuan Song
Han Zhao
Pengxiang Ding
Can Cui
Shangke Lyu
Yaning Fan
Donglin Wang
OffRL
35
11
0
20 Mar 2024
HyperFusion: A Hypernetwork Approach to Multimodal Integration of Tabular and Medical Imaging Data for Predictive Modeling
HyperFusion: A Hypernetwork Approach to Multimodal Integration of Tabular and Medical Imaging Data for Predictive Modeling
Daniel Duenias
Brennan Nichyporuk
Tal Arbel
Tammy Riklin-Raviv
42
3
0
20 Mar 2024
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization
  with Vision-Language Models
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Elaine Sui
Xiaohan Wang
Serena Yeung-Levy
VLM
30
5
0
19 Mar 2024
Vid2Robot: End-to-end Video-conditioned Policy Learning with
  Cross-Attention Transformers
Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Vidhi Jain
Maria Attarian
Nikhil J. Joshi
Ayzaan Wahid
Danny Driess
...
Stefan Welker
Christine Chan
Igor Gilitschenski
Yonatan Bisk
Debidatta Dwibedi
68
29
0
19 Mar 2024
Graph Neural Networks for Learning Equivariant Representations of Neural
  Networks
Graph Neural Networks for Learning Equivariant Representations of Neural Networks
Miltiadis Kofinas
Boris Knyazev
Yan Zhang
Yunlu Chen
Gertjan J. Burghouts
E. Gavves
Cees G. M. Snoek
David W. Zhang
46
29
0
18 Mar 2024
Efficient Trajectory Forecasting and Generation with Conditional Flow
  Matching
Efficient Trajectory Forecasting and Generation with Conditional Flow Matching
Sean Ye
Matthew C. Gombolay
43
2
0
16 Mar 2024
Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation
  Guided by the Characteristic Dance Primitives
Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Ronghui Li
YuXiang Zhang
Yachao Zhang
Hongwen Zhang
Jie Guo
Yan Zhang
Yebin Liu
Xiu Li
DiffM
49
28
0
15 Mar 2024
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient
  Task Adaptation
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
Yizhe Xiong
Hui Chen
Tianxiang Hao
Zijia Lin
Jungong Han
Yuesong Zhang
Guoxin Wang
Yongjun Bao
Guiguang Ding
51
17
0
14 Mar 2024
Unleashing the Power of Meta-tuning for Few-shot Generalization Through
  Sparse Interpolated Experts
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
Shengzhuang Chen
Jihoon Tack
Yunqiao Yang
Yee Whye Teh
Jonathan Richard Schwarz
Ying Wei
MoE
43
1
0
13 Mar 2024
NaturalVLM: Leveraging Fine-grained Natural Language for
  Affordance-Guided Visual Manipulation
NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
Ran Xu
Yan Shen
Xiaoqi Li
Ruihai Wu
Hao Dong
LM&Ro
30
9
0
13 Mar 2024
Semantic Residual Prompts for Continual Learning
Semantic Residual Prompts for Continual Learning
Martin Menabue
Emanuele Frascaroli
Matteo Boschini
E. Sangineto
Lorenzo Bonicelli
Angelo Porrello
Simone Calderara
CLL
VLM
43
9
0
11 Mar 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
59
72
0
08 Mar 2024
Discriminative Sample-Guided and Parameter-Efficient Feature Space
  Adaptation for Cross-Domain Few-Shot Learning
Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning
Rashindrie Perera
Saman K. Halgamuge
48
2
0
07 Mar 2024
Online Adaptation of Language Models with a Memory of Amortized Contexts
Online Adaptation of Language Models with a Memory of Amortized Contexts
Jihoon Tack
Jaehyung Kim
Eric Mitchell
Jinwoo Shin
Yee Whye Teh
Jonathan Richard Schwarz
KELM
50
18
0
07 Mar 2024
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
Ge Yan
Yueh-hua Wu
Xiaolong Wang
VGen
37
20
0
07 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
37
26
0
05 Mar 2024
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by
  Magnitude Conditioning
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning
Kuan-Hsun Ho
J. Hung
Berlin Chen
42
0
0
04 Mar 2024
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
Niccolò Cavagnero
Gabriele Rosi
Claudia Cuttano
Francesca Pistilli
Marco Ciccone
Giuseppe Averta
Fabio Cermelli
54
21
0
29 Feb 2024
Boosting Neural Representations for Videos with a Conditional Decoder
Boosting Neural Representations for Videos with a Conditional Decoder
Xinjie Zhang
Ren Yang
Dailan He
Xingtong Ge
Tongda Xu
Yan Wang
Hongwei Qin
Jun Zhang
38
15
0
28 Feb 2024
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization
Yoshiki Masuyama
G. Wichern
François Germain
Zexu Pan
Sameer Khurana
Chiori Hori
Jonathan Le Roux
49
3
0
27 Feb 2024
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence
  Generation
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation
Zongying Lin
Hao Li
Liuzhenghao Lv
Lin Bin
Junwu Zhang
Calvin Yu-Chian Chwn
Li Yuan
Tian Yonghong
31
3
0
27 Feb 2024
Achievable Fairness on Your Data With Utility Guarantees
Achievable Fairness on Your Data With Utility Guarantees
Muhammad Faaiz Taufiq
Jean-François Ton
Yang Liu
29
1
0
27 Feb 2024
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Benjamin Bergner
Andrii Skliar
Amelie Royer
Tijmen Blankevoort
Yuki Markus Asano
B. Bejnordi
58
5
0
26 Feb 2024
Closing the AI generalization gap by adjusting for dermatology condition
  distribution differences across clinical settings
Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings
R. Rikhye
Aaron Loh
G. Hong
Preeti Singh
M. A. Smith
...
P. Bui
Yuan Liu
Yun-Hui Liu
Justin M. Ko
Steven Lin
27
1
0
23 Feb 2024
Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis
  with Diffusion Models
Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models
Shunyu Liu
Jie Zhou
Qunxi Zhu
Qin Chen
Qingchun Bai
Junhua Xiao
Liang He
24
2
0
23 Feb 2024
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding
  Decomposition
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
Rendi Chevi
Alham Fikri Aji
25
2
0
22 Feb 2024
ISCUTE: Instance Segmentation of Cables Using Text Embedding
ISCUTE: Instance Segmentation of Cables Using Text Embedding
Shir Kozlovsky
O. Joglekar
Dotan Di Castro
32
2
0
19 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
39
108
0
16 Feb 2024
PRISE: LLM-Style Sequence Compression for Learning Temporal Action
  Abstractions in Control
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
Ruijie Zheng
Ching-An Cheng
Hal Daumé
Furong Huang
Andrey Kolobov
33
9
0
16 Feb 2024
TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial
  Network for early-to-late frame conversion in dynamic cardiac PET inter-frame
  motion correction
TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction
Xueqi Guo
Luyao Shi
Xiongchao Chen
Qiong Liu
Bo Zhou
...
Albert J Sinusas
Lawrence H. Staib
Bruce Spottiswoode
Chi Liu
Nicha Dvornek
MedIm
24
1
0
14 Feb 2024
Learning by Watching: A Review of Video-based Learning Approaches for
  Robot Manipulation
Learning by Watching: A Review of Video-based Learning Approaches for Robot Manipulation
Chrisantus Eze
Christopher Crick
SSL
82
12
0
11 Feb 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask
  Representation via Temporal Action-Driven Contrastive Loss
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Shuang Ma
Hal Daumé
Huazhe Xu
John Langford
Praveen Palanisamy
Kalyan Shankar Basu
Furong Huang
40
5
0
09 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
51
45
0
08 Feb 2024
Linearizing Models for Efficient yet Robust Private Inference
Linearizing Models for Efficient yet Robust Private Inference
Sreetama Sarkar
Souvik Kundu
P. Beerel
AAML
13
0
0
08 Feb 2024
Optimizing for ROC Curves on Class-Imbalanced Data by Training over a
  Family of Loss Functions
Optimizing for ROC Curves on Class-Imbalanced Data by Training over a Family of Loss Functions
Kelsey Lieberman
Shuai Yuan
Swarna Kamlam Ravindran
Carlo Tomasi
29
0
0
08 Feb 2024
NITO: Neural Implicit Fields for Resolution-free Topology Optimization
NITO: Neural Implicit Fields for Resolution-free Topology Optimization
A. Nobari
Giorgio Giannone
Lyle Regenwetter
Faez Ahmed
AI4CE
43
3
0
07 Feb 2024
Fast Timing-Conditioned Latent Audio Diffusion
Fast Timing-Conditioned Latent Audio Diffusion
Zach Evans
CJ Carr
Josiah Taylor
Scott H. Hawley
Jordi Pons
DiffM
82
102
0
07 Feb 2024
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with
  Semantic Graph Prior
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin
Yadong Mu
3DV
22
32
0
07 Feb 2024
Bidirectional Autoregressive Diffusion Model for Dance Generation
Bidirectional Autoregressive Diffusion Model for Dance Generation
Canyu Zhang
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Mei Han
Jing Xiao
Song Wang
33
7
0
06 Feb 2024
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced
  Auditory Experience
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
KELM
34
5
0
06 Feb 2024
CLIP Can Understand Depth
CLIP Can Understand Depth
Dunam Kim
Seokju Lee
VLM
MDE
51
2
0
05 Feb 2024
ViewFusion: Learning Composable Diffusion Models for Novel View
  Synthesis
ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis
Bernard Spiegl
Andrea Perin
Stéphane Deny
Alexander Ilin
DiffM
16
2
0
05 Feb 2024
Using Motion Cues to Supervise Single-Frame Body Pose and Shape
  Estimation in Low Data Regimes
Using Motion Cues to Supervise Single-Frame Body Pose and Shape Estimation in Low Data Regimes
Andrey Davydov
Alexey Sidnev
A. Sanakoyeu
Yuhua Chen
Mathieu Salzmann
Pascal Fua
3DH
26
0
0
05 Feb 2024
DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation
DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation
Yuan Gao
Haokun Chen
Xiang Wang
Zhicai Wang
Xue Wang
Jinyang Gao
Bolin Ding
45
6
0
05 Feb 2024
Spatial-Temporal Activity-Informed Diarization and Separation
Spatial-Temporal Activity-Informed Diarization and Separation
Yicheng Hsu
Ssuhan Chen
Mingsian R. Bai
21
0
0
30 Jan 2024
GAPS: Geometry-Aware Problem Solver
GAPS: Geometry-Aware Problem Solver
Jiaxin Zhang
Yinghui Jiang
Yashar Moshfeghi
AIMat
AI4CE
22
3
0
29 Jan 2024
Few and Fewer: Learning Better from Few Examples Using Fewer Base
  Classes
Few and Fewer: Learning Better from Few Examples Using Fewer Base Classes
Raphael Lafargue
Yassir Bendou
Bastien Pasdeloup
J. Diguet
Ian Reid
Vincent Gripon
Jack Valmadre
39
0
0
29 Jan 2024
MResT: Multi-Resolution Sensing for Real-Time Control with
  Vision-Language Models
MResT: Multi-Resolution Sensing for Real-Time Control with Vision-Language Models
Saumya Saxena
Mohit Sharma
Oliver Kroemer
34
4
0
25 Jan 2024
Previous
123...789...252627
Next