ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.07871
  4. Cited By
FiLM: Visual Reasoning with a General Conditioning Layer
v1v2 (latest)

FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAttAIMatOffRLAI4CE
ArXiv (abs)PDFHTML

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,349 papers shown
Title
Discriminative Sample-Guided and Parameter-Efficient Feature Space
  Adaptation for Cross-Domain Few-Shot Learning
Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning
Rashindrie Perera
Saman K. Halgamuge
99
2
0
07 Mar 2024
Online Adaptation of Language Models with a Memory of Amortized Contexts
Online Adaptation of Language Models with a Memory of Amortized Contexts
Jihoon Tack
Jaehyung Kim
Eric Mitchell
Jinwoo Shin
Yee Whye Teh
Jonathan Richard Schwarz
KELM
99
20
0
07 Mar 2024
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
Ge Yan
Yueh-hua Wu
Xiaolong Wang
VGen
120
22
0
07 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
99
26
0
05 Mar 2024
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by
  Magnitude Conditioning
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning
Kuan-Hsun Ho
J. Hung
Berlin Chen
66
0
0
04 Mar 2024
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
Niccolò Cavagnero
Gabriele Rosi
Claudia Cuttano
Francesca Pistilli
Marco Ciccone
Giuseppe Averta
Fabio Cermelli
106
23
0
29 Feb 2024
Boosting Neural Representations for Videos with a Conditional Decoder
Boosting Neural Representations for Videos with a Conditional Decoder
Xinjie Zhang
Ren Yang
Dailan He
Xingtong Ge
Tongda Xu
Yan Wang
Hongwei Qin
Jun Zhang
107
17
0
28 Feb 2024
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization
Yoshiki Masuyama
Gordon Wichern
François Germain
Zexu Pan
Sameer Khurana
Chiori Hori
Jonathan Le Roux
74
3
0
27 Feb 2024
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence
  Generation
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation
Zongying Lin
Hao Li
Liuzhenghao Lv
Lin Bin
Junwu Zhang
Calvin Yu-Chian Chwn
Li Yuan
Tian Yonghong
82
3
0
27 Feb 2024
Achievable Fairness on Your Data With Utility Guarantees
Achievable Fairness on Your Data With Utility Guarantees
Muhammad Faaiz Taufiq
Jean-François Ton
Yang Liu
57
1
0
27 Feb 2024
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Benjamin Bergner
Andrii Skliar
Amelie Royer
Tijmen Blankevoort
Yuki Markus Asano
B. Bejnordi
127
7
0
26 Feb 2024
Closing the AI generalization gap by adjusting for dermatology condition
  distribution differences across clinical settings
Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings
R. Rikhye
Aaron Loh
G. Hong
Preeti Singh
M. A. Smith
...
P. Bui
Yuan Liu
Yun-Hui Liu
Justin M. Ko
Steven Lin
43
1
0
23 Feb 2024
Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis
  with Diffusion Models
Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models
Shunyu Liu
Jie Zhou
Qunxi Zhu
Qin Chen
Qingchun Bai
Junhua Xiao
Liang He
57
2
0
23 Feb 2024
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding
  Decomposition
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
Rendi Chevi
Alham Fikri Aji
108
3
0
22 Feb 2024
ISCUTE: Instance Segmentation of Cables Using Text Embedding
ISCUTE: Instance Segmentation of Cables Using Text Embedding
Shir Kozlovsky
O. Joglekar
Dotan Di Castro
91
2
0
19 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
119
127
0
16 Feb 2024
PRISE: LLM-Style Sequence Compression for Learning Temporal Action
  Abstractions in Control
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
Ruijie Zheng
Ching-An Cheng
Hal Daumé
Furong Huang
Andrey Kolobov
85
12
0
16 Feb 2024
TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial
  Network for early-to-late frame conversion in dynamic cardiac PET inter-frame
  motion correction
TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction
Xueqi Guo
Luyao Shi
Xiongchao Chen
Qiong Liu
Bo Zhou
...
Albert J Sinusas
Lawrence H. Staib
Bruce Spottiswoode
Chi Liu
Nicha Dvornek
MedIm
42
1
0
14 Feb 2024
Learning by Watching: A Review of Video-based Learning Approaches for
  Robot Manipulation
Learning by Watching: A Review of Video-based Learning Approaches for Robot Manipulation
Chrisantus Eze
Christopher Crick
SSL
121
13
0
11 Feb 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask
  Representation via Temporal Action-Driven Contrastive Loss
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Shuang Ma
Hal Daumé
Huazhe Xu
John Langford
Praveen Palanisamy
Kalyan Shankar Basu
Furong Huang
107
8
0
09 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRLVLMLM&Ro
116
54
0
08 Feb 2024
Linearizing Models for Efficient yet Robust Private Inference
Linearizing Models for Efficient yet Robust Private Inference
Sreetama Sarkar
Souvik Kundu
Peter A. Beerel
AAML
55
0
0
08 Feb 2024
Optimizing for ROC Curves on Class-Imbalanced Data by Training over a
  Family of Loss Functions
Optimizing for ROC Curves on Class-Imbalanced Data by Training over a Family of Loss Functions
Kelsey Lieberman
Shuai Yuan
Swarna Kamlam Ravindran
Carlo Tomasi
60
0
0
08 Feb 2024
NITO: Neural Implicit Fields for Resolution-free Topology Optimization
NITO: Neural Implicit Fields for Resolution-free Topology Optimization
Amin Heyrani Nobari
Giorgio Giannone
Lyle Regenwetter
Faez Ahmed
AI4CE
95
3
0
07 Feb 2024
Fast Timing-Conditioned Latent Audio Diffusion
Fast Timing-Conditioned Latent Audio Diffusion
Zach Evans
CJ Carr
Josiah Taylor
Scott H. Hawley
Jordi Pons
DiffM
142
117
0
07 Feb 2024
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with
  Semantic Graph Prior
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin
Yadong Mu
3DV
70
40
0
07 Feb 2024
Bidirectional Autoregressive Diffusion Model for Dance Generation
Bidirectional Autoregressive Diffusion Model for Dance Generation
Canyu Zhang
Youbao Tang
Ning Zhang
Ruei-Sung Lin
Mei Han
Jing Xiao
Song Wang
78
9
0
06 Feb 2024
Listen, Chat, and Remix: Text-Guided Soundscape Remixing for Enhanced Auditory Experience
Listen, Chat, and Remix: Text-Guided Soundscape Remixing for Enhanced Auditory Experience
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
KELM
91
5
0
06 Feb 2024
CLIP Can Understand Depth
CLIP Can Understand Depth
Dunam Kim
Seokju Lee
VLMMDE
121
2
0
05 Feb 2024
Using Motion Cues to Supervise Single-Frame Body Pose and Shape
  Estimation in Low Data Regimes
Using Motion Cues to Supervise Single-Frame Body Pose and Shape Estimation in Low Data Regimes
Andrey Davydov
Alexey Sidnev
A. Sanakoyeu
Yuhua Chen
Mathieu Salzmann
Pascal Fua
3DH
65
0
0
05 Feb 2024
DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation
DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation
Yuan Gao
Haokun Chen
Xiang Wang
Zhicai Wang
Xue Wang
Jinyang Gao
Bolin Ding
76
6
0
05 Feb 2024
ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis
ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis
Bernard Spiegl
Andrea Perin
Stéphane Deny
Alexander Ilin
DiffM
101
2
0
05 Feb 2024
Few-Shot Learning on Graphs: from Meta-learning to Pre-training and
  Prompting
Few-Shot Learning on Graphs: from Meta-learning to Pre-training and Prompting
Xingtong Yu
Yuan Fang
Zemin Liu
Yuxia Wu
Zhihao Wen
Jianyuan Bo
Xinming Zhang
Steven C. H. Hoi
VLM
74
5
0
02 Feb 2024
Spatial-Temporal Activity-Informed Diarization and Separation
Spatial-Temporal Activity-Informed Diarization and Separation
Yicheng Hsu
Ssuhan Chen
Mingsian R. Bai
53
0
0
30 Jan 2024
GAPS: Geometry-Aware Problem Solver
GAPS: Geometry-Aware Problem Solver
Jiaxin Zhang
Yinghui Jiang
Yashar Moshfeghi
AIMatAI4CE
36
3
0
29 Jan 2024
Few and Fewer: Learning Better from Few Examples Using Fewer Base
  Classes
Few and Fewer: Learning Better from Few Examples Using Fewer Base Classes
Raphael Lafargue
Yassir Bendou
Bastien Pasdeloup
J. Diguet
Ian Reid
Vincent Gripon
Jack Valmadre
99
0
0
29 Jan 2024
MResT: Multi-Resolution Sensing for Real-Time Control with
  Vision-Language Models
MResT: Multi-Resolution Sensing for Real-Time Control with Vision-Language Models
Saumya Saxena
Mohit Sharma
Oliver Kroemer
85
4
0
25 Jan 2024
Diverse and Lifespan Facial Age Transformation Synthesis with Identity
  Variation Rationality Metric
Diverse and Lifespan Facial Age Transformation Synthesis with Identity Variation Rationality Metric
Jiucheng Xie
Jun Yang
Wenqing Wang
Feng Xu
Jiang Xiong
Hao Gao
70
2
0
25 Jan 2024
Dynamic Long-Term Time-Series Forecasting via Meta Transformer Networks
Dynamic Long-Term Time-Series Forecasting via Meta Transformer Networks
M. A. Ma'sum
MD Rasel Sarkar
Mahardhika Pratama
Savitha Ramasamy
S. Anavatti
Lin Liu
Habibullah Habibullah
Ryszard Kowalczyk
AI4TS
65
0
0
25 Jan 2024
Boosting Unknown-number Speaker Separation with Transformer
  Decoder-based Attractor
Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor
Younglo Lee
Shukjae Choi
Byeonghak Kim
Zhong-Qiu Wang
Shinji Watanabe
MoE
60
10
0
23 Jan 2024
MINT: A wrapper to make multi-modal and multi-image AI models
  interactive
MINT: A wrapper to make multi-modal and multi-image AI models interactive
Jan Freyberg
Abhijit Guha Roy
Terry Spitz
Beverly Freeman
M. Schaekermann
...
D. Webster
Alan Karthikesalingam
Yun-Hui Liu
Krishnamurthy Dvijotham
Umesh Telang
75
1
0
22 Jan 2024
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for
  Temporal-Event-Guided Foley Sound Synthesis
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis
Yoonjin Chung
Junwon Lee
Juhan Nam
99
15
0
17 Jan 2024
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven
  Policy Learning
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning
Sabariswaran Mani
Sreyas Venkataraman
Abhranil Chandra
Adyan Rizvi
Yash Sirvi
Soumojit Bhattacharya
Aritra Hazra
OffRL
93
1
0
17 Jan 2024
FMB: a Functional Manipulation Benchmark for Generalizable Robotic
  Learning
FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning
Jianlan Luo
Charles Xu
Fangchen Liu
Liam Tan
Zipeng Lin
Jeffrey Wu
Pieter Abbeel
Sergey Levine
85
29
0
16 Jan 2024
Multi-task real-robot data with gaze attention for dual-arm fine
  manipulation
Multi-task real-robot data with gaze attention for dual-arm fine manipulation
Heecheol Kim
Yoshiyuki Ohmura
Yasuo Kuniyoshi
124
2
0
15 Jan 2024
Generalizing Visual Question Answering from Synthetic to Human-Written
  Questions via a Chain of QA with a Large Language Model
Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model
Taehee Kim
Yeongjae Cho
Heejun Shin
Yohan Jo
Dongmyung Shin
103
4
0
12 Jan 2024
Neural Population Learning beyond Symmetric Zero-sum Games
Neural Population Learning beyond Symmetric Zero-sum Games
Siqi Liu
Luke Marris
Marc Lanctot
Georgios Piliouras
Joel Z Leibo
N. Heess
MLT
95
3
0
10 Jan 2024
Language-Conditioned Robotic Manipulation with Fast and Slow Thinking
Language-Conditioned Robotic Manipulation with Fast and Slow Thinking
Minjie Zhu
Yichen Zhu
Jinming Li
Junjie Wen
Zhiyuan Xu
...
Yaxin Peng
Chaomin Shen
Dong Liu
Feifei Feng
Jian Tang
LM&Ro
72
15
0
08 Jan 2024
StreamVC: Real-Time Low-Latency Voice Conversion
StreamVC: Real-Time Low-Latency Voice Conversion
Yang Yang
Y. Kartynnik
Yunpeng Li
Jiuqiang Tang
Xing Li
George Sung
Matthias Grundmann
110
15
0
05 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffMVGen
291
279
0
05 Jan 2024
Previous
123...8910...252627
Next