FiLM: Visual Reasoning with a General Conditioning Layer

22 September 2017
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
    FAtt
    AIMat
    OffRL
    AI4CE

Papers citing "FiLM: Visual Reasoning with a General Conditioning Layer"

50 / 1,313 papers shown
Title
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner
Weiyu Li
Jiarui Liu
Rui Chen
Yixun Liang
Xuelin Chen
Ping Tan
Xiaoxiao Long
DiffM
41
49
0
23 May 2024
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu
Aojun Zhou
Ziyi Lin
Qi Liu
Yuhui Xu
Renrui Zhang
Yafei Wen
Shuai Ren
Peng Gao
Junchi Yan
MQ
55
2
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
43
0
23 May 2024
Rehearsal-free Federated Domain-incremental Learning
Rui Sun
Haoran Duan
Jiahua Dong
Varun Ojha
Tejal Shah
R. Ranjan
CLL
43
1
0
22 May 2024
Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction
Maciej Kilian
Varun Jampani
Luke Zettlemoyer
DiffM
32
8
0
21 May 2024
ASMR: Activation-sharing Multi-resolution Coordinate Networks For Efficient Inference
Jason Chun Lok Li
Steven Tin Sui Luo
Le Xu
Ngai Wong
30
4
0
20 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
69
357
0
20 May 2024
Discrete-state Continuous-time Diffusion for Graph Generation
Zhe Xu
Ruizhong Qiu
Yuzhong Chen
Huiyuan Chen
Xiran Fan
Menghai Pan
Zhichen Zeng
Mahashweta Das
Hanghang Tong
46
10
0
19 May 2024
CoLay: Controllable Layout Generation through Multi-conditional Latent Diffusion
Chin-Yi Cheng
Ruiqi Gao
Forrest Huang
Yang Li
DiffM
36
2
0
18 May 2024
Natural Language Can Help Bridge the Sim2Real Gap
Albert Yu
Adeline Foote
Raymond J. Mooney
Roberto Martín-Martín
LM&Ro
51
11
0
16 May 2024
Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution
Xingjian Wang
Li Chai
Jiming Chen
34
1
0
16 May 2024
A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes
Yicheng Hsu
Mingsian R. Bai
29
1
0
14 May 2024
Improving Multimodal Learning with Multi-Loss Gradient Modulation
Konstantinos Kontras
Christos Chatzichristos
Matthew Blaschko
M. D. Vos
32
3
0
13 May 2024
CaFA: Global Weather Forecasting with Factorized Attention on Sphere
Zijie Li
Anthony Y. Zhou
Saurabh Patil
A. Farimani
45
6
0
12 May 2024
Interpretable Multi-task Learning with Shared Variable Embeddings
Maciej Żelaszczyk
Jacek Mańdziuk
29
0
0
10 May 2024
Temporal and Heterogeneous Graph Neural Network for Remaining Useful Life Prediction
Zhihao Wen
Yuan Fang
Pengcheng Wei
Fayao Liu
Zhenghua Chen
Min-man Wu
AI4CE
30
2
0
07 May 2024
Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model
Joo Young Choi
Jaesung R. Park
Inkyu Park
Jaewoong Cho
Albert No
Ernest K. Ryu
AI4CE
35
4
0
07 May 2024
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
46
2
0
06 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
87
38
0
06 May 2024
Matten: Video Generation with Mamba-Attention
Yu Gao
Jiancheng Huang
Xiaopeng Sun
Zequn Jie
Yujie Zhong
Lin Ma
72
12
0
05 May 2024
Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning
F. Sarfraz
Bahram Zonooz
Elahe Arani
CLL
34
2
0
04 May 2024
CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning
Kota Kondo
Andrea Tagliabue
Xiaoyi Cai
Claudius T. Tewari
Olivia Garcia
Marcos Espitia-Alvarez
Jonathan P. How
46
5
0
02 May 2024
TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms
Yueyuan Sui
Minghui Zhao
Junxi Xia
Xiaofan Jiang
S. Xia
Mamba
47
11
0
02 May 2024
Any-Quantile Probabilistic Forecasting of Short-Term Electricity Demand
Slawek Smyl
Boris N. Oreshkin
Paweł Pełka
Grzegorz Dudek
AI4TS
40
0
0
26 Apr 2024
Latent Modulated Function for Computational Optimal Continuous Image Representation
Zongyao He
Zhi Jin
SupR
29
11
0
25 Apr 2024
Leveraging Large Language Models for Multimodal Search
Oriol Barbany
Michael Huang
Xinliang Zhu
Arnab Dhua
31
9
0
24 Apr 2024
Audio Anti-Spoofing Detection: A Survey
Menglu Li
Yasaman Ahmadiadli
Xiao-Ping Zhang
48
17
0
22 Apr 2024
TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
35
0
0
19 Apr 2024
MCM: Multi-condition Motion Synthesis Framework
Zeyu Ling
Bo Han
Yongkang Wang
Han Lin
Mohan Kankanhalli
Weidong Geng
43
1
0
19 Apr 2024
Retrieval-Augmented Embodied Agents
Yichen Zhu
Zhicai Ou
Xiaofeng Mou
Jian Tang
51
17
0
17 Apr 2024
Neural Shrödinger Bridge Matching for Pansharpening
Zihan Cao
Xiao Wu
Liang-Jian Deng
DiffM
61
2
0
17 Apr 2024
FoundationGrasp: Generalizable Task-Oriented Grasping with Foundation Models
Chao Tang
Dehao Huang
Wenlong Dong
Ruinian Xu
Hong Zhang
36
11
0
16 Apr 2024
Find The Gap: Knowledge Base Reasoning For Visual Question Answering
Elham J. Barezi
Parisa Kordjamshidi
34
0
0
16 Apr 2024
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation
Yu-Ju Tsai
Jin-Cheng Jhang
Jingjing Zheng
Wei Wang
Albert Y. C. Chen
Min Sun
Cheng-Hao Kuo
Ming-Hsuan Yang
3DV
33
4
0
15 Apr 2024
RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion
Guoxuan Chi
Zheng Yang
Chenshu Wu
Jingao Xu
Yuchong Gao
Yunhao Liu
Tony Xiao Han
DiffM
54
29
0
14 Apr 2024
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts
Övgü Özdemir
Erdem Akagündüz
44
10
0
12 Apr 2024
Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks
Yang Yang
Hongpeng Pan
Qingjun Jiang
Yi Tian Xu
Jinghui Tang
29
4
0
12 Apr 2024
Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models
Taegyun Kwon
Dasaem Jeong
Juhan Nam
19
2
0
10 Apr 2024
Domain Generalisation via Imprecise Learning
Anurag Singh
Siu Lun Chau
S. Bouabid
Krikamol Muandet
AI4CE
OOD
38
5
0
06 Apr 2024
Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Junshi Huang
40
24
0
06 Apr 2024
Implicit Assimilation of Sparse In Situ Data for Dense & Global Storm Surge Forecasting
Patrick Ebel
Brandon Victor
Peter Naylor
Gabriele Meoni
Federico Serva
Rochelle Schneider
35
0
0
05 Apr 2024
Joint-Task Regularization for Partially Labeled Multi-Task Learning
Kento Nishi
Junsik Kim
Wanhua Li
Hanspeter Pfister
43
1
0
02 Apr 2024
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
Xu He
Qiaochu Huang
Zhensong Zhang
Zhiwei Lin
Zhiyong Wu
Sicheng Yang
Minglei Li
Zhiyi Chen
Songcen Xu
Xiaofei Wu
35
15
0
02 Apr 2024
Video Interpolation with Diffusion Models
Siddhant Jain
Daniel Watson
Eric Tabellion
Aleksander Hołyński
Ben Poole
Janne Kontkanen
SupR
VGen
DiffM
44
33
0
01 Apr 2024
Condition-Aware Neural Network for Controlled Image Generation
Han Cai
Muyang Li
Zhuoyang Zhang
Qinsheng Zhang
Ming-Yu Liu
Song Han
DiffM
22
8
0
01 Apr 2024
SineNet: Learning Temporal Dynamics in Time-Dependent Partial Differential Equations
Xuan Zhang
Jacob Helwig
Yu-Ching Lin
Yaochen Xie
Cong Fu
Stephan Wojtowytsch
Shuiwang Ji
AI4CE
37
6
0
28 Mar 2024
Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos
Akshay Paruchuri
S. Ehrenstein
Shuxian Wang
Inbar Fried
Stephen M. Pizer
Marc Niethammer
Roni Sengupta
MDE
48
6
0
26 Mar 2024
ReMamber: Referring Image Segmentation with Mamba Twister
Yu-Hao Yang
Chaofan Ma
Jiangchao Yao
Zhun Zhong
Ya Zhang
Yanfeng Wang
Mamba
58
20
0
26 Mar 2024
Diffusion-based Negative Sampling on Graphs for Link Prediction
Trung-Kien Nguyen
Yuan Fang
DiffM
25
11
0
25 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
50
24
0
25 Mar 2024