Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.02799
Cited By
v1
v2
v3
v4 (latest)
Neural Module Networks
9 November 2015
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Module Networks"
50 / 634 papers shown
Title
Vision Generalist Model: A Survey
Ziyi Wang
Yongming Rao
Shuofeng Sun
Xinrun Liu
Yi Wei
...
Zuyan Liu
Yanbo Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
65
0
0
11 Jun 2025
Language-Vision Planner and Executor for Text-to-Visual Reasoning
Yichang Xu
Gaowen Liu
Ramana Rao Kompella
Sihao Hu
Tiansheng Huang
Fatih Ilhan
Selim Furkan Tekin
Zachary Yahn
Ling Liu
LRM
VLM
20
0
0
09 Jun 2025
Collaborative Learning in Agentic Systems: A Collective AI is Greater Than the Sum of Its Parts
Saptarshi Nath
Christos Peridis
Eseoghene Benjamin
Xinran Liu
Soheil Kolouri
Peter Kinnell
Zexin Li
Cong Liu
Shirin Dora
Andrea Soltoggio
30
0
0
05 Jun 2025
CIVET: Systematic Evaluation of Understanding in VLMs
Massimo Rizzoli
Simone Alghisi
Olha Khomyn
Gabriel Roccabruna
Seyed Mahed Mousavi
Giuseppe Riccardi
161
0
0
05 Jun 2025
Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes?
Yang Yao
Lingyu Li
Jiaxin Song
Chiyu Chen
Zhenqi He
...
Xin Wang
Tianle Gu
Jie Li
Yan Teng
Yingchun Wang
LRM
10
0
0
03 Jun 2025
SemIRNet: A Semantic Irony Recognition Network for Multimodal Sarcasm Detection
Jingxuan Zhou
Yuehao Wu
Yibo Zhang
Yeyubei Zhang
Yunchong Liu
Bolin Huang
Chunhong Yuan
12
0
0
28 May 2025
The Coverage Principle: A Framework for Understanding Compositional Generalization
Hoyeon Chang
Jinho Park
Hanseul Cho
Sohee Yang
Miyoung Ko
Hyeonbin Hwang
Seungpil Won
Dohaeng Lee
Youbin Ahn
Minjoon Seo
59
0
0
26 May 2025
Understanding Complexity in VideoQA via Visual Program Generation
Cristobal Eyzaguirre
Igor Vasiljevic
Achal Dave
Jiajun Wu
Rares Andrei Ambrus
Thomas Kollar
Juan Carlos Niebles
P. Tokmakov
73
0
0
19 May 2025
Neuro-Symbolic Concepts
Jiayuan Mao
Joshua B. Tenenbaum
Jiajun Wu
NAI
110
0
0
09 May 2025
A Theoretical Analysis of Compositional Generalization in Neural Networks: A Necessary and Sufficient Condition
Yuanpeng Li
CoGe
445
0
0
05 May 2025
Deep Learning with Pretrained Ínternal World' Layers: A Gemma 3-Based Modular Architecture for Wildfire Prediction
Ayoub Jadouli
Chaker El Amrani
KELM
AI4TS
146
0
0
20 Apr 2025
A Study on Neuro-Symbolic Artificial Intelligence: Healthcare Perspectives
Delower Hossain
Jake Y Chen
NAI
85
1
0
23 Mar 2025
Hybrid Learners Do Not Forget: A Brain-Inspired Neuro-Symbolic Approach to Continual Learning
Amin Banayeeanzade
Mohammad Rostami
CLL
97
0
0
16 Mar 2025
Make Haste Slowly: A Theory of Emergent Structured Mixed Selectivity in Feature Learning ReLU Networks
Devon Jarvis
Richard Klein
Benjamin Rosman
Andrew M. Saxe
MLT
148
2
0
08 Mar 2025
A Theory of Initialisation's Impact on Specialisation
Devon Jarvis
Sebastian Lee
Clémentine Dominé
Andrew M. Saxe
Stefano Sarao Mannelli
CLL
117
2
0
04 Mar 2025
Predicate Hierarchies Improve Few-Shot State Classification
Emily Jin
Joy Hsu
Jiajun Wu
OffRL
146
0
0
18 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
129
4
0
09 Feb 2025
PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures
Shivalika Singh
Nakul Sharma
Manish Gupta
Anand Mishra
143
1
0
28 Jan 2025
PuzzleGPT: Emulating Human Puzzle-Solving Ability for Time and Location Prediction
Hammad A. Ayyubi
Xuande Feng
Junzhang Liu
Xudong Lin
Zhecan Wang
Shih-Fu Chang
72
1
0
24 Jan 2025
Compositional Instruction Following with Language Models and Reinforcement Learning
Vanya Cohen
Geraud Nangue Tasse
N. Gopalan
Steven D. James
Matthew C. Gombolay
Ray Mooney
Benjamin Rosman
107
0
0
21 Jan 2025
Physics of Skill Learning
Ziming Liu
Yizhou Liu
Eric J. Michaud
Jeff Gore
Max Tegmark
119
2
0
21 Jan 2025
Flexible task abstractions emerge in linear networks with fast and bounded units
Kai Sandbrink
Jan P. Bauer
A. Proca
Andrew M. Saxe
Christopher Summerfield
Ali Hummos
121
2
0
17 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
150
0
0
13 Jan 2025
Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation
Kairui Fu
Zheqi Lv
Shengyu Zhang
Fan Wu
Kun Kuang
68
1
0
07 Jan 2025
Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention
Zhenyu Guo
Wenguang Chen
78
0
0
01 Jan 2025
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
282
5
0
31 Dec 2024
Language Model as Visual Explainer
Xingyi Yang
Xinchao Wang
VLM
71
0
0
08 Dec 2024
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Filippo Ziliotto
Tommaso Campari
Luciano Serafini
Lamberto Ballan
LLMAG
LM&Ro
MLLM
LRM
150
2
0
05 Dec 2024
A Comprehensive Survey on Visual Question Answering Datasets and Algorithms
Raihan Kabir
Naznin Haque
Md. Saiful Islam
Marium-E. Jannat
CoGe
85
1
0
17 Nov 2024
Improving DNN Modularization via Activation-Driven Training
Tuan Ngo
Abid Hassan
Saad Shafiq
Nenad Medvidovic
MoMe
72
0
0
01 Nov 2024
SimpsonsVQA: Enhancing Inquiry-Based Learning with a Tailored Dataset
Ngoc Dung Huynh
Mohamed Reda Bouadjenek
Sunil Aryal
Imran Razzak
Hakim Hacid
83
0
0
30 Oct 2024
A Complexity-Based Theory of Compositionality
Eric Elmoznino
Thomas Jiralerspong
Yoshua Bengio
Guillaume Lajoie
CoGe
155
10
0
18 Oct 2024
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
Wenhao Chai
Enxin Song
Y. Du
Chenlin Meng
Vashisht Madhavan
Omer Bar-Tal
Jeng-Neng Hwang
Saining Xie
Christopher D. Manning
3DV
217
37
0
04 Oct 2024
On The Specialization of Neural Modules
Devon Jarvis
Richard Klein
Benjamin Rosman
Andrew M. Saxe
133
14
0
23 Sep 2024
Discovering Object Attributes by Prompting Large Language Models with Perception-Action APIs
A. Mavrogiannis
Dehao Yuan
Yiannis Aloimonos
LM&Ro
80
0
0
23 Sep 2024
Breaking Neural Network Scaling Laws with Modularity
Akhilan Boopathy
Sunshine Jiang
William Yue
Jaedong Hwang
Abhiram Iyer
Ila Fiete
OOD
136
2
0
09 Sep 2024
One-shot Video Imitation via Parameterized Symbolic Abstraction Graphs
Jianren Wang
Kangni Liu
Dingkun Guo
Xian Zhou
Christopher G Atkeson
66
0
0
22 Aug 2024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yanjie Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
123
5
0
05 Aug 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
130
16
0
27 Jul 2024
Gradient-based inference of abstract task representations for generalization in neural networks
Ali Hummos
Felipe del-Rio
Brabeeba Mien Wang
Julio Hurtado
Cristian B. Calderon
G. Yang
71
4
0
24 Jul 2024
Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought
Jue Chen
Yongxin Deng
Xihe Qiu
Weidi Xu
Chao Qu
Wei Chu
Yinghui Xu
Yuan Qi
LRM
AI4CE
LM&Ro
84
3
0
18 Jul 2024
Compositional Models for Estimating Causal Effects
Purva Pruthi
David D. Jensen
CML
225
0
0
25 Jun 2024
UQE: A Query Engine for Unstructured Databases
Hanjun Dai
B. Wang
Xingchen Wan
Bo Dai
Sherry Yang
Azade Nova
Pengcheng Yin
P. Phothilimthana
Charles Sutton
Dale Schuurmans
91
7
0
23 Jun 2024
Arithmetic Reasoning with LLM: Prolog Generation & Permutation
Xiaocheng Yang
Bingsen Chen
Yik-Cheung Tam
LRM
91
12
0
28 May 2024
THREAD: Thinking Deeper with Recursive Spawning
Philip Schroeder
Nathaniel Morgan
Hongyin Luo
James R. Glass
LRM
LLMAG
ReLM
72
1
0
27 May 2024
A Survey of Multimodal Large Language Model from A Data-centric Perspective
Tianyi Bai
Hao Liang
Binwang Wan
Yanran Xu
Xi Li
...
Ping Huang
Jiulong Shan
Conghui He
Binhang Yuan
Wentao Zhang
137
45
0
26 May 2024
When does compositional structure yield compositional generalization? A kernel theory
Samuel Lippl
Kim Stachenfeld
NAI
CoGe
251
10
0
26 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Jacob Russin
Sam Whitman McGrath
Danielle J. Williams
Lotem Elber-Dorozko
AI4CE
189
4
0
24 May 2024
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling
Guangmin Zheng
Jin Wang
Xiaobing Zhou
Xuejie Zhang
LRM
58
2
0
16 May 2024
Interpretability Needs a New Paradigm
Andreas Madsen
Himabindu Lakkaraju
Siva Reddy
Sarath Chandar
72
3
0
08 May 2024
1
2
3
4
...
11
12
13
Next