1511.02799
Cited By
Neural Module Networks
9 November 2015
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
Papers citing
"Neural Module Networks"
50 / 634 papers shown
Title
Visually Grounded Continual Language Learning with Selective Specialization
Kyra Ahrens
Lennart Bengtson
Jae Hee Lee
Stefan Wermter
92
0
0
24 Oct 2023
Cross-Modal Conceptualization in Bottleneck Models
Danis Alukaev
S. Kiselev
Ilya Pershin
Bulat Ibragimov
Vladimir Ivanov
Alexey Kornaev
Ivan Titov
78
7
0
23 Oct 2023
API-Assisted Code Generation for Question Answering on Varied Table Structures
Yihan Cao
Shuyi Chen
Ryan Liu
Zhiruo Wang
Daniel Fried
LMTD
72
14
0
23 Oct 2023
MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model
Le Zhang
Yihong Wu
Fengran Mo
Jian-Yun Nie
Aishwarya Agrawal
MLLM
RALM
77
6
0
20 Oct 2023
Neurosymbolic Grounding for Compositional World Models
Atharva Sehgal
Arya Grayeli
Jennifer J. Sun
Swarat Chaudhuri
89
6
0
19 Oct 2023
Instilling Inductive Biases with Subnetworks
Enyan Zhang
Michael A. Lepori
Ellie Pavlick
AI4CE
78
5
0
17 Oct 2023
Neural Relational Inference with Fast Modular Meta-learning
Ferran Alet
Erica Weng
Tomás Lozano Pérez
L. Kaelbling
135
57
0
10 Oct 2023
NEUCORE: Neural Concept Reasoning for Composed Image Retrieval
Shu Zhao
Huijuan Xu
55
6
0
02 Oct 2023
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
102
3
0
02 Oct 2023
Compositional Program Generation for Few-Shot Systematic Generalization
Tim Klinger
Luke Liu
Soham Dan
A. Rezaee
Parikshit Ram
Ali Movaghar
NAI
73
3
0
28 Sep 2023
D3: Data Diversity Design for Systematic Generalization in Visual Question Answering
Amir Rahimi
Vanessa D’Amario
Moyuru Yamada
Kentaro Takemoto
Tomotake Sasaki
Xavier Boix
66
1
0
15 Sep 2023
Dynamic MOdularized Reasoning for Compositional Structured Explanation Generation
Xiyan Fu
Anette Frank
LRM
69
1
0
14 Sep 2023
Neurons in Large Language Models: Dead, N-gram, Positional
Elena Voita
Javier Ferrando
Christoforos Nalmpantis
MILM
164
56
0
09 Sep 2023
A Survey on Interpretable Cross-modal Reasoning
Dizhan Xue
Shengsheng Qian
Zuyi Zhou
Changsheng Xu
LRM
105
4
0
05 Sep 2023
Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints
Wenxing Xu
Yuanchun Li
Jiacheng Liu
Yiyou Sun
Zhengyang Cao
Yixuan Li
Hao Wen
Yunxin Liu
96
1
0
29 Aug 2023
TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation
Moon Ye-Bin
Jisoo Kim
Hong-Kyu Kim
Kilho Son
Tae-Hyun Oh
77
9
0
27 Jul 2023
Efficient Learning of Discrete-Continuous Computation Graphs
David Friede
Mathias Niepert
53
3
0
26 Jul 2023
Free-Form Composition Networks for Egocentric Action Recognition
Haoran Wang
Qinghua Cheng
Baosheng Yu
Yibing Zhan
Dapeng Tao
Liang Ding
Haibin Ling
EgoV
123
0
0
13 Jul 2023
AVSegFormer: Audio-Visual Segmentation with Transformer
Sheng Gao
Zhe Chen
Guo Chen
Wenhai Wang
Tong Lu
VOS
113
52
0
03 Jul 2023
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering
A. S. Penamakuri
Manish Gupta
Mithun Das Gupta
Anand Mishra
67
7
0
29 Jun 2023
Tell Me Where to Go: A Composable Framework for Context-Aware Embodied Robot Navigation
Harel Biggie
Ajay Narasimha Mopidevi
Dusty Woods
Christoffer Heckman
LM&Ro
67
11
0
15 Jun 2023
Modularity Trumps Invariance for Compositional Robustness
I. Mason
Anirban Sarkar
Tomotake Sasaki
Xavier Boix
OOD
86
1
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
86
7
0
14 Jun 2023
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn
Difei Gao
Lei Ji
Luowei Zhou
Kevin Lin
Joya Chen
Zihan Fan
Mike Zheng Shou
MLLM
96
76
0
14 Jun 2023
Multimodal Explainable Artificial Intelligence: A Comprehensive Review of Methodological Advances and Future Research Directions
N. Rodis
Christos Sardianos
Panagiotis I. Radoglou-Grammatikis
Panagiotis G. Sarigiannidis
Iraklis Varlamis
Georgios Th. Papadopoulos
111
23
0
09 Jun 2023
ModuleFormer: Modularity Emerges from Mixture-of-Experts
Songlin Yang
Zheyu Zhang
Tianyou Cao
Shawn Tan
Zhenfang Chen
Chuang Gan
KELM
MoE
54
10
0
07 Jun 2023
M³IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning
Lei Li
Yuwei Yin
Shicheng Li
Liang Chen
Peiyi Wang
...
Yazheng Yang
Jingjing Xu
Xu Sun
Lingpeng Kong
Qi Liu
MLLM
VLM
96
120
0
07 Jun 2023
Learning Transformer Programs
Dan Friedman
Alexander Wettig
Danqi Chen
89
36
0
01 Jun 2023
Differentiable Tree Operations Promote Compositional Generalization
Paul Soulos
J. E. Hu
Kate McCurdy
Yunmo Chen
Roland Fernandez
P. Smolensky
Jianfeng Gao
AI4CE
54
7
0
01 Jun 2023
Emergent Modularity in Pre-trained Transformers
Zhengyan Zhang
Zhiyuan Zeng
Yankai Lin
Chaojun Xiao
Xiaozhi Wang
Xu Han
Zhiyuan Liu
Ruobing Xie
Maosong Sun
Jie Zhou
MoE
114
25
0
28 May 2023
Modularized Zero-shot VQA with Pre-trained Models
Rui Cao
Jing Jiang
LRM
89
3
0
27 May 2023
Deep Reinforcement Learning with Plasticity Injection
Evgenii Nikishin
Junhyuk Oh
Georg Ostrovski
Clare Lyle
Razvan Pascanu
Will Dabney
André Barreto
OffRL
62
52
0
24 May 2023
Decomposing Complex Queries for Tip-of-the-tongue Retrieval
Kevin Lin
Kyle Lo
Joseph E. Gonzalez
Dan Klein
ReLM
LMTD
49
17
0
24 May 2023
Improved Compositional Generalization by Generating Demonstrations for Meta-Learning
Sam Spilsbury
Alexander Ilin
104
1
0
22 May 2023
Semantic Composition in Visually Grounded Language Models
Rohan Pandey
CoGe
86
1
0
15 May 2023
HPE: Answering Complex Questions over Text by Hybrid Question Parsing and Execution
Ye Liu
Semih Yavuz
Rui Meng
Dragomir R. Radev
Caiming Xiong
Yingbo Zhou
89
10
0
12 May 2023
Overinformative Question Answering by Humans and Machines
Polina Tsvilodub
Michael Franke
Robert D. Hawkins
Noah D. Goodman
42
3
0
11 May 2023
Multimodal Graph Transformer for Multimodal Question Answering
Xuehai He
Xin Eric Wang
88
9
0
30 Apr 2023
Learning to Extrapolate: A Transductive Approach
Aviv Netanyahu
Abhishek Gupta
Max Simchowitz
Kai Zhang
Pulkit Agrawal
100
16
0
27 Apr 2023
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
Pan Lu
Baolin Peng
Hao Cheng
Michel Galley
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Jianfeng Gao
KELM
MLLM
LRM
155
325
0
19 Apr 2023
Scallop: A Language for Neurosymbolic Programming
Ziyang Li
Jiani Huang
Mayur Naik
ReLM
LRM
NAI
98
34
0
10 Apr 2023
MOPA: Modular Object Navigation with PointGoal Agents
Sonia Raychaudhuri
Tommaso Campari
Unnat Jain
Manolis Savva
Angel X. Chang
3DPC
96
8
0
07 Apr 2023
Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions
Jia-Hong Huang
Modar Alfadly
Guohao Li
Marcel Worring
OOD
AAML
87
6
0
06 Apr 2023
ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Jingdong Sun
Teruko Mitamura
Alexander G. Hauptmann
76
22
0
05 Apr 2023
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
93
53
0
23 Mar 2023
Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning
Shi Chen
Qi Zhao
92
6
0
18 Mar 2023
ViperGPT: Visual Inference via Python Execution for Reasoning
Dídac Surís
Sachit Menon
Carl Vondrick
MLLM
LRM
ReLM
136
468
0
14 Mar 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMe
OOD
159
80
0
22 Feb 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
206
16
0
17 Feb 2023
BinaryVQA: A Versatile Test Set to Evaluate the Out-of-Distribution Generalization of VQA Models
Ali Borji
CoGe
45
1
0
28 Jan 2023