Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.10139
Cited By
Towards Solving Multimodal Comprehension
20 April 2021
Pritish Sahu
Karan Sikka
Ajay Divakaran
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Towards Solving Multimodal Comprehension"
21 / 21 papers shown
Title
One missing piece in Vision and Language: A Survey on Comics Understanding
Emanuele Vivoli
Andrey Barsky
Mohamed Ali Souibgui
Artemis LLabres
Marco Bertini
Dimosthenis Karatzas
104
5
0
14 Sep 2024
End-to-End Learning of Visual Representations from Uncurated Instructional Videos
Antoine Miech
Jean-Baptiste Alayrac
Lucas Smaira
Ivan Laptev
Josef Sivic
Andrew Zisserman
VGen
SSL
135
713
0
13 Dec 2019
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense Reasoning
Lifu Huang
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
AIMat
RALM
LRM
118
459
0
31 Aug 2019
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
1.3K
12,316
0
27 Aug 2019
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
184
883
0
27 Nov 2018
The NarrativeQA Reading Comprehension Challenge
Tomás Kociský
Jonathan Richard Schwarz
Phil Blunsom
Chris Dyer
Karl Moritz Hermann
Gábor Melis
Edward Grefenstette
142
786
0
19 Dec 2017
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
101
1,542
0
13 Jun 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
319
2,391
0
20 Dec 2016
NewsQA: A Machine Comprehension Dataset
Adam Trischler
Tong Wang
Xingdi Yuan
Justin Harris
Alessandro Sordoni
Philip Bachman
Kaheer Suleman
112
893
0
29 Nov 2016
WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia
D. Hewlett
Alexandre Lacoste
Llion Jones
Illia Polosukhin
Andrew Fandrianto
Jay Han
Matthew Kelcey
David Berthelot
RALM
82
139
0
11 Aug 2016
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
316
8,177
0
16 Jun 2016
A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task
Danqi Chen
Jason Bolton
Christopher D. Manning
ELM
85
571
0
09 Jun 2016
Adversarial Feature Learning
Jiasen Lu
Philipp Krahenbuhl
Trevor Darrell
GAN
142
1,612
0
31 May 2016
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
120
752
0
09 Dec 2015
The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations
Felix Hill
Antoine Bordes
S. Chopra
Jason Weston
RALM
122
638
0
07 Nov 2015
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
355
3,555
0
10 Jun 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wenyuan Xu
84
501
0
21 May 2015
Exploring Models and Data for Image Question Answering
Mengye Ren
Ryan Kiros
R. Zemel
80
719
0
08 May 2015
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
110
600
0
05 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
233
5,509
0
03 May 2015
Memory Networks
Jason Weston
S. Chopra
Antoine Bordes
GNN
KELM
154
1,709
0
15 Oct 2014
1