Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.00812
Cited By
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
4 September 2018
Semih Yagcioglu
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
CoGe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes"
26 / 26 papers shown
Title
Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization
Iñigo Pikabea
Iñaki Lacunza
Oriol Pareras
Carlos Escolano
Aitor Gonzalez-Agirre
Javier Hernando
Marta Villegas
VLM
113
0
0
28 Mar 2025
The NarrativeQA Reading Comprehension Challenge
Tomás Kociský
Jonathan Richard Schwarz
Phil Blunsom
Chris Dyer
Karl Moritz Hermann
Gábor Melis
Edward Grefenstette
93
759
0
19 Dec 2017
Simulating Action Dynamics with Neural Process Networks
Antoine Bosselut
Omer Levy
Ari Holtzman
C. Ennis
Dieter Fox
Yejin Choi
MILM
AI4CE
56
120
0
14 Nov 2017
FigureQA: An Annotated Figure Dataset for Visual Reasoning
Samira Ebrahimi Kahou
Vincent Michalski
Adam Atkinson
Ákos Kádár
Adam Trischler
Yoshua Bengio
ReLM
AIMat
43
317
0
19 Oct 2017
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
168
2,576
0
09 May 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
271
2,346
0
20 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
285
3,187
0
02 Dec 2016
NewsQA: A Machine Comprehension Dataset
Adam Trischler
Tong Wang
Xingdi Yuan
Justin Harris
Alessandro Sordoni
Philip Bachman
Kaheer Suleman
68
891
0
29 Nov 2016
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
Payal Bajaj
Daniel Fernando Campos
Nick Craswell
Li Deng
Jianfeng Gao
...
Mir Rosenberg
Xia Song
Alina Stoica
Saurabh Tiwary
Tong Wang
RALM
108
2,698
0
28 Nov 2016
The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives
Mohit Iyyer
Varun Manjunatha
Anupam Guha
Yogarshi Vyas
Jordan L. Boyd-Graber
Hal Daumé
L. Davis
47
95
0
16 Nov 2016
WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia
D. Hewlett
Alexandre Lacoste
Llion Jones
Illia Polosukhin
Andrew Fandrianto
Jay Han
Matthew Kelcey
David Berthelot
RALM
55
138
0
11 Aug 2016
Sort Story: Sorting Jumbled Images and Captions into Stories
Harsh Agrawal
Arjun Chandrasekaran
Dhruv Batra
Devi Parikh
Joey Tianyi Zhou
36
60
0
23 Jun 2016
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
144
8,067
0
16 Jun 2016
A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task
Danqi Chen
Jason Bolton
Christopher D. Manning
ELM
50
566
0
09 Jun 2016
A Diagram Is Worth A Dozen Images
Aniruddha Kembhavi
M. Salvato
Eric Kolve
Minjoon Seo
Hannaneh Hajishirzi
Ali Farhadi
3DV
33
456
0
24 Mar 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.3K
192,638
0
10 Dec 2015
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
87
736
0
09 Dec 2015
The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations
Felix Hill
Antoine Bordes
S. Chopra
Jason Weston
RALM
76
633
0
07 Nov 2015
Unsupervised Semantic Parsing of Video Collections
Ozan Sener
Amir Zamir
Silvio Savarese
Ashutosh Saxena
42
98
0
28 Jun 2015
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
272
3,527
0
10 Jun 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
143
5,421
0
03 May 2015
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision
J. Malmaud
Jonathan Huang
V. Rathod
Nick Johnston
Andrew Rabinovich
Kevin Patrick Murphy
51
152
0
05 Mar 2015
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
1.1K
39,383
0
01 Sep 2014
Distributed Representations of Sentences and Documents
Quoc V. Le
Tomas Mikolov
FaML
172
9,231
0
16 May 2014
Natural Language Processing (almost) from Scratch
R. Collobert
Jason Weston
Léon Bottou
Michael Karlen
Koray Kavukcuoglu
Pavel P. Kuksa
123
7,711
0
02 Mar 2011
From Machine Learning to Machine Reasoning
Léon Bottou
LRM
ReLM
NAI
65
284
0
09 Feb 2011
1